Product Ingredient Agent
# Product Ingredient Agent š ### Multimodal Agentic Workflow - [Phidata](https://github.com/phidatahq/phidata/) A Streamlit application that analyzes product ingredients using Gemini AI to provide insights about food and personal care products. ## Features š - **Example Products**: Pre-loaded examples of common products - **Image Upload**: Upload your own product images - **Camera Capture**: Take photos directly through the app - **AI Analysis**: Powered by Google's Gemini 2.0 Flash and Tavily Search - **Ingredient Insights**: Get a detailed analysis of ingredients and their implications ## Demo š„ <Tutorial will be released on YouTube. Meanwhile, watch [Agentic RAG using Phidata](https://www.youtube.com/watch?v=CDC3GOuJyZ0) > ## Installation š 1. Clone the repository: ```bash git clone https://github.com/yourusername/Product-Ingredient-Agent.git cd Product-Ingredient-Agent ``` 2. Create a virtual environment: ```bash python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate ``` 3. Install dependencies: ```bash pip install -r requirements.txt ``` ## Configuration āļø 1. Create a `.env` file in the project root: ```env TAVILY_API_KEY = your_tavily_api_key GOOGLE_API_KEY = your_gemini_api_key ``` 2. Add your example images in the `images/` directory: ``` images/ āāā hide_and_seek.jpg āāā bournvita.jpg āāā lays.jpg āāā shampoo.jpg ``` ## Usage š” 1. Run the Streamlit app: ```bash streamlit run app.py ``` 2. Open your browser and navigate to `http://localhost:8501` 3. Choose one of three options to analyze a product: - Select from example products - Upload your own image - Take a photo using your camera ## Project Structure š ``` product-ingredient-analyzer/ āāā app.py # Main Streamlit application āāā constants.py # System prompts and constants āāā requirements.txt # Project dependencies āāā images/ # Example product images āāā README.md # Project documentat
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.