LLM agent that performs sentiment analysis of drawings and natural language using a combination of the Google Gemini vision model and GPT-4 Turbo with LlamaIndex.
# Google Gemini Mindfulness Counselor

Uses a combination of Google Gemini vision model and GPT-4 Turbo with LlamaIndex to perform sentiment analysis of images and suggest mindfulness exercises.

## Get API keys

Before you can use the Gemini API, you must first obtain an API key. If you don't already have one, create a key with one click in Google AI Studio. You'll also need an OpenAI API key.

Once you have your API keys, create the following Google Colab "secrets" `GOOGLE_API_KEY` and `OPENAI_API_KEY` which will contain your respective keys.

## Generating Embeddings

Embeddings are generated from the PDFs in the `data` folder and stored in the `storage` folder.
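
The key setup and embedding steps described above might look roughly like the following. This is a minimal sketch, not the project's actual code. It assumes `llama-index` 0.10 or later (where imports live under `llama_index.core`), its default OpenAI embedding model, and a PDF reader dependency installed alongside it. The helper names `get_secret`, `has_persisted_index`, and `build_or_load_index` are hypothetical, and the fallback to environment variables is an addition for running outside Colab.

```python
import os
from pathlib import Path


def get_secret(name: str) -> str:
    """Return a key from Colab secrets, falling back to an environment variable."""
    try:
        # google.colab is only importable inside a Colab runtime.
        from google.colab import userdata

        value = userdata.get(name)
    except Exception:
        # Not running in Colab, or the secret is missing or not shared
        # with this notebook.
        value = None
    value = value or os.environ.get(name)
    if not value:
        raise KeyError(f"{name} is not set as a Colab secret or environment variable")
    return value


def has_persisted_index(storage_dir: str = "storage") -> bool:
    """Return True if storage_dir exists and contains at least one entry."""
    path = Path(storage_dir)
    return path.is_dir() and any(path.iterdir())


def build_or_load_index(data_dir: str = "data", storage_dir: str = "storage"):
    """Load a persisted index, or embed the files in data_dir and persist the result."""
    # Imported lazily so the helpers above work without llama-index installed.
    from llama_index.core import (
        SimpleDirectoryReader,
        StorageContext,
        VectorStoreIndex,
        load_index_from_storage,
    )

    if has_persisted_index(storage_dir):
        # Reuse previously generated embeddings instead of recomputing them.
        storage_context = StorageContext.from_defaults(persist_dir=storage_dir)
        return load_index_from_storage(storage_context)

    # Read every supported file (including PDFs) from the data folder,
    # embed the contents, and write the index to the storage folder.
    documents = SimpleDirectoryReader(data_dir).load_data()
    index = VectorStoreIndex.from_documents(documents)
    index.storage_context.persist(persist_dir=storage_dir)
    return index
```

A typical call sequence would set `os.environ["OPENAI_API_KEY"] = get_secret("OPENAI_API_KEY")` before calling `build_or_load_index()`, since the default embedding model reads the key from the environment. Checking for a persisted index first avoids paying for the embeddings again on every run.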
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.