6 agents available in the Gemini directory
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real time through your device camera and microphone. Demonstrates multimodal AI interaction.
Google Drive for AI agents. Store any file and search by meaning across modalities.
The Pydantic Gemini Processor, for use with Gemini's genai-processors library.
GenAI Processors is a lightweight Python library that enables efficient, parallel content processing.
A multimodal live AI assistant designed to enhance the browsing experience using Gemini.
A Streamlit app for interacting with multiple large language models (LLMs) using various media inputs, with options for multimodality, voice responses, chat, summarization, and agent-based tools.