AI agent platform for online meetings, with Google Vertex AI, Datadog, and ElevenLabs integration.
# Gemini Meet: Autonomous Meeting Agents Platform

**Gemini Meet** is an open-source platform for deploying autonomous AI agents into video calls (Google Meet, Zoom, Teams). Unlike simple transcription bots, Gemini Meet agents are **active participants**: they can speak, use tools (via MCP), and reason in real-time using **Google's Gemini** models (or other providers).

[Gemini Meet](https://meetings.juledz.com)

## 🚀 The Platform

The platform consists of two parts:

1. **The Dashboard**: A web interface to manage agents, configure tools (MCP), and view live meeting details.
2. **The Agents**: Ephemeral Docker containers spawned by the dashboard that actually join the meetings.

## ⚡️ Quickstart

The recommended way to use Gemini Meet is through the **Dashboard**.

### Prerequisites

- **Docker** (Must be running)
- **Node.js 18+** & npm
- **PostgreSQL** (Or use the docker-compose in `dashboard/` directory)
- **Google Cloud Credentials** (For Gemini)

### 1. Build Agent Images

The dashboard needs these images to spawn agents. You **must** build them first.

Make sure the `.env` file is present in the root directory.

```bash
cp .env.example .env
```

Fill in the required variables. Then build the images:

```bash
# 1. Build the base image
docker build -f docker/Dockerfile -t ghcr.io/gemini-meet:latest .

# 2. Build the agent image (used by the dashboard)
cd dashboard
docker build -f Dockerfile.agent -t gemini-meet-with-node:latest .
cd ..
```

### 2. Start the Dashboard

Navigate to the dashboard directory and follow the setup there.

```bash
cd dashboard
cp .env.example .env
# Edit .env with your Google Cloud Credentials and Database URL
npm install
npm run init-db
npm run dev
```

> **Detailed Setup:** For full database setup and production deployment instructions, see the **[Dashboard Documentation](dashboard/README.md)**.

### 3. Create an Agent

Open [http
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.