Open Comet is an autonomous AI agent integrated into your Chrome browser. It enables safe, transparent, and multi-modal web automation—allowing AI models to browse, research, and interact with the web just like a human.
<p align="center"> <img src="assets/icons/icon128.png" width="100" alt="Open Comet Logo" /> </p> <h1 align="center">Open Comet</h1> <p align="center"> <strong>The autonomous AI browser agent for deep research & tasks.</strong> </p> <p align="center"> <img src="https://img.shields.io/badge/Version-1.1.0-blue?style=for-the-badge" alt="Version" /> <img src="https://img.shields.io/badge/JavaScript-F7DF1E?style=for-the-badge&logo=javascript&logoColor=black" alt="JavaScript" /> <img src="https://img.shields.io/badge/Chrome_Extension-4285F4?style=for-the-badge&logo=google-chrome&logoColor=white" alt="Chrome Extension" /> <img src="https://img.shields.io/badge/License-MIT-green?style=for-the-badge" alt="License" /> </p> <p align="center"> <img src="https://img.shields.io/badge/OpenAI-412991?style=for-the-badge&logo=openai&logoColor=white" alt="OpenAI" /> <img src="https://img.shields.io/badge/Anthropic-7562FF?style=for-the-badge&logo=anthropic&logoColor=white" alt="Anthropic" /> <img src="https://img.shields.io/badge/Gemini-8E75FF?style=for-the-badge&logo=google-gemini&logoColor=white" alt="Gemini" /> <img src="https://img.shields.io/badge/Mistral_AI-000000?style=for-the-badge&logo=mistralai&logoColor=white" alt="Mistral AI" /> <img src="https://img.shields.io/badge/Ollama-black?style=for-the-badge&logo=ollama&logoColor=white" alt="Ollama" /> </p> --- Open Comet is an autonomous AI agent that lives in your browser's side panel. It operates on a **"Zero-Data" architecture**, ensuring your privacy by keeping history local while providing high-fidelity execution for research, browsing, and multi-step automation. ## 🚀 Key Features - **Autonomous Browsing**: Executes complex, multi-step workflows across any website. - **Deep Research Engine**: STORM-inspired loops to synthesize information from multiple sources and tabs. - **High-Fidelity Execution**: Uses pixel-perfect DOM grounding and visual analysis for reliable interaction. - **"Zero-Data" Pri
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.