Heybro transforms Chrome into an intelligent AI agent that executes browser tasks through natural language commands. Powered by Google Gemini, this open-source side-panel extension interprets your requests, analyzes page DOM structures, and autonomously performs clicks, form fills, and multi-page navigation— eliminating manual browser interactions
# Heybro 🤖 **Don't change your browser, just ask Bro to do it.** Heybro is a Chrome extension that automates your browser tasks using AI. Just tell it what you want in plain English, and watch it work. --- ## ✨ What does it do? Instead of clicking buttons and filling forms yourself, you tell Heybro what to do: - "Go to Amazon and search for wireless headphones" - "Open Gmail and draft an email to John" - "Find the cheapest flight to NYC on Google Flights" Heybro understands your request, looks at the page, and does the task for you. --- ## 🚀 Quick Start ### 1. Install the Extension 1. Download or clone this repository 2. Open Chrome and go to `chrome://extensions/` 3. Enable **Developer mode** (top right) 4. Click **Load unpacked** and select the extension folder ### 2. Get Your API Key 1. Go to [Google AI Studio](https://aistudio.google.com/app/apikey) 2. Create a free Claude API key 3. Copy the key ### 3. Set Up Heybro 1. Click the Heybro extension icon in Chrome 2. Open the side panel 3. Click **Settings** (gear icon) 4. Paste your API key 5. Click **Save** ### 4. Start Using It 1. Type your request in the chat box 2. Press Enter 3. Watch Heybro work!
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.