A versatile workflow automation platform to create, organize, and execute AI workflows, from a single LLM to complex AI-driven workflows.
<h1 align="center">💬 Agent Pilot</h1> <p align="center">️ <img src="docs/demo.png" width="600px" alt="AgentPilot desktop demo" /> <br><br> A versatile workflow automation system. Create, organize, and execute complex AI-driven tasks. Agent Pilot provides a seamless experience, whether you want to chat with a single LLM or a complex multi-member workflow. <br><br> With an intuitive and feature-rich interface, you can effortlessly design AI workflows and chat with them in real-time. Branching chats are supported, allowing flexible interactions and iterative refinement. <br><br> Agent Pilot offers generative and customizable UI, allowing creation of custom pages and hierarchical configs. This flexibility gives you the freedom to design an interface that aligns with your specific needs and effortlessly integrate into your workflows. <br><br> The system supports scheduled and recurring workflows that can be set to run based on natural language expressions of time, enabling automation that ranges from every second to every leap year. </p> <div align="center"> [](https://discord.gg/ge2ZzDGu9e) [](https://twitter.com/AgentPilotAI) </div> <p align="center"> <img src="docs/demo.gif" align="center" height="255px" alt="AgentPilot gif demo" style="margin-right: 20px;" /> <img src="docs/Screenshot3.png" align="center" height="250px" alt="AgentPilot gif demo" style="margin-right: 20px;" /> <img src="docs/Screenshot1.png" align="center" height="250px" alt="AgentPilot gif demo" style="margin-right: 20px;" /> </p> <p align="center"> <img src="docs/Screenshot2.png" align="center" height="250px" alt="AgentPilot gif demo" style="margin-right: 20px;" /> <img src="docs/Screenshot4.png" align="center" height="250px" alt="AgentPilot gif demo" style="margin-right: 20px;" /> </p> ## Quickstart ### Binaries <table> <tr>
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.