🤖 The Ultimate AI Form Filler. Automate complex web forms & PDFs with Voice, generic LLMs (Gemini/Phi-2), and Playwright. The best open-source autonomous form filling agent.
# Form Flow AI | The Ultimate AI Form Filler & Automation Agent
<div align="center">
<img src="assets/hero-banner.png" alt="Form Flow AI Demo" width="100%" />
<h3>🚀 Intelligent AI Form Filler & Voice-Driven Automation Agent</h3>
<p>
<a href="https://www.python.org/"><img src="https://img.shields.io/badge/Python-3.10+-3776AB?style=flat-square&logo=python&logoColor=white" alt="Python"></a>
<a href="https://react.dev/"><img src="https://img.shields.io/badge/React-18-61DAFB?style=flat-square&logo=react&logoColor=black" alt="React"></a>
<a href="https://fastapi.tiangolo.com/"><img src="https://img.shields.io/badge/FastAPI-0.109-009688?style=flat-square&logo=fastapi&logoColor=white" alt="FastAPI"></a>
<a href="https://playwright.dev/"><img src="https://img.shields.io/badge/Playwright-Automation-45BA4B?style=flat-square&logo=playwright&logoColor=white" alt="Playwright"></a>
<a href="https://aistudio.google.com/"><img src="https://img.shields.io/badge/LLM-Gemini%20Pro-8E75B2?style=flat-square&logo=google&logoColor=white" alt="Gemini"></a>
<br/>
<a href="#-project-roadmap--execution-log"><img src="https://img.shields.io/badge/Status-Active%20Beta-success?style=for-the-badge" alt="Status"></a>
<a href="LICENSE"><img src="https://img.shields.io/badge/License-MIT-blue?style=for-the-badge" alt="License"></a>
</p>
</div>
---
## 📋 Executive Summary
**Form Flow AI** is the world's most advanced **AI Form Filler** and **Automated Form Filling** agent, designed to autonomous navigate, understand, and complete complex web forms through natural voice conversation. Unlike basic autofill extensions, this **AI Form Filler** acts as an intelligent digital proxy—orchestrating a symphony of **Web Speech API** for real-time input, **Generic LLMs** (Gemini/GPT) for semantic reasoning, and **Playwright** for robust browser automation.
> **Key Value Proposition**:
> "Don't just fill forms—delegate them." Form Flow AI turns tedious data entry intoGoogle's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.