An intelligent AI agent powered by Google's Gemini API that can build complete websites based on natural language descriptions. Demo: https://youtu.be/3jdWDY8YHu4
# AI Website Builder Agent 🌐

An intelligent AI agent powered by Google's Gemini API that can build complete websites based on natural language descriptions.

Created by [Sanjay Sikdar](https://sanjaysikdar.dev).

## Key Features Covered ✨

- **Natural Language Interface**: Describe what you want, and the agent builds it.
- **Command Execution System (⚙️)**: Secure shell/terminal command execution with safety filters, validation, and a 30-second timeout, exposed via the `executeCommand` tool.
- **File Operations (📁)**: Read, write, and list files using modern `fs/promises` APIs (`readFile`, `writeFile`, `listFiles`).
- **Persistent Chat Memory (📚)**: Keeps conversational context for up to 20 messages, trimming automatically to stay within token limits.
- **Modify Existing Components (🧱)**: Edit HTML, CSS, or JS parts dynamically without breaking the overall document structure.
- **Smart Code Generation**: Produces clean, modern, responsive website scaffolds and component code.
- **Rate Limiting + Retry Logic (⏳)**: Built-in 2s delay between API calls, plus automatic retries with exponential backoff (e.g., 5s, 15s, 30s), up to 3 retries for quota errors.
- **Error Handling (🔒)**: Robust handling with smart fallbacks for timeouts, authentication issues, and other common failures.
- **Image Placeholder Management (🖼️)**: Uses services like `picsum.photos` and `dummyimage.com` for image placeholders during scaffolding.
- **Cross-Platform Support (🖥️)**: Works on Windows, macOS, and Linux (handles path and shell differences).
- **Enhanced UX**: Beautiful console output with emojis, progress indicators, and clear retry/error messages.
- **Security Features**: Blocks dangerous commands and enforces command validation policies.

## Installation 📦

```bash
# Install dependencies
npm install
```

## Configuration ⚙️

1. Get your Gemini API key from [Google AI Studio](https://aistudio.google.com/app/apikey)
2. Set as environment va
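
The retry and chat-memory behavior listed under Key Features could look roughly like the sketch below. It is not taken from the project's source: `withRetry`, `isQuotaError`, `trimHistory`, and the constants are hypothetical names, and the way quota errors are detected (HTTP 429 or a message containing "quota") is an assumption. The delays are a fixed schedule copied from the README's example values (5s, 15s, 30s) rather than a computed exponential.

```javascript
// Hypothetical sketch; none of these names come from the project itself.
const BACKOFF_DELAYS_MS = [5000, 15000, 30000]; // 5s, 15s, 30s, as in the README
const MAX_HISTORY = 20; // README: context kept for up to 20 messages

// Assumption: quota errors surface as HTTP 429 or mention "quota" in the message.
function isQuotaError(err) {
  return err?.status === 429 || /quota/i.test(String(err?.message ?? ""));
}

const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Runs fn(). On a quota error, waits for the next delay in the schedule and
// tries again, up to delays.length retries. Any other error is rethrown at once.
async function withRetry(fn, { delays = BACKOFF_DELAYS_MS, wait = sleep } = {}) {
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (err) {
      if (!isQuotaError(err) || attempt >= delays.length) throw err;
      await wait(delays[attempt]);
    }
  }
}

// Returns only the most recent `max` messages, dropping the oldest first.
function trimHistory(messages, max = MAX_HISTORY) {
  return messages.length > max ? messages.slice(-max) : messages;
}
```

A caller would wrap each model request, for example `await withRetry(() => callModel(prompt))`, where `callModel` stands in for whatever function actually sends the request. The `wait` option exists only so the delay can be swapped out in tests.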
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.