A modular, LLM-agnostic fullstack agent framework.
# Modular Agent Fullstack Modular Agent Fullstack is an extended and improved version of the "[Gemini Fullstack LangGraph Quickstart](https://github.com/google-gemini/gemini-fullstack-langgraph-quickstart)" project by the Google ChatGPT Team. While the original project offers a great introduction to building fullstack AI agents using LangGraph, it also comes with several limitations that can hinder beginners and AI enthusiasts from further exploration, customization, or learning. This project addresses those limitations by refactoring the codebase into a more modular framework, making it easier to extend, customize, develop, and deploy AI agents with your preferred UI and backend architecture. <div style="display: flex; justify-content: center; gap: 5px;"> <img src="assets/select-agent.png" width="49%" /> <img src="assets/rag-ui.png" width="49%" /> </div> ## Limitaions of Gemini's Fullstack LangGraph Project - **Heavy Dependence on LangGraph**: The original project relies extensively on LangGraph, not just for building the agent, but also for serving frontend components and powering the backend API server. This hides important implementation details and reduces flexibility for developers who prefer to use their own methods for frontend/backend integration. - **Difficult to Extend with Custom Agents**: The current frontend design is rigid and doesn’t support adding custom agents easily, for example, creating custom dashboards for workflow visualization or adding configurable parameter inputs. - **Limited to Gemini** - **Cumbersome Development and Deployment**: Although Docker is used, the setup lacks a dedicated development environment and does not offer a clear or feasible deployment strategy - for example, hosting the project on a public URL is not straightforward. ## Improvements in Modular Agent Fullstack - **Reduced LangGraph Dependency**: LangGraph dependencies `@langchain/langgraph-sdk` have been removed from the frontend. The backend and Docker configu
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.