RAGent Chatbot is a smart AI assistant that combines Retrieval-Augmented Generation (RAG) with a tool-using LLM agent. It can answer user queries using uploaded documents, perform web searches, summarize text, evaluate math expressions, check weather, and more, all powered by LangChain, Qdrant, and Gemini.
# 🤖 RAGent Chatbot: Intelligent RAG + ReAct Assistant

> 💡 **Blog Guide**: [How This Chatbot is Built](https://shafiqulai.github.io/blogs/blog_6.html)

The **RAGent Chatbot** is a smart, modular, and explainable AI assistant that combines **Retrieval-Augmented Generation (RAG)** with **ReAct-style agent reasoning**. It goes beyond document Q&A, deciding when to read, when to think, and when to act using tools like web search, Wikipedia, calculator, and more.

<div align="center">
  <img src="figure/thumbnail.png" width="75%" alt="RAGent Chatbot UI Preview"/>
</div>

## 🧠 What Can RAGent Do?

<ul style="padding-left: 40px;">
  <li>📄 Answer questions from uploaded documents (PDF, DOCX, PPTX, XLSX, TXT, CSV, JSON)</li>
  <li>🛠️ Use external tools dynamically: web search, weather, Wikipedia, calculator, etc.</li>
  <li>🔍 Combine semantic and keyword search using <b>hybrid retrieval</b></li>
  <li>🧠 Think step-by-step using the <b>ReAct agent framework</b></li>
  <li>💬 Maintain chat context using memory for natural multi-turn conversations</li>
</ul>

## 🧱 Architecture Highlights

<ul style="padding-left: 40px;">
  <li>🤖 <b>ReAct Agent</b>: Reasoning + tool calling in a loop: Thought → Action → Observation → Final Answer</li>
  <li>📚 <b>RAG Pipeline</b>: Retrieves relevant chunks from Qdrant DB and answers with Gemini LLM</li>
  <li>🛠️ <b>Custom Tooling</b>: Extensible tool framework (BaseTool) with weather, web, wiki, calculator, LLM, etc.</li>
  <li>💾 <b>Memory</b>: Per-session chat history powered by MemoryManager</li>
  <li>🖥️ <b>Gradio UI</b>: Intuitive chat + upload interface</li>
</ul>

## 🛠 Tech Stack

| Layer | Technology |
|--------------|--------------------------------------|
| 🧠 LLM | Google Gemini via LangChain |
| 📦 Embeddings | BAAI/bge-base-en-v1.5 |
| 🔍 Vector DB | Qdrant (supports hybrid search) |
| 💬 UI | Gradio |
| 🧰 Tools
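
The Thought → Action → Observation → Final Answer loop listed under Architecture Highlights can be illustrated with a minimal, dependency-free sketch. This is not the project's actual code: `scripted_llm`, `TOOLS`, `calculator`, `react_loop`, and the `Action: tool[input]` text format are all hypothetical stand-ins, and a real deployment would presumably call Gemini and the project's own `BaseTool` subclasses instead.

```python
import ast
import operator
from typing import Callable, Dict, List

# Hypothetical arithmetic operators supported by the toy calculator tool.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def calculator(expression: str) -> str:
    """Evaluate a basic arithmetic expression (+, -, *, /) without eval()."""
    def _eval(node: ast.AST) -> float:
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
        raise ValueError("unsupported expression")

    return str(_eval(ast.parse(expression, mode="eval").body))


# Hypothetical tool registry: tool name -> callable taking and returning text.
TOOLS: Dict[str, Callable[[str], str]] = {"calculator": calculator}


def scripted_llm(transcript: List[str]) -> str:
    """Stand-in for a real LLM: request one tool call, then answer from it."""
    observations = [t for t in transcript if t.startswith("Observation:")]
    if not observations:
        return "Thought: I need to compute this.\nAction: calculator[2 * (3 + 4)]"
    result = observations[-1].split(": ", 1)[1]
    return f"Thought: I have the result.\nFinal Answer: {result}"


def react_loop(
    question: str,
    llm: Callable[[List[str]], str],
    tools: Dict[str, Callable[[str], str]],
    max_steps: int = 5,
) -> str:
    """Alternate model replies and tool observations until a final answer."""
    transcript = [f"Question: {question}"]
    for _ in range(max_steps):
        reply = llm(transcript)
        transcript.append(reply)
        if "Final Answer:" in reply:
            return reply.split("Final Answer:", 1)[1].strip()
        # Look for a line of the assumed form "Action: tool_name[tool input]".
        action = next(
            (ln for ln in reply.splitlines() if ln.startswith("Action:")), None
        )
        if action is None:
            transcript.append("Observation: no action given")
            continue
        name, _, arg = action[len("Action:"):].strip().partition("[")
        tool = tools.get(name.strip())
        observation = tool(arg.rstrip("]")) if tool else f"unknown tool {name!r}"
        transcript.append(f"Observation: {observation}")
    return "No answer within step limit."


if __name__ == "__main__":
    print(react_loop("What is 2 * (3 + 4)?", scripted_llm, TOOLS))
```

The `max_steps` cap is one simple way to keep the loop from running indefinitely when the model never emits a final answer; whether RAGent uses a similar limit is not stated in the README.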
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.