A powerful Streamlit app that enables seamless interaction with multiple large language models (LLMs) using various media inputs, featuring advanced options for multi-modality, voice responses, chat, summarization, and agent-based tools.
# 🚀Super AI Assistant An interactive AI-powered assistant built with Streamlit! Chat with advanced models, get voice responses, and upload all kinds of media to unlock the potential of AI in real time. Super AI Assistant brings Google Gemini and Groq models directly to you with a range of features and an easy-to-use interface. ---  ## Features - **Model Selection:** Pick between Google Gemini or Groq's open-source models. The app's sidebar adapts dynamically based on your choice. - **Multi-Modality:** Enter text or upload images, audio, video, PDFs, Docx files, and even record voice input, all in one place. Snap a picture with your camera, and the app will process it as input! - **Parameter Customization:** Control the temperature and maximum tokens to change the behavior of the models. - **Voice Response:** Get voice responses with multiple voice options to personalize your experience. - **Chat History:** Keep track of your conversation history. The app remembers your messages, allowing you to continue from where you left off—or reset anytime for a fresh start. - **Agent-Based Tasks:** Select tools for specific agent tasks to retrieve real-time information on your chosen topics. - **Summarization:** Effortlessly summarize webpages and YouTube videos. Just input a URL, and the app will provide a markdown summary! --- ## 📚 Libraries Used - **Streamlit:** Main framework for creating the app interface. - **Edge TTS:** For generating voice responses. - **Google Generative AI:** For advanced Google Gemini functionalities. - **Langchain:** For Agent-based tasks, chatbot, and summarization. - **Langchain Groq:** For Groq's lightning-fast LLMs. --- ## Getting Started ### Prerequisites - Python 3.11 or higher - Google Gemini and Groq API keys ### 🛠️ Installation & Setup **Step 1: Clone the Repository:** ```bash git clone https://github.com/aasherkamal216/Super-AI-Assistant-App.git cd Super-AI-Assistant-App ``` **Step 2: Set
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.