HighlighAI - An AI Agentic System for Video Summarizing🎥🚀 Powered by Gemini 2.0 Flash Exp, this tool enables users to upload video files and receive detailed, user-friendly analysis.
# HighlighAI - An AI Agentic System for Video Summarizing🎥🚀 A **Streamlit-based application** for analyzing video content and extracting actionable insights using advanced AI models. Powered by **Gemini 2.0 Flash Exp**, this tool enables users to upload video files and receive detailed, user-friendly analysis. --- ## Features - **AI-Powered Video Analysis**: Leverages Gemini 2.0 Flash Exp for multimodal AI processing. - **Supplementary Web Research**: Uses DuckDuckGo for additional context and insights. - **Streamlined User Interface**: Upload video files, ask questions, and get actionable results effortlessly. - **Supports Popular Formats**: Works with `.mp4`, `.mov`, and `.avi` video files. --- ## Tech Stack - **Python**: Core programming language. - **Streamlit**: Interactive UI and deployment. - **Phidata Library**: For building and managing AI agents. - **Google Generative AI**: Video file processing and Gemini model integration. - **DuckDuckGo**: Supplementary web research tool. --- ## Prerequisites - **Python**: Version 3.8 or higher. - **Google API Key**: Add your Google API key to a `.env` file in the project directory: GOOGLE_API_KEY=your_api_key_here --- ## Installation 1. **Clone the Repository**: git clone https://github.com/your-username/video-ai-summarizer.git cd video-ai-summarizer 2. **Install Dependencies**: Run the following command to install required Python packages: pip install streamlit phi duckduckgo-python python-dotenv google-generative-ai --- ## Running the App 1. Start the Streamlit app: streamlit run app.py 2. Access the app in your web browser at: http://localhost:8501 --- ## How to Use 1. **Upload a Video**: Upload a file in `.mp4`, `.mov`, or `.avi` format. 2. **Ask a Question**: Provide specific queries about the video content. 3. **Analyze**: Click the "Analyze Video" button to start processing. 4. **View Results**: Detailed insights will be displayed in the app. --- ## C
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.