AI music producer agent using SuperComposer and FoxDot. Generate and live code music with natural language through a beautiful Streamlit interface.
# 🎹 FoxDot AI Music Agent Create music with AI using natural language. Generate and live code music using Google Gemini and FoxDot, with real-time streaming, AI thinking visualization, and auto-execution in SuperCollider.    ## ✨ Features ### 🎨 Beautiful Streamlit UI - **Real-time Streaming** - See AI responses stream in character by character - **Thinking Visualization** - Watch the AI's reasoning process unfold - **Tool Call Indicators** - Live function call execution with code preview - **Music State Panel** - Live view of BPM, scale, root, and active layers ### 🤖 AI-Powered Music Creation - **Natural Language Interface** - Describe music in plain English - **Genre Understanding** - Knows house, techno, lo-fi, hip-hop, ambient, and more - **Intelligent Layering** - AI maintains context of what's playing - **Continuous Evolution** - Build and modify compositions layer by layer ### 🎵 FoxDot Integration - **Global Environment** - Shared FoxDot state like the original working code - **Auto-execution** - Code runs immediately in SuperCollider (default ON) - **50+ Synths** - Full knowledge of FoxDot synthesizers - **Effects & Scales** - Complete access to FoxDot's musical capabilities ## 🚀 Quick Start ### 1. Install Dependencies ```bash pip install -r requirements.txt ``` ### 2. Set API Key ```bash # Set environment variable export GOOGLE_API_KEY="your-api-key-here" # Or enter it directly in the Streamlit UI ``` Get your API key from [Google AI Studio](https://aistudio.google.com/apikey). ### 3. Setup SuperCollider & FoxDot **Install SuperCollider:** - Download from [supercollider.github.io](https://supercollider.github.io/downloads) **Start SuperCollider and run:** ```supercollider FoxDot.start ``` **Wait for:** `FoxDot Quark ready!` message ### 4. Run the Str
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.