Agent Spotlight is a desktop AI agent that provides a simple, powerful, and extensible interface for interacting with language models. Built with Tauri, Next.js, and Rust, it's designed to be a lightweight and fast way to bring the power of AI to your desktop.
# 🔍 Agent Spotlight <div align="center"> **The Desktop AI Agent That Actually Gets Stuff Done** *Transform your desktop into an AI-powered command center. One keystroke. Infinite possibilities.* [](https://tauri.app) [](https://gemini.google.com) [](https://rust-lang.org) [](https://nextjs.org) [Quick Start](#quick-start) • [Features](#features) • [Add Tools](#extensibility-the-game-changer) • [Examples](#real-world-examples) *Hit Cmd+` and ask anything. Your AI agent will figure out the rest.* </div> --- ## What Makes Agent Spotlight Special? > **TL;DR**: This isn't just another AI chatbot. It's a desktop agent that can actually interact with your files, run commands, and integrate with any tool you want—all through natural language. **The Problem**: Every AI assistant lives in a browser tab, disconnected from your actual work. **The Solution**: Agent Spotlight brings AI directly to your desktop with: - **Global hotkey access** (Cmd+` from anywhere) - **Real tool integration** via Model Context Protocol (MCP) - **Native desktop performance** (built with Rust + Tauri) - **Beautiful, minimal UI** that stays out of your way - **Infinite extensibility** - add any tool, API, or data source --- ## Features ### Core Features - **Spotlight Interface**: macOS Spotlight-inspired design with global hotkey access - **AI-Powered**: Uses Google's Gemini 2.5 Flash for intelligent query processing - **Function Calling**: AI automatically decides when and which tools to use - **Lightning Fast**: Native performance with Rust backend - **Modern UI**: Built wit
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.