Google Drive for AI agents. Store any file and search by meaning across modalities.
<div align="center">

<img src="assets/banner.png" alt="ClawDrive — Google Drive for AI agents" width="600" />

[License](LICENSE) · [Releases](https://github.com/hyper3labs/clawdrive/releases) · [DeepWiki](https://deepwiki.com/Hyper3Labs/clawdrive) · [Discord](https://discord.gg/Za3rBkTPSf) · [Buy Me a Coffee](https://buymeacoffee.com/morozovdd)

[Website](https://claw3drive.com) · [Documentation](CLI.md) · [Live Demo](https://app.claw3drive.com/) · [Report Bug](https://github.com/hyper3labs/clawdrive/issues/new?template=bug_report.md) · [Request Feature](https://github.com/hyper3labs/clawdrive/issues/new?template=feature_request.md)

</div>

---

<div align="center">

<img src="assets/demo.gif" alt="ClawDrive 3D file cloud demo" width="700" />

<br/>
<br/>

Browse files by meaning, not folders.

</div>

## What is ClawDrive?

**ClawDrive** indexes your files (text, images, audio, video) and makes them searchable by meaning. You interact with it through a CLI, a REST API, or a browser UI with a 3D file cloud.

Files live in **pots**. A pot is a named collection you build from files, folders, or URLs. Search and sharing are both scoped to the pot; nothing else on your machine is visible.

> Everything runs on your machine. The only external call is to Gemini for embeddings. For remote access, point a tunnel at the local server.

## Features

- **Multimodal Shared Representation:** Text, images, audio, and video all live in the same embedding space.
- **Cross-Modal Retrieval:** Search your files by meaning: use text to find images
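
To make the shared embedding space and pot scoping described above more concrete, here is a minimal sketch in Python. It is not ClawDrive's actual implementation: the `Item` class, the `search` function, the pot names, and the three-dimensional toy vectors are all hypothetical, and real vectors would come from an embedding model rather than being written by hand. The sketch only shows the general idea that once every modality maps to a vector in one space, a single similarity ranking can return images and audio for a text query, and that filtering by pot keeps everything else out of the results.

```python
import math
from dataclasses import dataclass


@dataclass
class Item:
    """Hypothetical index entry: one file, its modality, its pot, its vector."""
    name: str
    modality: str
    pot: str
    vector: list[float]


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(items: list[Item], query_vector: list[float], pot: str, top_k: int = 3) -> list[Item]:
    """Rank only the items in the given pot by similarity to the query vector."""
    scoped = [item for item in items if item.pot == pot]
    ranked = sorted(scoped, key=lambda item: cosine(query_vector, item.vector), reverse=True)
    return ranked[:top_k]


# Toy data: hand-written vectors standing in for model-produced embeddings.
ITEMS = [
    Item("beach.jpg", "image", "vacation", [0.9, 0.1, 0.0]),
    Item("waves.mp3", "audio", "vacation", [0.7, 0.3, 0.1]),
    Item("itinerary.txt", "text", "vacation", [0.2, 0.9, 0.1]),
    Item("taxes.pdf", "text", "finance", [0.0, 0.1, 0.9]),
]

if __name__ == "__main__":
    # Stand-in for the embedding of a text query such as "ocean".
    query = [1.0, 0.0, 0.0]
    for hit in search(ITEMS, query, pot="vacation", top_k=2):
        print(hit.modality, hit.name)
```

With this toy data, the text-style query ranks the image and the audio clip in the `vacation` pot first, and the file in the `finance` pot is never considered because it falls outside the searched pot.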
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.