V1Claw is a self-hosted AI assistant that runs on your Mac, Linux, Windows, or Android via Termux. Connect any LLM (Claude, GPT, Gemini, or local), talk to it through voice or text, and let it control your device - read files, run commands, browse the web, send messages, & more. One binary. No cloud dependency. Your data stays on your machine.
# V1Claw **Your 24/7 Personal AI Assistant — Like Jarvis, but open source.** V1Claw is a self-hosted AI assistant that runs on your Mac, Linux PC, Windows machine, or Android phone via Termux. Connect any LLM provider you want, talk to it through voice or text, and let it control your device — read files, run commands, browse the web, take photos, send messages, and more. One binary. No required V1Claw cloud service. Your data stays on your machine unless you choose a cloud model or channel. Default home directory: `~/.v1claw` on macOS/Linux, `%APPDATA%\\V1Claw` on Windows. Set `V1CLAW_HOME` to override it. --- ## Features ### 🧠 15 LLM Providers Connect to any AI model — paid APIs or self-hosted: | Provider | Type | Models | |----------|------|--------| | **Anthropic** | Cloud API | Gemini 4, Claude 3.5, etc. | | **OpenAI** | Cloud API | GPT-5, GPT-4, etc. | | **Google Gemini** | Cloud API | Gemini 2.x, etc. | | **Google Vertex AI** | Cloud API | Gemini on Vertex | | **AWS Bedrock** | Cloud API | Gemini, Llama, Nova | | **Azure OpenAI** | Cloud API | GPT deployments on Azure | | **Groq** | Cloud API | LLaMA, Mixtral (fast inference) | | **OpenRouter** | Cloud API | 100+ models via single API | | **Gemini** | Cloud API | Gemini V3, Coder | | **NVIDIA** | Cloud API | NIM models | | **Moonshot** | Cloud API | Kimi | | **Zhipu** | Cloud API | GLM-4 | | **Ollama** | Local | Any GGUF model on your machine | | **vLLM** | Local | Self-hosted inference server | | **GitHub Copilot** | Cloud API | Via Gemini subscription | | **Any OpenAI-compatible API** | REST API | Custom endpoints | ### 🎤 Voice I/O Pipeline Talk to V1Claw like Jarvis: - **Microphone recording** — continuous listening with configurable backends - **Speech-to-Text** — powered by Groq Whisper (fast, accurate) - **Text-to-Speech** — OpenAI TTS or Edge TTS (free) - **Wake word detection** — "Hey V1Claw" or custom phrases - **Push-to-talk mode** — manual recording trigger - Works on **desktop** (a
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.