♾️ Private Agent Fleet with Spec Coding. Each agent gets their own GPU-accelerated desktop. Run Claude, Codex, Gemini and open models on a full private AI Stack ♾️
<div align="center"> <img alt="logo" src="https://helix.ml/assets/helix-logo.png" width="250px"> <br/> <br/> </div> <p align="center"> <a href="https://app.helix.ml/">SaaS</a> • <a href="https://docs.helixml.tech/docs/controlplane">Private Deployment</a> • <a href="https://docs.helixml.tech/docs/overview">Docs</a> • <a href="https://discord.gg/VJftd844GE">Discord</a> </p> # HelixML - AI Agents on a Private GenAI Stack [👥 Discord](https://discord.gg/VJftd844GE) **Deploy AI agents in your own data center or VPC and retain complete data security & control.** HelixML is an enterprise-grade platform for building and deploying AI agents with support for RAG (Retrieval-Augmented Generation), API calling, vision, and multi-provider LLM support. Build and deploy LLM applications by writing a simple [`helix.yaml`](https://docs.helixml.tech/helix/develop/getting-started/) configuration file. Our intelligent GPU scheduler packs models efficiently into available GPU memory and dynamically loads and unloads models based on demand, optimizing resource utilization. ## ✨ Key Features ### 🤖 AI Agents - **Easy-to-use Web UI** for agent interaction and management - **Session-based architecture** with pause/resume capabilities - **Multi-step reasoning** with tool orchestration - **Memory management** for context-aware interactions - **Support for multiple LLM providers** (OpenAI, Anthropic, and local models) <img width="1768" height="1053" alt="AI Agents Interface" src="https://github.com/user-attachments/assets/0e945ace-4f54-46a2-8d20-49485169486f" /> ### 🛠️ Skills and Tools - **REST API integration** with OpenAPI schema support - **MCP (Model Context Protocol) server** compatibility - **GPTScript integration** for advanced scripting - **OAuth token management** for secure third-party access - **Custom tool development** with flexible SDK <img width="1767" height="1057" alt="Skills and Tools" src="https://github.com/user-attachments/assets/575330f7-cfda-4e68-acd
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.