A high performance C++ SDK for AI Agents
[TOC] # Agents-SDK - A High Performance C++ Framework for AI Agents    **Agents-SDK** is a **portable, high-performance C++ framework** for building **on-device, agentic AI systems** — think **LangChain for the edge**. This SDK is purpose-built for developers who want to create **local-first AI agents** that can reason, plan, and act without relying on the cloud. ## Features - **Modular Architecture** — Compose agents from interchangeable components. - **Multi-Provider Support** — Connect to multiple LLM providers seamlessly: - **OpenAI** (GPT-5, GPT-4o, GPT-4) - **Anthropic** (Gemini 3 family models: Opus, Sonnet, Haiku) - **Google** (Gemini family models: Pro, Flash) - **Ollama/llama-cpp** (local models like Llama, Mistral, etc.) - **Optimized for Speed and Memory** — Built in C++ with focus on performance. - **Built-In Workflow Patterns** - Prompt Chaining - Routing - Parallelization - Orchestrator-Workers - Evaluator-Optimizer - **Autonomous Agents** — Supports modern reasoning strategies: - ReAct (Reason + Act) - CoT (Chain-of-Thought) [In Development] - Plan and Execute - Zero-Shot [In Development] - Reflexion [In Development] - **Extensible Tooling System** — Plug in your own tools or use built-in ones (Web Search, Wikipedia, Python Executor, etc). ## Pre-requisites - C++20 compatible compiler (GCC 14+, Clang 17+, MSVC 2022+) - Bazel 8.3.1+ (https://bazel.build/install) - Dependencies (already provided for convenience) - python3 (3.11+) - nlohmann/json - spdlog ## Quick Start ### Installation 1. Clone the repository: ```bash git clone https://github.com/RunEdgeAI/agents-sdk.git ``` 2. Navigate to SDK: ```bash cd agents-sdk ``` 3. Obtain A
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.