A flexible, extensible AI agent backend built with NestJS—designed for running local, open-source LLMs (Llama, Gemma, Qwen, DeepSeek, etc.) via Docker Model Runner. Real-time streaming, Redis messaging, web search, and Postgres memory out of the box. No cloud APIs required!
# Local Agent: Fast AI Backend with Docker Model Runner A flexible, extensible AI agent backend built with NestJS—designed for running local, open-source LLMs (Llama, Gemma, Qwen, DeepSeek, etc.) via Docker Model Runner. Real-time streaming, Redis messaging, web search, and Postgres memory out of the box. No cloud APIs required! --- ## 🚀 Quick Start 1. **Clone the repository** ```bash git clone <your-repo-url> cd <your-repo-folder> ``` 2. **Copy and edit environment variables** ```bash cp .env.example .env # Edit .env and fill in your model and service config ``` 3. **Start required services (Redis, PostgreSQL, Local LLM) with Docker Compose** ```bash docker compose up -d ``` - PostgreSQL: `localhost:5433` - Redis: `localhost:6379` - Local LLM runner: `localhost:12434` ([Model Runner guide](https://blog.agentailor.com/posts/docker-model-runner-gemma)) 4. **Install dependencies** ```bash pnpm install ``` 5. **Start the development server** ```bash pnpm run start:dev ``` --- ## 🛠️ Environment Variables See `.env.example` for all options. Key variables: - `MODEL_BASE_URL` — e.g. `http://localhost:12434/engines/llama.cpp/v1` - `MODEL_NAME` — e.g. `ai/gemma3:latest`, `llama-3`, `qwen`, `deepseek` - `TAVILY_API_KEY` — for web search ([Get your key](https://www.tavily.com/)) - `REDIS_HOST`, `REDIS_PORT`, etc. - `POSTGRES_*` — for memory --- ## ✨ Features - 🤖 Local, open-source LLMs (Llama, Gemma, Qwen, DeepSeek, etc.) - 🌊 Real-time streaming responses - 💾 Conversation history with Postgres memory - 🌐 Web search integration (Tavily) - 🧵 Custom ThreadService for conversations - 📡 Redis pub/sub for real-time messaging - 🎯 Clean, maintainable architecture --- ## 🧩 Model Setup (Docker Model Runner) - This project is designed for local LLMs only, using [Docker Model Runner](https://blog.agentailor.com/posts/docker-model-runner-gemma). - Supported models: Llama, Gemma, Qwen, DeepSeek, and other op
HAL 分层混合模型工作流 — 强模型(Claude)负责理解/拆解/验收,低成本模型(DeepSeek)负责检索/提取/清洗。Hermes Agent skill。
An LLM agent fine-tuned on DeepSeek for spaced repetition, dynamically integrating knowledge points based on the Ebbinghaus forgetting curve.
基于 STM32F103 构建的端到端 AI 智能手表生态。自研“零重定位”原生机器码动态加载引擎与页面栈式 UI 框架;集成生产级 OTA 回滚保护机制与高带宽(921600 baud)串口协议栈。通过 Node.js 中继实现 DeepSeek AI 语义控制及 ASRPRO 语音全双工交互,是一个集成了分布式计算、现代存储管理与 AI Agent 的嵌入式全栈工程。
A Meta-Agent-Driven Self-Evolving Multi-Agent System for UAV Detection and Tracking
One command to run Hermes AI Agent with a browser UI. Zero prerequisites. 一行命令,AI 就位。
网页应用Agent,接入DeepSeek、Mimo等模型