A fully local voice agent powered by DeepSeek R1, Faster Whisper, and Chatterbox TTS. Orchestrated with LangGraph for seamless voice-to-voice interaction without cloud dependencies.
# PyVoiceAgent A powerful, local-first interactive voice assistant built for **offline capability** and **complete control**. This project orchestrates state-of-the-art local AI models to provide a seamless voice-to-voice experience without relying on third-party cloud APIs. ## Key Highlights - **Full Voice Interaction**: Talk to the agent and hear it speak back naturally. - **Persistent Memory**: Remembers your previous conversations across sessions using a robust SQLite database. - **Local Intelligence**: Powered by **DeepSeek R1** (via Ollama) for reasoning and **Faster Whisper** for transcription. - **Intelligent Summarization**: Automatically summarizes interactions to maintain concise context. ## Pros & Cons | Advantages | Trade-offs | | :--- | :--- | | **Zero Cost**: No recurring API fees; runs entirely on your hardware. | **Hardware Dependent**: Performance scales with your CPU/GPU power. | | **Offline**: Works completely without an internet connection (after initial setup). | **Setup**: Requires installing and managing local models (Ollama, etc.). | | **Customizable**: Full access to modify the graph, prompts, and memory logic. | **Model Capability**: Local models (e.g., 8B) are powerful but may lag behind massive cloud models (e.g., GPT-4) in complex reasoning. | | **Low Latency**: Eliminates network latency constraints. | **Resource Usage**: Can be memory and compute intensive during inference. | ## Architecture The system uses **LangGraph** to manage the conversational flow: 1. **Transcribe**: `Faster Whisper` converts your voice to text. 2. **Context retrieval**: Fetches conversation history and session context from `SQLite`. 3. **Process**: `DeepSeek R1` generates a response and "thinks" through the problem. 4. **Synthesize**: `Chatterbox TTS` converts the text response back to audio. 5. **Save & Summarize**: The interaction is logged, and a summary is generated for future context. ## Quick Start ### Prerequisites - **Python 3.
HAL 分层混合模型工作流 — 强模型(Claude)负责理解/拆解/验收,低成本模型(DeepSeek)负责检索/提取/清洗。Hermes Agent skill。
An LLM agent fine-tuned on DeepSeek for spaced repetition, dynamically integrating knowledge points based on the Ebbinghaus forgetting curve.
基于 STM32F103 构建的端到端 AI 智能手表生态。自研“零重定位”原生机器码动态加载引擎与页面栈式 UI 框架;集成生产级 OTA 回滚保护机制与高带宽(921600 baud)串口协议栈。通过 Node.js 中继实现 DeepSeek AI 语义控制及 ASRPRO 语音全双工交互,是一个集成了分布式计算、现代存储管理与 AI Agent 的嵌入式全栈工程。
A Meta-Agent-Driven Self-Evolving Multi-Agent System for UAV Detection and Tracking
One command to run Hermes AI Agent with a browser UI. Zero prerequisites. 一行命令,AI 就位。
网页应用Agent,接入DeepSeek、Mimo等模型