DeepSeek-R1-Voice-Agent

Name: DeepSeek-R1-Voice-Agent
Author: theaifutureguy

theaifutureguy June 20, 2025

20 copies 0 downloads

An interactive AI voice agent that can capture and transcribe speech in real-time, generate intelligent responses using the DeepSeek R1 (7B model) AI, and convert the responses back to natural speech for immediate playback. The agent maintains conversation context and supports cross-platform usage on macOS, Linux, and Windows.

DeepSeek R1 AI Voice Agent

A real-time AI voice assistant powered by DeepSeek R1 that enables seamless voice conversations through speech-to-text transcription, AI response generation, and text-to-speech synthesis.

🌟 Overview

This project creates an interactive AI voice agent that:

Captures and transcribes speech in real-time using AssemblyAI
Generates intelligent responses using DeepSeek R1 (7B model) via Ollama
Converts AI responses back to natural speech using ElevenLabs
Streams audio responses for immediate playback

✨ Features

Real-time Speech Recognition: High-quality speech-to-text transcription with AssemblyAI
Advanced AI Responses: Powered by DeepSeek R1's reasoning capabilities
Natural Voice Synthesis: Professional text-to-speech with ElevenLabs
Streaming Audio Playback: Low-latency audio streaming for responsive conversations
Conversation Memory: Maintains context throughout the conversation
Cross-platform Support: Works on macOS, Linux, and Windows

🔧 Prerequisites

API Keys Required

AssemblyAI API Key: Get your free API key
ElevenLabs API Key: Sign up for ElevenLabs

System Dependencies

Install Ollama

Download and install Ollama from ollama.com

Install PortAudio

Ubuntu/Debian:

sudo apt update && sudo apt install portaudio19-dev

macOS:

brew install portaudio

Windows: PortAudio is typically included with the Python package installation.

Install MPV (macOS only)

brew install mpv

📦 Installation

1. Clone the Repository

git clone https://github.com/danieladdisonorg/DeepSeek-R1-Voice-Agent.git
cd DeepSeek-R1-Voice-Agent

2. Install Python Dependencies

pip install "assemblyai[extras]" ollama elevenlabs

3. Download DeepSeek R1 Model

Comments

More Agents

View all

agentic-ai

Klaatcode

Open-source AI coding agent for the terminal. Claude Code-grade accuracy with smart model routing — uses the right AI model for each task, cutting costs 10x. Supports Claude, GPT, Gemini, DeepSeek & more.

KlaatAI

139

agent

Agentmaker

A general-purpose Python framework for building LLM agents and multi-agent systems. "Four lines of code, an agent with memory."

xinhuangcs

ai-api

Api Model Playground Cookbook

Ultimate LLM API Integration Cookbook 2026 for Cursor & AI Agents

09omerdgn-droid

150

agent-framework

Agent Ecologies

Ultimate Multi-Agent OS for Autonomous AI NPCs 2026

israriqbal

153

Private Agent

PrivateAgent is an open-source Android automation agent built with Flutter. It utilizes the DeepSeek API and native Android Accessibility Services to interpret screen layouts and execute multi-step tasks across any installed application via natural language commands.

orailnoor

123

Loom Novel

把一队分工 Agent 织成一条写小说的流水线,做成桌面客户端;写作指纹让它越写越像你(BYO DeepSeek key,纯本地)。

WadeZhao23

184