VoiceAssistant

Author: umara25

umara25 September 16, 2025


A real-time voice AI agent powered by LiveKit and Google's Gemini Realtime API, enabling natural conversational interactions through a browser-based voice interface.

Voice Agent Prototype

A real-time voice AI agent using LiveKit and Google's Gemini Realtime API for natural conversation.

What It Does

- Real-time bidirectional voice conversation with AI
- Natural speech processing and response generation
- Web-based interface for easy access
- Continuous conversation flow (not just single responses)

Built With

- LiveKit: real-time audio streaming
- Google Gemini API: AI conversation model
- Flask: web backend for token generation
- HTML/JavaScript: browser-based voice interface

Quick Start

Option 1: Docker (Recommended)

Prerequisites: Install Docker Desktop

Set up environment:
```
cp env.example .env
```
Edit .env with your LiveKit and Google Cloud credentials.
Run with Docker:
```
docker-compose up --build
```
Start conversation:
- Open http://localhost:5000
- Click "Join Conversation"
- Allow microphone access
- Start talking with the AI agent

Option 2: Local Development

Install dependencies:
```
pip install -r requirements.txt
```
Set up environment:
```
cp env.example .env
```
Edit .env with your LiveKit and Google Cloud credentials.
Run the application:
```
python run_webui.py
```
Start conversation:
- Open http://localhost:5000
- Click "Join Conversation"
- Allow microphone access
- Start talking with the AI agent

Environment Variables

LIVEKIT_URL=wss://your-livekit-server.livekit.cloud
LIVEKIT_API_KEY=your_livekit_api_key
LIVEKIT_API_SECRET=your_livekit_api_secret
GOOGLE_API_KEY=your_google_api_key
