Debug your AI agents
<div align="center">
<img src="assets/images/logos/logo_dark.svg" width="200px" alt="vLLora Logo">
#### Lightweight, Real-time Debugging for AI Agents
Debug your Agents in Real Time. Trace, analyze, and optimize instantly. Seamless with LangChain, Google ADK, OpenAI, and all major frameworks.
**[Documentation](https://vllora.dev/docs)** | **[Issues](https://github.com/vllora/vllora/issues)**
</div>
## Quick Start
First, install [Homebrew](https://brew.sh) if you haven't already, then:
```bash
brew tap vllora/vllora
brew install vllora
```
### Start the vLLora:
```bash
vllora
```
> The server will start on `http://localhost:9090` and the UI will be available at `http://localhost:9091`.
vLLora uses OpenAI-compatible chat completions API, so when your AI agents make calls through vLLora, it automatically collects traces and debugging information for every
interaction.
<div align="center">

</div>
### Test Send your First Request
1. **Configure API Keys**: Visit `http://localhost:9091` to configure your AI provider API keys through the UI
2. **Make a request** to see debugging in action:
```bash
curl http://localhost:9090/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-4o-mini",
"messages": [{"role": "user", "content": "What is the capital of France?"}]
}'
```
### Rust streaming example (OpenAI-compatible)
In `llm/examples/openai_stream_basic/src/main.rs` you can find a minimal Rust example that:
- **Builds an OpenAI-style request** using `CreateChatCompletionRequestArgs` with:
- `model("gpt-4.1-mini")`
- a **system message**: `"You are a helpful assistant."`
- a **user message**: `"Stream numbers 1 to 20 in separate lines."`
- **Constructs a `VlloraLLMClient`** and configures credentials via:
```bash
export VLLORA_OPENAI_API_KEY="your-openai-compatible-key"
```
Inside the example, the client is creaHAL 分层混合模型工作流 — 强模型(Claude)负责理解/拆解/验收,低成本模型(DeepSeek)负责检索/提取/清洗。Hermes Agent skill。
An LLM agent fine-tuned on DeepSeek for spaced repetition, dynamically integrating knowledge points based on the Ebbinghaus forgetting curve.
基于 STM32F103 构建的端到端 AI 智能手表生态。自研“零重定位”原生机器码动态加载引擎与页面栈式 UI 框架;集成生产级 OTA 回滚保护机制与高带宽(921600 baud)串口协议栈。通过 Node.js 中继实现 DeepSeek AI 语义控制及 ASRPRO 语音全双工交互,是一个集成了分布式计算、现代存储管理与 AI Agent 的嵌入式全栈工程。
A Meta-Agent-Driven Self-Evolving Multi-Agent System for UAV Detection and Tracking
One command to run Hermes AI Agent with a browser UI. Zero prerequisites. 一行命令,AI 就位。
网页应用Agent,接入DeepSeek、Mimo等模型