Autonomous research pipeline using LangGraph with locally-hosted DeepSeek-R1-14B via vLLM. Adaptive planning, web + ArXiv search, gap analysis, LaTeX output with citations.
# Research Agent
A deep research AI system that generates publication-ready LaTeX papers. Built with a self-hosted reasoning model (DeepSeek-R1-Distill-Qwen-14B) and GPT-4o-mini workers for search and extraction.
## Features
- **Deep Research**: Searches web and ArXiv for relevant papers and sources
- **Self-Hosted Brain**: 128K context reasoning model via vLLM
- **Publication-Ready Output**: LaTeX papers with proper BibTeX citations
- **Domain Focus**: AI/ML, Quantum Physics, Astrophysics
- **Resilient**: Circuit breaker protection for external services
- **Observable**: Prometheus-compatible metrics and structured logging
## Architecture
```
User Query
|
v
+------------------+
| FastAPI | API Layer (rate limiting, validation)
+------------------+
|
v
+------------------+
| LangGraph | Orchestration (state machine, checkpointing)
+------------------+
|
+---> Brain (DeepSeek-R1-Distill-14B, self-hosted vLLM)
| - Planning: Creates research strategy
| - Analysis: Evaluates gathered sources
| - Synthesis: Combines findings
| - Review: Quality checks output
|
+---> Workers (GPT-4o-mini API)
- Web search (Tavily API)
- ArXiv search and paper retrieval
- LaTeX section drafting
|
v
+------------------+
| LaTeX Output | Papers with BibTeX citations
+------------------+
```
## Requirements
- Python 3.12+
- Docker and Docker Compose
- NVIDIA GPU with 24GB+ VRAM (for local brain service)
- API keys: OpenAI, Tavily
## Quick Start
### 1. Clone and setup
```bash
git clone <repository-url>
cd research-agent
./scripts/setup.sh
```
### 2. Configure environment
```bash
cp .env.example .env
```
Edit `.env` with your API keys:
```bash
# Required
OPENAI_API_KEY=sk-your-openai-api-key
TAVILY_API_KEY=tvly-your-tavily-api-key
# Brain service (use mock for development)
BRAIN_SERVICE_URL=http://localhost:8001
BRAIN_API_KEY=your-internal-braHAL 分层混合模型工作流 — 强模型(Claude)负责理解/拆解/验收,低成本模型(DeepSeek)负责检索/提取/清洗。Hermes Agent skill。
An LLM agent fine-tuned on DeepSeek for spaced repetition, dynamically integrating knowledge points based on the Ebbinghaus forgetting curve.
基于 STM32F103 构建的端到端 AI 智能手表生态。自研“零重定位”原生机器码动态加载引擎与页面栈式 UI 框架;集成生产级 OTA 回滚保护机制与高带宽(921600 baud)串口协议栈。通过 Node.js 中继实现 DeepSeek AI 语义控制及 ASRPRO 语音全双工交互,是一个集成了分布式计算、现代存储管理与 AI Agent 的嵌入式全栈工程。
A Meta-Agent-Driven Self-Evolving Multi-Agent System for UAV Detection and Tracking
One command to run Hermes AI Agent with a browser UI. Zero prerequisites. 一行命令,AI 就位。
网页应用Agent,接入DeepSeek、Mimo等模型