A WhatsApp agent that uses the WhatsApp MCP server to send messages and perform other actions.
# WhatsApp Agent
An agent that interacts with WhatsApp through the WhatsApp MCP server. It can perform multiple actions on the user's behalf.

# 🚀 Features
The agent is connected to the WhatsApp MCP server, which provides the following tools:
- **search_contacts:** Search for contacts by name or phone number
- **list_messages:** Retrieve messages with optional filters and context
- **list_chats:** List available chats with metadata
- **get_chat:** Get information about a specific chat
- **get_direct_chat_by_contact:** Find a direct chat with a specific contact
- **get_contact_chats:** List all chats involving a specific contact
- **get_last_interaction:** Get the most recent message with a contact
- **get_message_context:** Retrieve context around a specific message
- **send_message:** Send a WhatsApp message to a specified phone number or group JID
- **send_file:** Send a file (image, video, raw audio, document) to a specified recipient
- **send_audio_message:** Send an audio file as a WhatsApp voice message (the file must already be an .ogg Opus file; otherwise ffmpeg must be installed)
- **download_media:** Download media from a WhatsApp message and get the local file path
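
The tools above are invoked through MCP, which carries tool calls as JSON-RPC 2.0 requests using the `tools/call` method. As a rough illustration of what a `send_message` call might look like on the wire, here is a minimal sketch. The helper `build_tool_call` is hypothetical, and the argument names `recipient` and `message` are assumptions; check the WhatsApp MCP server's tool schema for the actual parameter names.

```python
import json
from itertools import count

# Monotonically increasing JSON-RPC request ids (illustrative only).
_request_ids = count(1)


def build_tool_call(tool_name: str, arguments: dict) -> dict:
    """Build a JSON-RPC 2.0 request body for the MCP `tools/call` method.

    Hypothetical helper: it only constructs the payload, it does not send it.
    """
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Assumed argument names ("recipient", "message"); verify against the
# server's published tool schema before relying on them.
request = build_tool_call(
    "send_message",
    {"recipient": "1234567890", "message": "Hello from the agent!"},
)
print(json.dumps(request, indent=2))
```

In practice the agent framework and MCP client build and send these requests for you; the sketch is only meant to show how a tool name and its arguments are packaged.
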
# ⚡Quick Start
## Prerequisites
* Python 3.10+
* WhatsApp MCP server
* An OpenAI API key (or compatible service like DeepSeek)
* [uv](https://github.com/astral-sh/uv)
> [!NOTE]
> To set up the WhatsApp MCP server, follow the instructions in its [README](https://github.com/lharries/whatsapp-mcp).
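
Before installing, it can help to confirm that the local tooling matches the prerequisites. The following is a minimal sketch, not part of the project: the function `check_prerequisites` is hypothetical, and it only checks the Python version and whether `uv` is on `PATH`. It does not verify the WhatsApp MCP server or the API key.

```python
import shutil
import sys

MIN_PYTHON = (3, 10)  # minimum version listed in the prerequisites


def check_prerequisites(version=None, which=shutil.which) -> list[str]:
    """Return a list of human-readable problems; an empty list means OK.

    `version` and `which` are injectable so the check can be exercised
    without changing the real environment.
    """
    version = tuple(version or sys.version_info)[:2]
    problems = []
    if version < MIN_PYTHON:
        problems.append(
            f"Python {MIN_PYTHON[0]}.{MIN_PYTHON[1]}+ is required, "
            f"found {version[0]}.{version[1]}"
        )
    if which("uv") is None:
        problems.append("uv was not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print(f"Missing prerequisite: {problem}")
```
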
## Installation
Follow these steps to set up the project on your local machine:
1. **Clone the repository**
```bash
git clone https://github.com/aasherkamal216/whatsapp_agent.git
cd whatsapp_agent
```
2. **Create and activate a virtual environment**
```bash
uv venv
# On macOS/Linux
source .venv/bin/activate
# On Windows
.venv\Scripts\activate
```
3. **Install dependencies**
```bash
uv sync
```
4. **Set up environment variables**