Memory Efficient Coding Agent ( MECA ) vram < 8GiB wukong is the first MECA-class implementation. Primary target: 4–8GB VRAM via ollama.
# wukong > **Memory Efficient Coding Agent (MECA)** — reference implementation. --- ## MECA MECA (Memory Efficient Coding Agent) is a class of coding agent defined by the following properties: 1. **VRAM budget is a first-class constraint** — the agent's tool schema, context management, and task decomposition are designed around a defined VRAM ceiling, not retrofitted to one 2. **Tool schema surface is minimized for reliable small model parsing** — flat, typed, no nested objects 3. **Context is actively managed per step** — token spend is tracked and pruned explicitly, not truncated as a fallback 4. **Task decomposition happens before model invocation** — tasks are broken into subtasks bounded by small model working memory limits before the model sees anything 5. **Reliability is instrumented and published** — benchmark scores per task class are part of the artifact, not anecdotal An agent that supports local models but was not designed around these constraints is not MECA-class — it is a large-model agent with local inference shimmed in. --- ## What Is It ? wukong is the first MECA-class implementation. Primary target: 4–8GB VRAM via ollama. Aider, opencode, goose — all built for large models. Run them on a 7b model and they degrade silently: context assumptions are wrong, tool schemas are too complex for reliable small model parsing, no budget awareness. wukong is built the other way. The 7b model is the design target. Everything follows from that. --- ## Architecture - Justification | Problem | What existing agents do | What wukong does | |---|---|---| | 7b models hallucinate complex tool call formats | Simplify nothing, degrade silently | Flat tool schema with minimal surface area | | Context bloat tanks small model output quality | Truncate or ignore | Active token budget tracking with explicit pruning per step | | Full tasks exceed small model working memory | Pass full task, let model figure it out | Rule-based decomposition into bounded subtasks
HAL 分层混合模型工作流 — 强模型(Claude)负责理解/拆解/验收,低成本模型(DeepSeek)负责检索/提取/清洗。Hermes Agent skill。
An LLM agent fine-tuned on DeepSeek for spaced repetition, dynamically integrating knowledge points based on the Ebbinghaus forgetting curve.
基于 STM32F103 构建的端到端 AI 智能手表生态。自研“零重定位”原生机器码动态加载引擎与页面栈式 UI 框架;集成生产级 OTA 回滚保护机制与高带宽(921600 baud)串口协议栈。通过 Node.js 中继实现 DeepSeek AI 语义控制及 ASRPRO 语音全双工交互,是一个集成了分布式计算、现代存储管理与 AI Agent 的嵌入式全栈工程。
A Meta-Agent-Driven Self-Evolving Multi-Agent System for UAV Detection and Tracking
One command to run Hermes AI Agent with a browser UI. Zero prerequisites. 一行命令,AI 就位。
网页应用Agent,接入DeepSeek、Mimo等模型