Autonomous AI experiment loop CLI -- run research overnight with any coding agent
<div align="center"> # Autoresearch CLI **Run autonomous AI experiments while you sleep. Wake up to results.** <br /> [](https://github.com/199-biotechnologies/autoresearch-cli/stargazers) [](https://x.com/longevityboris) <br /> [](https://crates.io/crates/autoresearch) [](https://crates.io/crates/autoresearch) [](LICENSE) [](https://github.com/199-biotechnologies/autoresearch-cli/actions) --- A single Rust binary that turns any AI coding agent into an autonomous research machine. Define one file to modify, one metric to optimize, and one eval command. Your agent handles the rest -- running experiments, tracking results, keeping winners, reverting losers. You sleep. It works. [Install](#install) | [How It Works](#how-it-works) | [Features](#features) | [Contributing](#contributing) </div> ## Why Autonomous Research Matters Karpathy's [autoresearch](https://github.com/karpathy/autoresearch) ran 126 ML experiments overnight on a single GPU. Since then, people have applied the same pattern to [chess engines](https://x.com/MindCanvasx/status/2035817965614940472) (expert to grandmaster), [Bitcoin modeling](https://x.com/xmal/status/2035735100516634805) (halved prediction errors), [Sudoku solvers](https
Agent that generates comprehensive documentation, API references, architecture diagrams, and developer onboarding guides from existing code.
Agent configuration for systematic bug investigation that traces issues from error logs through the codebase to root cause with suggested fixes.
Agent for integrating third-party APIs including SDK setup, type generation, error handling, retry logic, and rate limit management.
Cursor's built-in autonomous coding agent that can make multi-file edits, run terminal commands, search the codebase, and iteratively build features with minimal human intervention.
Cloud-based autonomous coding agent that runs in the background on remote sandboxed environments, handling complex multi-step tasks while you continue working.
Cursor's multi-file editing agent within Composer mode that can create, edit, and delete files across your entire project in a single conversation.