agenteval — Cursor Agents | Neura Market
    Neura MarketNeura Market/Cursor
    ChatGPTChatGPTClaudeClaudeGeminiGeminiCursorCursorGrokGrokPerplexityPerplexityDeepSeekDeepSeek
    CoPilotCoPilotStable DiffusionStable DiffusionMidjourneyMidjourney
    View All Directories
    OverviewRulesPromptsMCPsAgentsBlogVideosGuidesCoursesCommunityExtensionsTrendingGenerate
    CursorAgentsagenteval
    Back to Agents
    agenteval

    agenteval

    lukasmetzler April 1, 2026
    4 copies 0 downloads

    Lint, benchmark, and score your AI coding instructions. Stop guessing, start measuring.

    Agent Definition
    # agenteval
    
    Your CLAUDE.md is untested. So is your AGENTS.md, your cursor-instructions.md, and your .cursorrules.
    
    agenteval is a linter, benchmarker, and CI gate for AI coding instructions. It finds dead references, token bloat, contradictions, and stale instructions before your agent does. Then it scores agent performance so you can measure whether your instruction changes actually help.
    
    [![CI](https://github.com/lukasmetzler/agenteval/actions/workflows/ci.yml/badge.svg)](https://github.com/lukasmetzler/agenteval/actions/workflows/ci.yml)
    [![npm](https://img.shields.io/npm/v/agenteval-cli?label=npm)](https://www.npmjs.com/package/agenteval-cli)
    [![npm downloads](https://img.shields.io/npm/dm/agenteval-cli)](https://www.npmjs.com/package/agenteval-cli)
    [![Version](https://img.shields.io/github/v/release/lukasmetzler/agenteval?label=release)](https://github.com/lukasmetzler/agenteval/releases)
    [![License: MIT](https://img.shields.io/badge/license-MIT-blue.svg)](LICENSE)
    
    ![agenteval demo](demo/demo.gif)
    
    ## Install
    
    ```bash
    npm install -g agenteval-cli
    ```
    
    Or pick your preferred method:
    
    ```bash
    brew tap lukasmetzler/agenteval && brew install agenteval   # Homebrew
    curl -fsSL https://raw.githubusercontent.com/lukasmetzler/agenteval/main/install.sh | bash  # Shell
    ```
    
    No Bun, no Node at runtime. The binary is self-contained.
    
    ## Quick Start
    
    ```bash
    agenteval lint                    # Find problems in your instruction files
    agenteval lint --explain          # Same, with explanations for each rule
    agenteval harvest --dry-run       # Preview what AI commits are in your history
    agenteval ci                      # Run all tasks, fail on regressions
    ```
    
    ## What It Catches
    
    - Dead references to files, paths, and headings that don't exist
    - Filler phrases that waste context tokens ("make sure to", "it is important that")
    - Contradictions between instruction files ("always use X" and "never use X")
    - Content overlap and duplication across files
    - Token budget overruns 

    Tags

    agentsaibenchmarkbunclaudeclicode-qualitycopilotcursordeveloper

    Comments

    More Agents

    View all
    documentation

    Documentation & Onboarding Agent

    Agent that generates comprehensive documentation, API references, architecture diagrams, and developer onboarding guides from existing code.

    C
    Community
    debugging

    Cursor Bug Triage Agent

    Agent configuration for systematic bug investigation that traces issues from error logs through the codebase to root cause with suggested fixes.

    C
    Community
    api

    API Integration Agent

    Agent for integrating third-party APIs including SDK setup, type generation, error handling, retry logic, and rate limit management.

    C
    Community
    coding

    Cursor Agent Mode

    Cursor's built-in autonomous coding agent that can make multi-file edits, run terminal commands, search the codebase, and iteratively build features with minimal human intervention.

    C
    Cursor Team
    cloud

    Cursor Background Agent

    Cloud-based autonomous coding agent that runs in the background on remote sandboxed environments, handling complex multi-step tasks while you continue working.

    C
    Cursor Team
    composer

    Cursor Composer Agent

    Cursor's multi-file editing agent within Composer mode that can create, edit, and delete files across your entire project in a single conversation.

    C
    Cursor Team

    Stay up to date

    Get the latest Cursor prompts, rules, and resources delivered to your inbox weekly.

    Neura Market LogoNeura Market

    Discover the best AI prompts, plugins, and resources for Cursor and more.

    Content Types

    • Rules
    • Prompts
    • MCPs
    • Agents
    • Guides

    Platforms

    • ChatGPT Directory
    • Claude Directory
    • Gemini Directory
    • Cursor Directory
    • Grok Directory
    • Perplexity Directory
    • DeepSeek Directory
    • CoPilot Directory
    • Stable Diffusion Directory
    • Midjourney Directory
    • All Directories

    Resources

    • Blog
    • Documentation
    • Help Center
    • Marketplace

    Legal

    • Privacy Policy
    • Terms of Service

    © 2026 Neura Market. All rights reserved.

    |

    Not affiliated with any AI platform vendors.