Agents

2 agents available in the Gemini directory

Pre-built AI agents with specialized instructions for specific tasks — from coding and writing to research and analysis. Each agent is ready to deploy with a single click.

agent-benchmark

AgentBench-Live

The open benchmark for AI agent task execution. Claude Code vs Gemini CLI — who wins? Live leaderboard inside.

jackjin1997

agent-evaluation

web-search-agent-evals

Extensible benchmarking suite for evaluating AI coding agents on web search tasks. Compares native search against MCP servers (currently You.com, with more planned) across multiple agents (currently Claude Code, Gemini, Droid, and Codex, with more planned), using automated Docker workflows and statistical analysis.

youdotcom-oss