AI agents - performance comparison using major LLMs.
# Overview   Experiment on AI agent performance using `versionhq` and `pydantic`. **Visit:** - [PyPI](https://pypi.org/project/versionhq/) - [Github](https://github.com/versionHQ/multi-agent-system) - [Docs](https://docs.versi0n.io) - [Process (Medium story)](https://medium.com/@kuriko-iwai/48d42fc57b71) <hr /> ## Key Features Generate multi-agent systems based on the task complexity, execute tasks, and evaluate output based on the given criteria. Agents are model-agnostic, and can handle and share RAG tools, knowledge, memory, and callbacks among other agents. <hr /> ## Quick Start 1. Install `uv` package manager: For MacOS: ``` brew install uv ``` For Ubuntu/Debian: ``` sudo apt-get install uv ``` 2. Install dependencies: ``` uv venv source .venv/bin/activate uv pip install -r requirements.txt ``` * In case of AssertionError/module mismatch, run Python version control using `.pyenv` ``` pyenv install 3.12.8 pyenv global 3.12.8 (optional: `pyenv global system` to get back to the system default ver.) uv python pin 3.12.8 echo 3.12.8 > .python-version ``` 3. Set up environment variables: Create `.env` file in the project root and add the following: ``` OPENAI_API_KEY=your-openai-api-key GEMINI_API_KEY=your-gemini-api-key OPENROUTER_API_KEY=your-openrouter-api-key ``` 4. Run: ``` uv run main.py ``` <hr /> ## Customizing - To add or refine an agent, use `src/agents.py`. - To add or refine a task, use `src/tasks.py`. <hr /> ## Results <img src="https://res.cloudinary.com/dfeirxlea/image/upload/v1738634968/pj_m_test/ulzp0wi0rptq61vkirkq.png"> (Feb 3, 2025) <hr /> ## Trouble Shooting Common issues and solutions: - API key errors: Ensure all API keys in the `.env` file are correct and up to date. Make sure to add `load_dotenv()`
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.