Write browser tests in natural language, built on pytest and allure.
# 🤖 AgentiTest: Agentic Test Automation This framework uses an LLM to automate browser testing through high-level, natural language commands. It's designed to be a flexible and developer-friendly tool for creating robust and maintainable browser tests. ## 🛠️ Tech Stack * **[`browser-use`](https://github.com/browser-use/browser-use)**: Translates natural language into browser actions with an agent. * **[`pytest`](https://github.com/pytest-dev/pytest)**: A framework for structuring and running tests. * **[Allure Report](https://github.com/allure-framework/allure2)**: For creating interactive and detailed test reports.  ## ✨ Key Features * **Natural Language Tests**: Write tests using descriptive, natural language tasks instead of brittle selectors. * **Adaptable Logic**: The core testing logic is designed to be easily adapted for any website. * **Insightful Reports**: Allure reports provide screenshots, agent actions, and AI "thoughts" for each step, giving you a clear view into the test execution. * **Environment Documentation**: Key details about the test environment—such as the OS, Python version, and browser version—are automatically included in the Allure report for better debugging and context. * **Customizable Browser Settings**: The framework allows for detailed configuration of browser settings. For more information, see the [Browser Settings documentation](https://docs.browser-use.com/customize/browser-settings). * **Example Implementation**: `test_community_website.py` provides a practical example of how to write a test suite. ## ⚙️ How It Works The framework's structure is designed for clarity and scalability, separating shared logic from individual tests. It's composed of two main parts: * **A common file (`conftest.py`)**: This central file contains `pytest` fixtures for setting up the browser and the LLM, along with a reusable `run_agent_task` helper function that executes test steps.
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.