A URL Fetch Gemini Processor to be used with Gemini's genai-processors
# genai-processors-url-fetch [](https://pypi.org/project/genai-processors-url-fetch/) [](https://github.com/mbeacom/genai-processors-url-fetch/actions/workflows/validate.yml) [](https://codecov.io/github/mbeacom/genai-processors-url-fetch) [](LICENSE) A URL Fetch Processor for Google's genai-processors framework that detects URLs in text, fetches their content concurrently, and yields new ProcessorParts containing the page content. ## UrlFetchProcessor The UrlFetchProcessor is a PartProcessor that detects URLs in incoming text parts, fetches their content concurrently, and yields new ProcessorParts containing the page content. It is a powerful and secure tool for enabling AI agents to access and process information from the web. ### Motivation Many advanced AI applications, especially those involving Retrieval-Augmented Generation (RAG) or agentic behavior, need to interact with the outside world. This processor provides the fundamental capability of "reading" a webpage. * **Enables RAG:** Fetches the content of source URLs so an LLM can use up-to-date information to answer questions. * **Automates Research:** Allows an agent to follow links to gather context for a research task. * **Simplifies Tooling:** Abstracts away the complexities of asynchronous HTTP requests, rate-limiting, security validation, and HTML parsing. ### Installation Install the package using pip: ```bash pip install genai-processors-url-fetch ``` For enhanced content processing with markitdown support: ```bash pip install genai-processors-url-fetch[markitdown] ``` Or using uv (recommended): ```bash uv add genai-processors-url-fet
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.