The Pydantic Gemini Processor to be used with Gemini's genai-processors
# genai-processors-pydantic [](https://pypi.org/project/genai-processors-pydantic/) [](https://github.com/mbeacom/genai-processors-pydantic/actions/workflows/validate.yml) [](https://codecov.io/github/mbeacom/pydantic-gemini-processor) [](LICENSE) A Pydantic validator processor for Google's [genai-processors](https://github.com/google-gemini/genai-processors) framework. **Note:** This is an independent contrib processor that extends the genai-processors ecosystem. ## ⚠️ Important: Current Limitations & Roadmap This processor was developed based on feedback from the genai-processors maintainers. While functional and tested, it has known limitations in certain scenarios. See [MAINTAINER_FEEDBACK.md](MAINTAINER_FEEDBACK.md) for detailed analysis and our roadmap to address these challenges: * **Streaming**: Currently works best with complete JSON in single Parts * **Tool Integration**: Planned support for `genai_types.ToolResponse` Parts * **Multi-Model Validation**: Single-model design; multi-model support planned * **MIME Type Independence**: ✅ Already handles unmarked JSON Parts We're committed to addressing these limitations while maintaining a stable API. ## PydanticValidator The PydanticValidator is a PartProcessor that validates the JSON content of a ProcessorPart against a specified [Pydantic](https://docs.pydantic.dev/latest/) model. It provides a simple, declarative way to enforce data schemas and improve the robustness of your AI pipelines. ## Motivation In many AI applications, processors ingest data from external sources like user inputs or API calls. This data can be unpredictable or malformed. T
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.