Autonomous AI Agent for the JVM: shell, files, google search, runs any LLM generated java code on the jvm itself. JIT compilation and a child-first classloader with ANY classpath. Comes wit a swing GUI , can be "dropped in" into any existing java application. Easy tools easy pojos. IoT devices. Run it standalone for a pure-java AI assistant.
[](https://github.com/sponsors/anahata-os) [](https://central.sonatype.com/artifact/uno.anahata/gemini-java-client) [](https://anahata-os.github.io/gemini-java-client/apidocs/) # gemini-java-client: The Autonomous AI Agent Engine for the JVM **[Website](https://anahata-os.github.io/gemini-java-client/) | [Anahata TV (YouTube)](https://www.youtube.com/@anahata108) | [Discord](https://discord.gg/M396BNtX) | [v2 on its way!](https://github.com/anahata-os/anahata-asi)**  **Stop building chatbots. Start spawning Agents.** The `gemini-java-client` is a pure-Java engine specially engineered to exploit the full power of the **Google Gemini API**. It provides the infrastructure for an AI to inhabit your runtime, introspect your memory, and execute code in-process. It's the first framework that turns your application into a living, breathing host for autonomous agents that don't just suggest code—they **live inside your JVM**. --- ## 🚀 What's New in v1.1.0 - **Enhanced Context Management**: Improved PAYG (Prune-As-You-Go) v2 logic for even more efficient token usage and longer conversations. - **UI Stability & Ergonomics**: Resolved critical "Modal Hang" issues and improved split-pane behavior in the prompter. - **Vector Icon System**: Transitioned to a theme-aware vector icon system for a crisp, professional look on any display. - **Theme Persistence**: Full support for persisting UI themes across sessions. --- ## 🚀 The Killer Advantage: Autonomous JVM Execution While other AI tools are external observers, Anahata is an **insider**. It operates as an autonomous agent within your application's runtime, capable of executing any Java logic with
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.