AI agent that can book rides, order food, post tweets, and control basic tasks on the device.
# Aiboo – AI Agent for Android

**Aiboo** is a Kotlin-based Android AI agent that understands natural language commands and performs tasks on your device. It is powered by **Google Gemini** and uses **[Prexocore](https://github.com/binarybeam/Prexocore)** to simplify Android development and reduce boilerplate code.

[Download APK](https://github.com/binarybeam/Aiboo/releases/download/1.0.0/aiboo.apk)

> **Prexocore** is a utility library for Android that powers Aiboo's background operations, including permission handling, view interactions, file operations, and system actions.

---

## Key Features

Aiboo can perform these tasks in the background after understanding your query:

* Order food
* Book rides
* Post on Twitter/X
* Call or text contacts on different platforms
* Launch apps or websites
* Capture photos and screenshots
* Perform quick actions (flashlight, volume, brightness, vibrate, etc.)
* Set alarms and reminders
* Delete, copy, move, and rename files
* Analyse data like photos, contacts, SMS, transactions, call history, files, or calendar events (with consent)

> All tasks are executed in the background automatically once Aiboo understands the command.

---

## Architecture

* **Input:** Voice or text query
* **Agent Layer:** Aiboo interprets your intent and extracts the required details
* **Task Execution:** The action is performed on the device silently in the background
* **Feedback:** Only the `message` (a short status) is shown to the user, with Lottie animations
* **Utility Backbone:** [**Prexocore**](https://github.com/binarybeam/Prexocore) handles:
  * Permission requests & checks
  * View operations (show/hide/focus)
  * File IO & screen capture
  * Navigation & alerts
  * Background task helpers like `after()` and speech synthesis

---

## Installation & Setup

### Requirements

* Android Studio (latest)
* Android device or emulator (API 24+)
* Google Gemini API key from [AI Studio](https://aistudio.google.com)

### Steps

1. Clone the repository: ```bash gi
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.