AI agent that turns your rough ideas into perfect image generation prompts. 7-component formula, 70+ creative techniques, 9 domain modes.
# Gemini Banana

**You tell it what picture you want. It writes the perfect instructions to make that picture.**

That's it. That's what Gemini Banana does.

---

## The Problem

You want AI to make you an image. You type "a cool sunset." You get... something okay. But not what you imagined.

**Why?** Because AI image generators need very specific instructions. The more detail you give them (lighting, camera angle, colors, mood, textures), the better the result. But writing those detailed instructions is hard and takes practice.

## The Solution

Gemini Banana is your **creative assistant**. You describe what you want in plain words, and it:

1. **Asks you a few quick questions** to understand your vision
2. **Builds a detailed, optimized prompt** using a proven formula
3. **Gives you a ready-to-copy prompt** that gets amazing results from image generators

Think of it like having a professional photographer in your pocket who translates "I want a cool sunset" into a paragraph of perfect instructions that the AI actually understands.

---

## How to Use It

### Step 1: Get Claude Code

Install [Claude Code](https://docs.anthropic.com/en/docs/claude-code) (Anthropic's command-line tool).

### Step 2: Download This Project

```bash
git clone https://github.com/Hainrixz/gemini-banana.git
cd gemini-banana
```

### Step 3: Start Gemini and Ask

```bash
gemini
```

Then just describe what you want in plain words:

```
"Help me create a prompt for a cozy coffee shop with warm lighting and a cat sleeping on some books"
```

Gemini automatically picks up the prompt-architect agent and knowledge base from this project. It will ask you a couple of questions (like "What's the mood?" or "Where will you use this image?"), and then give you a perfect prompt ready to paste into any image generator.

---

## What Makes It Special

### The 7-Piece Formula

Every prompt Gemini Banana creates has 7 carefully balanced ingredients:

| Ingredient | What It Does | Example |
|-----------|-
Google's AI-powered research notebook that ingests your documents and becomes an expert on your content. Generates audio overviews, study guides, FAQs, and interactive discussions from uploaded sources.
Google DeepMind's experimental AI agent that can navigate websites, fill forms, and complete multi-step browser tasks autonomously. Uses Gemini's multimodal understanding to interact with web interfaces.
Google DeepMind's universal AI assistant prototype that can see, hear, and respond in real-time through your device camera and microphone. Demonstrates the future of multimodal AI interaction.
Google Cloud's enterprise platform for building, deploying, and managing AI agents powered by Gemini. Supports multi-agent orchestration, tool integration, and enterprise governance.
Gemini's agentic research capability that autonomously browses the web, synthesizes information from dozens of sources, and produces comprehensive research reports on any topic.
Interactive coding and content creation agent that generates, previews, and iterates on code, documents, and interactive applications in a side panel. Supports HTML/CSS/JS, Python, and more.