Nano Banana Pro (Gemini Image Gen)
Generate or edit images via Gemini 3 Pro Image. Supports text-to-image, image editing, and multi-image composition.
Tags: image-generation, ai, gemini, creative
Category: AI
Use Cases
- Quick image generation for social media posts and blog headers
- Iterative image editing with natural language
- Creating presentation slide visuals and mockups
- Generating product photos and marketing materials
- Text-heavy image generation (infographics, signs, UI screenshots)
- Style transfer: apply one image's style to another
Tips
- Use Gemini's conversational nature: generate an image, then refine with 'make the colors warmer'
- For text-in-images (signs, UI mockups), Gemini significantly outperforms DALL-E 3
- Pass reference images alongside text prompts for style matching or editing
- Use the free tier for prototyping, then switch to paid for production batch generation
- Combine with the nano-pdf skill to generate and insert images into presentations
- For best quality, use specific detailed prompts rather than vague descriptions
- Multi-image composition works well for before/after comparisons and collages
Known Issues & Gotchas
- Requires GEMINI_API_KEY env var — get one from ai.google.dev
- Image generation is subject to Google's safety filters — some prompts will be blocked without clear explanation
- Free tier has rate limits: 60 requests/min and 1,000 requests/day
- Earlier model versions (Gemini 2.0 Flash, 3 Pro Preview) have been deprecated — check current model availability
- Generated images may have artifacts with complex scenes — iterate with follow-up prompts
- The skill uses `uv` to manage Python dependencies — install uv first via Homebrew
- Image output quality varies significantly between model versions — 2.5 Flash Image is the sweet spot
Alternatives
- OpenAI Image Gen (DALL-E 3 / GPT Image)
- Stable Diffusion (local)
- Midjourney
- Flux (Black Forest Labs)
Community Feedback
Google improves Gemini AI image editing with 'nano banana' model. Gemini 2.5 Flash Image is currently atop LMArena's image-editing leaderboard.
— Ars Technica
Google's new AI image model gives users finer control over editing photos, a step meant to catch up with OpenAI's GPT Image capabilities.
— TechCrunch
We have suspended all use of the API for the time being and would like more information about rate limits and content filtering on image generation.
— Reddit r/GeminiAI
Gemini 2.0 Flash is making waves with its groundbreaking native image generation. This 'workhorse' AI now crafts and edits visuals directly from text prompts in a single API call.
— Medium
Configuration Examples
Setup and basic generation
# Install uv
brew install uv
# Set API key
export GEMINI_API_KEY="your-key-from-ai.google.dev"
# Generate an image
uv run nano-banana-pro "A minimalist logo for a coffee shop in warm earth tones"Edit an existing image
# Edit with reference image
uv run nano-banana-pro --image photo.jpg "Remove the background and replace with a gradient sunset"Multi-image composition
# Combine multiple images
uv run nano-banana-pro --image logo.png --image bg.jpg "Place the logo centered on the background with a subtle drop shadow"Installation
brew install uvHomepage: https://ai.google.dev/
Source: bundled