Nano Banana, Explained: Google's Gemini Image Models (Pro, 2, and the Original)

Nano Banana is Google's Gemini image model line. What Pro, 2, and the original each do, real per-image pricing, how to use them, and which to pick.

Friday, June 19, 2026Omid Saffari
Nano Banana, Explained: Google's Gemini Image Models (Pro, 2, and the Original)

Nano Banana is the nickname for Google's Gemini image models, and there are now three of them: the original, a flagship Pro, and a faster, cheaper "2." Pick the wrong one and you either overpay four times over or ship a poster with garbled text. Here is exactly what each one is, what it costs per image, and which to reach for.

Nano Banana, in one line (and which one you want)

Nano Banana is not a single product. It is Google's pet name for the image generation and editing models inside Gemini, and three of them are live right now, each a different Gemini model under the hood.

The short version of the verdict: reach for Nano Banana Pro when the image is the deliverable and it has to be right the first time, especially anything with text, charts, or 4K detail. Drop to Nano Banana 2 when you are generating at volume and want most of that quality for roughly half the price. Use the original Nano Banana when you just need a lot of quick images cheaply and 1024 pixels is enough.

If you only remember one thing: the price gap between them is real money at scale. The original costs about four cents an image, Pro costs about thirteen. At ten images that is nothing. At fifty thousand it is the difference between $1,950 and $6,700.

The three Nano Bananas at a glance

Every "Nano Banana" you read about maps to one of these exact Gemini model IDs. The price is per generated image on Google's paid API tier.

VariantGemini model IDBest forMax resolutionStandoutPrice / image
Nano Banana (original)gemini-2.5-flash-imageCheap, high-volume images1024px (1K)Lowest cost, fastest$0.039
Nano Banana 2gemini-3.1-flash-imageVolume work that still needs quality4KSpeed + new 0.5K tier + video-to-image$0.067 (1K) to $0.151 (4K)
Nano Banana Progemini-3-pro-imageFinished assets, text, infographics4KReasoning, best-in-class text, Search grounding$0.134 (1K/2K), $0.24 (4K)

All three put an invisible SynthID watermark on every image, and all are reachable from the Gemini app, Google AI Studio, and the Gemini API.

Nano Banana Pro (Gemini 3 Pro Image): the flagship

Nano Banana Pro is the one most people mean when they say a Nano Banana image "looks real now." Google introduced it on November 20, 2025, and it is built on Gemini 3 Pro, the company's top reasoning model. That lineage is the whole point: instead of pattern-matching pixels, it can reason about what it is drawing before it draws it.

Nano Banana Pro (Gemini 3 Pro Image) model page on the Google DeepMind site
Nano Banana Pro (Gemini 3 Pro Image)

What that buys you in practice is the thing every other image model still fumbles: text inside the image. Ask Pro for a product mockup, a poster, an infographic, or a diagram with real labels, and the words come out legible and correctly spelled, in multiple languages. It will also reach out to Google Search mid-generation to ground an image in real facts, so "a chart of this week's weather" or "the Eiffel Tower in autumn" comes back with plausible geometry instead of a confident hallucination.

It generates up to 4K, and it accepts up to 14 reference images at once: as many as 10 product shots to keep an object consistent, plus up to 4 photos of a person to hold a character's face steady across a whole set. For a founder building a brand, that is the difference between one good hero image and a coherent set of twenty.

The cost is the catch. Pro runs $0.134 per image at 1K or 2K, and $0.24 at 4K. That is the premium tier of the family, and you feel it the moment you generate in bulk.

Nano Banana 2 (Gemini 3.1 Flash Image): the volume workhorse

Nano Banana 2 is Gemini 3.1 Flash Image, the fast, cheaper sibling that now sits between the original and Pro. "Flash" in Google's naming means speed-optimized, and this is the model to use when you are generating a lot and do not want to pay Pro prices for every frame.

Google AI Studio, where you can try the Gemini image models for free
Google AI Studio

It pulls a good chunk of Pro's intelligence down into a faster, leaner package. You still get up to 4K, the improved text rendering, and Search grounding, but at $0.067 per image at 1K, $0.101 at 2K, and $0.151 at 4K, roughly half of Pro for the comparable sizes. It also adds two things Pro does not have: a smaller 512-pixel (0.5K) tier for cheap thumbnails and previews, and a fistful of unusual aspect ratios like 1:4, 4:1, and 1:8 for banners, stories, and tall mobile formats. Its take on the 14-reference-image feature leans more toward style: up to 6 object photos, up to 5 character photos, and up to 3 images used purely as style references.

The honest limit: on the hardest prompts, the dense infographic, the long paragraph of in-image text, the intricate multi-subject scene, Pro's extra reasoning still pulls ahead. Flash is the workhorse, not the showpiece.

The original Nano Banana (Gemini 2.5 Flash Image): still the cheapest

The original Nano Banana is Gemini 2.5 Flash Image, the model that made the name go viral in 2025 for restoring old photos and spinning up little figurines. It is still here, still the cheapest of the three, and still the right call for one job: high volume where you do not need the new tricks.

At $0.039 per image it undercuts everything else Google offers, but it caps out at 1024 pixels and predates the reasoning, the best text rendering, and the Search grounding. If you are batch-generating simple images, avatars, quick variations, placeholder art, and 1K is fine, the original still earns its place on cost alone. The moment you need text in the image, higher resolution, or factual accuracy, you have outgrown it.

What actually makes the Gemini 3 image models different

The jump from the original to the Gemini 3 generation (Pro and 2) is not just sharper pixels. Four specific capabilities change what you can actually use these for, and each one quietly fixes a thing that used to send you back to a designer.

Text that is actually readable. Image models have always mangled words, which is why "make me a poster" never quite worked. The Gemini 3 image models render legible, stylized text in multiple languages, so mockups, ads, slides, and infographics come out usable instead of as gibberish you have to retype in Photoshop.

Grounding in Google Search. The model can call Search as a tool while it generates. Ask for something factual, a map, a labeled diagram, a "current" scene, and it pulls real information in rather than inventing it. For an operator making an explainer graphic, that is the gap between a draft and a slide you can present.

Thinking mode. Before generating, the model runs a reasoning pass over your prompt. In plain terms, it plans the image instead of free-associating, which is why complex, multi-part instructions ("a three-panel layout with these labels in this order") land more reliably.

Up to 14 reference images. You can feed it as many as 14 images to steer the result: objects to keep consistent, faces to hold steady, styles to imitate. This is the feature that turns a one-off generator into a brand tool. A solo founder can lock a product's look and a spokesperson's face across an entire campaign without a photoshoot.

How to actually use it

You do not need to touch code to use Nano Banana, but the right door depends on what you are doing. Here are the three real ones.

The no-code path. Open the Gemini app, describe the image, and it routes to a Nano Banana model for you. Best for one-off images, quick edits, and trying the quality before you commit. Free tier included.

If you are building it into a product, the call itself is short. Pick the model ID for the price/quality you want and generate:

Python
from google import genai

client = genai.Client()

response = client.models.generate_content(
    model="gemini-3.1-flash-image",  # Nano Banana 2; swap for gemini-3-pro-image
    contents=["A clean product mockup of a matte ceramic water bottle, "
              "label reading 'STILL', soft studio lighting"],
)

for part in response.parts:
    if part.inline_data is not None:
        part.as_image().save("out.png")

Swap gemini-3.1-flash-image for gemini-3-pro-image to move up to the flagship, or gemini-2.5-flash-image to drop to the cheapest tier. The resolution is a parameter too, accepting 512, 1K, 2K, or 4K, and that choice is what drives your per-image cost.

The cost math, and where it falls short

The pricing looks tiny until you multiply. Image generation is billed per image, and the three tiers are genuinely far apart, so the model you default to is a real line item once you are at any scale.

Beyond cost, three honest limits. First, the SynthID watermark is mandatory, as covered above. Second, Nano Banana is not the aesthetic leader for every style: for painterly, illustrative, or highly art-directed work, tools like Midjourney still have a look many designers prefer, and rivals like Seedream compete hard on photorealism. Nano Banana's edge is reasoning, text, and grounding, not necessarily the prettiest single render. Third, availability and features move fast and vary by surface and region, so a capability you see in AI Studio may lag in the app or a specific country.

For where Nano Banana sits against the rest of the field, see the wider roundup of the best AI image generators, and for a close look at its toughest photorealism rival, the Seedream review.

Which Nano Banana should you use?

Match your situation to the model. The decision rule underneath the whole table: let the cheap tiers explore, and only pay Pro for what ships.

Your situationUseWhy
Ad creative, packaging, anything with textNano Banana ProBest text rendering + 4K, mistakes are expensive
Pitch-deck diagrams, factual infographicsNano Banana ProReasoning + Search grounding keep it accurate
Generating dozens of options to cullNano Banana 2Half the price, most of the quality
Social formats, banners, storiesNano Banana 2The unusual aspect ratios (1:4, 4:1, 1:8)
Brand set with a consistent product or facePro (or 2)Up to 14 reference images
High-volume simple images, 1K is fineOriginal Nano BananaCheapest at $0.039, fastest
Images must not look AI-generatedNone of themMandatory SynthID watermark
Is Nano Banana free?

Yes, you can use it for free in the Gemini app and in Google AI Studio's free tier. The per-image prices in this article are the paid Gemini API rates you pay when you build it into your own product or generate at volume.

What is the difference between Nano Banana and Nano Banana Pro?

The original Nano Banana is Gemini 2.5 Flash Image at $0.039 per image, capped at 1024 pixels. Nano Banana Pro is Gemini 3 Pro Image, built on Gemini 3 Pro, with up to 4K, the best in-image text, reasoning, and Search grounding, at $0.134 per image and up.

Is Nano Banana the same as Gemini?

It is the image model inside Gemini. "Nano Banana" is Google's nickname; each variant maps to a specific Gemini image model ID (gemini-2.5-flash-image, gemini-3.1-flash-image, gemini-3-pro-image).

Does Nano Banana add a watermark?

Yes. Every image from every Nano Banana model carries an invisible SynthID watermark for provenance, and you cannot turn it off.

What is Nano Banana 2?

Nano Banana 2 is Gemini 3.1 Flash Image, the faster, cheaper sibling of Pro. It runs $0.067 to $0.151 per image depending on resolution, adds a 512-pixel tier and new aspect ratios, and is the right default for high-volume work.

Last Updated

Jun 19, 2026

CategoryAI
Newsletter

One letter, every Sunday. Working systems, not hot takes.

Build logs, working systems, and field notes from running a portfolio of AI ventures. Sent weekly, never more.

Weekly. No spam. Unsubscribe anytime.