What are the new Gemini models released in July 2026?

Google released 3 new models: Gemini 3.1 Flash-Lite ($0.25/$1.50 per 1M tokens, 1M context), Gemini 3 Flash ($0.50/$3.00 per 1M tokens, 1M context), and Gemini 2.5 Flash-Lite ($0.10/$0.40 per 1M tokens, 1M context). All three have 1M token context windows.

Which new Gemini model is cheapest?

Gemini 2.5 Flash-Lite is the cheapest at $0.10/$0.40 per 1M tokens — matching the deprecated Gemini 2.0 Flash pricing but with a 1M context window. Gemini 3.1 Flash-Lite at $0.25/$1.50 is the best budget pick for production workloads.

Are the old Gemini 2.0 models deprecated?

Yes. Google has marked Gemini 2.0 Flash and Gemini 2.5 Flash-Lite as deprecated. Gemini 2.0 Flash is replaced by Gemini 3 Flash ($0.50/$3.00) and Gemini 2.5 Flash-Lite is replaced by Gemini 3.1 Flash-Lite ($0.25/$1.50). Both new models have larger 1M context windows.

3 New Gemini Models: Flash-Lite, 3 Flash & 2.5 Flash-Lite

How These Compare to the Competition

Here's how the new Gemini budget models stack up against other cheap options:

Gemini 2.5 Flash-Lite vs DeepSeek V4 Flash: Gemini is cheaper on input ($0.10 vs $0.14) but more expensive on output ($0.40 vs $0.28). DeepSeek wins on output-heavy tasks.
Gemini 3.1 Flash-Lite vs Claude Haiku 4.5: Gemini is 75% cheaper on input ($0.25 vs $1.00) and 70% cheaper on output ($1.50 vs $5.00). Haiku has better reasoning, but for most tasks Gemini is the better value.
Gemini 3 Flash vs GPT-4o mini: GPT-4o mini is cheaper at $0.15/$0.60 vs Gemini 3 Flash at $0.50/$3.00. But Gemini 3 Flash has 1M context vs GPT-4o mini's 128K — 8x more context for 3x the price.
All 3 vs Mistral Small 4: Mistral Small 4 at $0.15/$0.60 is cheaper on input than Gemini 3.1 Flash-Lite ($0.25/$1.50) but has only 128K context vs 1M. For long documents, the new Gemini models win.

Which One Should You Use?

Absolute cheapest

Gemini 2.5 Flash-Lite — $0.10/$0.40, can't beat it

Best quality per dollar

Gemini 3.1 Flash-Lite — $0.25/$1.50 with 1M context

Speed + quality balance

Gemini 3 Flash — $0.50/$3.00, fastest inference

Need 1M context on a budget

Any of these — all have 1M windows

The Bigger Picture

These 3 new models push the APIpulse database to 85 models across 10 providers. The Gemini lineup now covers every price point from $0.10 to $12.50 per 1M input tokens — the widest range of any single provider.

The trend is clear: 1M context windows are becoming the standard, even at budget price points. If you're still using models with 128K context, you're paying for less capability than you could get.

With Claude 4 retired on June 15, now is the perfect time to evaluate these new Gemini options. Use our deprecation calculator to see how much you'd save by switching.

Compare all 85 models side by side

Open Cost Calculator

Pricing data verified Jul 9, 2026. Prices per 1M tokens. See our full pricing table for all 85 models.