AI API Pricing Trends 2026

Every major price move across OpenAI, Anthropic, Google, DeepSeek, Mistral, and xAI. Updated Jun 2026.

-75%

GPT-4o price drop
since Jan 2025

-75%

Mistral Large
price drop

-58%

Grok price
drop (3→4.3)

$0.075

Cheapest input
(Gemini Flash Lite)

Biggest Moves This Quarter

These are the price changes that matter most for your budget. If you're still paying old prices, you're overpaying.

GPT-4o

OpenAI

-75%

$10.00 → $2.50 / 1M input

Was the go-to flagship. Now mid-tier pricing with strong performance.

Mistral Large 3

Mistral

-75%

$2.00 → $0.50 / 1M input

Budget-tier pricing with mid-tier capabilities. Excellent value.

DeepSeek V4 Pro

DeepSeek

-75%

$1.75 → $0.44 / 1M input

Now far cheaper than GPT-4o ($2.50 input), though still above GPT-4o mini ($0.15 input). 1M context window.

Grok 4.3

xAI

-58%

$3.00 → $1.25 / 1M input

Rebranded from Grok 3 with a major price cut. Grok Build 0.1 ($0.30) is the budget option.
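
The percentage on each card follows from the before and after input prices. Here is a minimal Python sketch of that arithmetic, using the prices listed on the cards above. The function name percent_change is our own for illustration and is not part of any provider SDK.

```python
def percent_change(old_price: float, new_price: float) -> float:
    """Return the relative change from old_price to new_price, in percent."""
    if old_price <= 0:
        raise ValueError("old_price must be positive")
    return (new_price - old_price) / old_price * 100.0


# Input prices in USD per 1M tokens (old, new), copied from the cards above.
moves = {
    "GPT-4o": (10.00, 2.50),
    "Mistral Large 3": (2.00, 0.50),
    "DeepSeek V4 Pro": (1.75, 0.44),
    "Grok 4.3": (3.00, 1.25),
}

for model, (old, new) in moves.items():
    # A negative result means the price went down.
    print(f"{model}: {percent_change(old, new):+.0f}%")
```

Running the same function on your own invoice rates is a quick way to check whether a reseller has passed a price cut through to you.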

How Much Could You Save?

Select your current model and monthly spend to see exact savings from switching.
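
As a rough illustration of how such a savings estimate can be computed, the sketch below assumes your token volume stays the same after switching and that a fixed share of your tokens are output tokens. The 25% output share, the function names, and the $1,000 monthly spend are assumptions for illustration. The prices come from the table on this page.

```python
# (input, output) prices in USD per 1M tokens, taken from the table on this page.
PRICES = {
    "GPT-4o": (2.50, 10.00),
    "GPT-5": (1.25, 10.00),
    "Gemini 2.0 Flash": (0.10, 0.40),
    "DeepSeek V4 Flash": (0.14, 0.28),
}


def blended_price(model: str, output_share: float) -> float:
    """Average price per 1M tokens when output_share of all tokens are output."""
    input_price, output_price = PRICES[model]
    return (1.0 - output_share) * input_price + output_share * output_price


def annual_savings(current: str, target: str, monthly_spend: float,
                   output_share: float = 0.25) -> float:
    """Estimated yearly savings if the same token volume moves to `target`.

    output_share is an ASSUMED fraction of tokens that are output tokens;
    measure it from your own usage logs for a realistic estimate.
    """
    ratio = blended_price(target, output_share) / blended_price(current, output_share)
    return 12 * monthly_spend * (1.0 - ratio)


# Hypothetical example: $1,000/month on GPT-4o, moving to Gemini 2.0 Flash.
print(f"${annual_savings('GPT-4o', 'Gemini 2.0 Flash', 1000.0):,.2f}")
```

The estimate ignores quality differences, caching discounts, and batch pricing, so treat it as an upper bound on what a switch might save.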

Current Prices at a Glance

All prices are in USD per 1M tokens. Rows are sorted by tier, then by input price, so the first row of each tier has that tier's lowest input price.

Model	Provider	Tier	Input	Output	Context
Claude Opus 4.7	Anthropic	Premium	$5.00	$25.00	1M
GPT-5.5	OpenAI	Premium	$5.00	$30.00	1M
Claude 4 Opus	Anthropic	Premium	$15.00	$75.00	200K
Grok 4.3	xAI	Mid	$1.25	$2.50	1M
GPT-5	OpenAI	Mid	$1.25	$10.00	272K
Gemini 2.5 Pro	Google	Mid	$1.25	$10.00	1M
Gemini 3.1 Pro	Google	Mid	$2.00	$12.00	1M
GPT-4o	OpenAI	Mid	$2.50	$10.00	128K
Claude Sonnet 4.6	Anthropic	Mid	$3.00	$15.00	1M
Gemini 2.0 Flash Lite	Google	Budget	$0.075	$0.30	1M
Llama 3.1 8B	Meta (Together.ai)	Budget	$0.10	$0.10	128K
Gemini 2.0 Flash	Google	Budget	$0.10	$0.40	1M
DeepSeek V4 Flash	DeepSeek	Budget	$0.14	$0.28	1M
GPT-4o mini	OpenAI	Budget	$0.15	$0.60	128K
Mistral Small 4	Mistral	Budget	$0.15	$0.60	128K
DeepSeek V4 Pro	DeepSeek	Budget	$0.44	$0.87	1M
Mistral Large 3	Mistral	Budget	$0.50	$1.50	128K
Claude Haiku 4.5	Anthropic	Budget	$1.00	$5.00	200K
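
Input price alone does not determine what a request costs, because output tokens are billed at a different rate. The sketch below computes per-request cost for a few rows of the table. The Price class, the request_cost helper, and the 2,000-input / 500-output workload are assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens


# A few rows copied from the table above.
PRICES = {
    "Grok 4.3": Price(1.25, 2.50),
    "GPT-5": Price(1.25, 10.00),
    "Gemini 2.5 Pro": Price(1.25, 10.00),
    "Claude Sonnet 4.6": Price(3.00, 15.00),
}


def request_cost(price: Price, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request with the given token counts."""
    return (input_tokens * price.input_per_m
            + output_tokens * price.output_per_m) / 1_000_000


# Hypothetical workload: 2,000 input tokens and 500 output tokens per request.
INPUT_TOKENS, OUTPUT_TOKENS = 2_000, 500

ranked = sorted(
    PRICES,
    key=lambda name: request_cost(PRICES[name], INPUT_TOKENS, OUTPUT_TOKENS),
)
for name in ranked:
    cost = request_cost(PRICES[name], INPUT_TOKENS, OUTPUT_TOKENS)
    print(f"{name}: ${cost:.5f} per request")
```

With this assumed workload, Grok 4.3 comes out at half the per-request cost of GPT-5 despite the identical $1.25 input price, because its output rate is $2.50 rather than $10.00.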

Best Value Right Now

Based on current pricing (May 2026). These are the models that give you the most capability per dollar.

Chatbots & Customer Support

Gemini 2.0 Flash

$0.10 input / $0.40 output — 1M context

Best quality-to-price ratio for conversational AI. Handles multi-turn dialogs and tool use (function calling). With 1M context, most conversation histories and knowledge-base documents fit without chunking.

Code Generation

DeepSeek V4 Pro

$0.44 input / $0.87 output — 1M context

75% cheaper than before. Strong at code completion, refactoring, and debugging. 1M context handles entire codebases.

Complex Reasoning & Analysis

GPT-5

$1.25 input / $10.00 output — 272K context

An OpenAI reasoning model at mid-tier pricing. Handles multi-step analysis, research tasks, and nuanced decision-making.

Budget Batch Processing

Llama 3.1 8B

$0.10 input / $0.10 output — 128K context

Open-source, with the lowest output price in the table above ($0.10/1M). Great for classification, extraction, and structured data tasks at scale.

When to Switch Providers

Use this decision framework to decide if a provider change makes sense for your workload.

You're paying more than $0.50/1M input tokens for a non-reasoning workload

Switch to Gemini 2.0 Flash ($0.10) or DeepSeek V4 Flash ($0.14). You'll save 70–90% with similar quality for most tasks.

You're using GPT-4o and haven't checked pricing since early 2025

GPT-4o dropped to $2.50 (from $10). If you're still paying old rates through a reseller, switch to direct OpenAI billing. Or try GPT-5 at $1.25 for better quality at lower cost.

You're using Grok 3 and the bill shocked you

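Grok 4.3 (formerly Grok 3) is now $1.25/1M input, down from $3.00. Switching to Grok Build 0.1 ($0.30) saves 90% on input relative to the old $3.00 rate. GPT-5 has the same $1.25 input price as Grok 4.3 but a higher output price ($10.00 vs. $2.50), so consider it for quality rather than for further savings.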

You need the cheapest possible API with decent quality

Gemini 2.0 Flash Lite at $0.075/1M input has the lowest input price listed on this page, with GPT-oss 20B close behind at $0.08/1M input. For slightly better quality, Llama 3.1 8B at $0.10 via Together.ai is the best open-source option.

You need long context (500K+) and don't want to pay premium prices

Gemini 2.0 Flash offers 1M context at $0.10/$0.40. DeepSeek V4 Pro offers 1M context at $0.44/$0.87. Both are budget-tier priced with generous context.

You're building a multi-model pipeline

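Use Gemini Flash for routing/classification ($0.10), DeepSeek V4 Pro for code tasks ($0.44), and GPT-5 for complex reasoning ($1.25). Even if every input token passed through all three stages, the combined input price would be $1.79/1M, so input cost stays under $2/1M tokens for most workloads.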
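
In practice a router sends only a fraction of traffic to the more expensive models, so the effective input price of a multi-model pipeline is lower than the sum of its stage prices. A minimal sketch of that calculation follows, assuming the router reads every input token and then forwards a share of them to the code or reasoning model. The function name and the 30% / 10% traffic split are hypothetical, and output tokens are ignored.

```python
# Input prices in USD per 1M tokens, from the pipeline described above.
ROUTER_PRICE = 0.10     # Gemini Flash, routing/classification
CODE_PRICE = 0.44       # DeepSeek V4 Pro, code tasks
REASONING_PRICE = 1.25  # GPT-5, complex reasoning


def pipeline_input_price(code_share: float, reasoning_share: float) -> float:
    """Effective input price per 1M tokens entering the pipeline.

    The router processes every token; code_share and reasoning_share are the
    fractions of tokens it then forwards to the code and reasoning models.
    Any remaining traffic is answered by the router model itself.
    """
    if code_share < 0 or reasoning_share < 0 or code_share + reasoning_share > 1:
        raise ValueError("shares must be non-negative and sum to at most 1")
    return (ROUTER_PRICE
            + code_share * CODE_PRICE
            + reasoning_share * REASONING_PRICE)


# Hypothetical split: 30% of tokens go to code tasks, 10% to reasoning.
print(f"${pipeline_input_price(0.30, 0.10):.3f} per 1M input tokens")

# Worst case under this model: the router forwards everything to GPT-5.
print(f"${pipeline_input_price(0.0, 1.0):.2f} per 1M input tokens")
```

The shares are the main lever here: the more traffic the router can keep away from the reasoning model, the closer the effective price gets to the router's own $0.10.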

How We Got Here

2023

The GPT-4 Era

GPT-4 launched at $30/$60 per 1M tokens. Claude 2 at $24. Only two serious providers. 8K-32K context windows.

Early 2024

Price Wars Begin

Google enters with Gemini. OpenAI cuts GPT-4 to $10/$30. Anthropic launches Claude 3. Context windows hit 128K.

Mid 2024

The Budget Revolution

GPT-4o mini at $0.15/$0.60. Gemini Flash at $0.075/$0.30. Open-source Llama pressures the market. Budget AI becomes viable for production.

Late 2025

Context Explodes, Prices Tank

Google launches 1M context. DeepSeek enters with aggressive pricing. Mistral Large drops 75%. GPT-4o drops 75%. 10 providers now compete.

May 2026

The Current Landscape

Budget models at $0.075/1M input. 1M+ context standard at budget tier. 42 models across 10 providers. Prices 90% lower than 2023. Grok rebranded to 4.3 with a price drop to $1.25.

What to Watch Next

More price cuts coming: Competition between OpenAI, Google, Anthropic, and DeepSeek will keep driving prices down through 2026

Budget models catching up: Gemini Flash and DeepSeek V4 already match 2024 flagship quality at 1/20th the price

Open-source parity: Llama 4 Scout (1M context, $0.18) is closing the gap with proprietary models

Watch for more Grok updates: xAI rebranded Grok 3 to 4.3 with a price drop — more changes may follow


Prices verified against official provider pages.