Cheapest AI API in July 2026: Every Model Ranked by Cost
Ranked list of every AI API by cost. Find the cheapest model for your use case. Input prices run from $0.075 to $30 per 1M tokens, with a full ranking and an interactive calculator.
AI API pricing spans a massive range in July 2026. The cheapest model costs $0.075 per 1M input tokens, while the most expensive charges $30 per 1M input tokens and $180 per 1M output tokens. That is a 400x spread on input price alone, and 2,400x between the cheapest input price and the most expensive output price. Choosing the right model for your workload is the single biggest cost lever for any AI application. This guide ranks every model by cost and helps you find the cheapest option for your specific use case.
Top 10 Cheapest AI Models (Ranked by Input Cost)
These are the cheapest models available today, ranked by input token cost. All prices are per 1M tokens.
| # | Model | Input / 1M | Output / 1M | Provider |
|---|---|---|---|---|
| 1 | Gemini 2.5 Flash-Lite | $0.075 | $0.30 | Google |
| 2 | Mistral Small 4 | $0.10 | $0.30 | Mistral |
| 3 | Gemini 2.5 Flash-Lite | $0.10 | $0.40 | Google |
| 4 | DeepSeek V4 Flash | $0.14 | $0.28 | DeepSeek |
| 5 | GPT-4o mini | $0.15 | $0.60 | OpenAI |
| 6 | Llama 4 Scout | $0.18 | $0.59 | Meta |
| 7 | Llama 4 Maverick | $0.20 | $0.60 | Meta |
| 8 | GPT-5 mini | $0.25 | $2.00 | OpenAI |
| 9 | DeepSeek V4 Pro | $0.435 | $0.87 | DeepSeek |
| 10 | Mistral Large 3 | $0.50 | $1.50 | Mistral |
Gemini 2.5 Flash-Lite is the cheapest API overall at $0.075/1M input tokens. But cheapest input does not always mean cheapest total cost: DeepSeek V4 Flash at $0.14/$0.28 has cheaper output tokens, so it comes out ahead once a workload generates more than about 3.25 output tokens per input token. GPT-5 mini at $0.25/$2 costs more than either, but it is the cheapest model here with strong general-purpose reasoning.
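The trade-off between cheap input and cheap output can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, the prices are copied from the table above, and it assumes plain list pricing with no caching or batch discounts.

```python
# (input, output) list prices in USD per 1M tokens, copied from the table above.
PRICES = {
    "Gemini 2.5 Flash-Lite": (0.075, 0.30),
    "DeepSeek V4 Flash": (0.14, 0.28),
}


def request_cost(model, input_tokens, output_tokens):
    """Cost in USD of one request at list price (no caching or discounts assumed)."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def break_even_output_ratio(cheap_input_model, cheap_output_model):
    """Output-to-input token ratio above which cheap_output_model is cheaper overall."""
    in_a, out_a = PRICES[cheap_input_model]
    in_b, out_b = PRICES[cheap_output_model]
    # Solve in_a*I + out_a*O = in_b*I + out_b*O for O/I.
    return (in_b - in_a) / (out_a - out_b)


ratio = break_even_output_ratio("Gemini 2.5 Flash-Lite", "DeepSeek V4 Flash")
print(f"DeepSeek V4 Flash wins once output exceeds {ratio:.2f}x the input tokens")
```

Below that ratio (for example, 1,000 input tokens and 3,000 output tokens per request), Gemini 2.5 Flash-Lite stays cheaper. Above it, DeepSeek V4 Flash wins.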
Top 10 by Output Cost (Cheapest to Generate)
Output tokens cost more than input tokens: for the models ranked here, the output price is 2-8x the input price. For workloads that generate long responses (chatbots, code generation, content writing), output cost matters more than input cost.
| # | Model | Output / 1M | Input / 1M | Provider |
|---|---|---|---|---|
| 1 | DeepSeek V4 Flash | $0.28 | $0.14 | DeepSeek |
| 2 | Mistral Small 4 | $0.30 | $0.10 | Mistral |
| 3 | Gemini 2.5 Flash-Lite | $0.30 | $0.075 | Google |
| 4 | Gemini 2.5 Flash-Lite | $0.40 | $0.10 | Google |
| 5 | Llama 4 Scout | $0.59 | $0.18 | Meta |
| 6 | GPT-4o mini | $0.60 | $0.15 | OpenAI |
| 7 | Llama 4 Maverick | $0.60 | $0.20 | Meta |
| 8 | DeepSeek V4 Pro | $0.87 | $0.435 | DeepSeek |
| 9 | Mistral Large 3 | $1.50 | $0.50 | Mistral |
| 10 | GPT-5 mini | $2.00 | $0.25 | OpenAI |
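As a rough check on the ranking above, the following sketch re-sorts the ten budget models by output price and computes how many times more each one charges for output than for input. The variable names are illustrative. The prices are copied from the tables in this guide, and the two Gemini 2.5 Flash-Lite rows are kept as separate entries because the tables list them separately.

```python
# (model, input price, output price) in USD per 1M tokens, in input-cost order.
MODELS = [
    ("Gemini 2.5 Flash-Lite", 0.075, 0.30),
    ("Mistral Small 4", 0.10, 0.30),
    ("Gemini 2.5 Flash-Lite", 0.10, 0.40),
    ("DeepSeek V4 Flash", 0.14, 0.28),
    ("GPT-4o mini", 0.15, 0.60),
    ("Llama 4 Scout", 0.18, 0.59),
    ("Llama 4 Maverick", 0.20, 0.60),
    ("GPT-5 mini", 0.25, 2.00),
    ("DeepSeek V4 Pro", 0.435, 0.87),
    ("Mistral Large 3", 0.50, 1.50),
]

# sorted() is stable, so models with equal output prices keep their input-cost order.
by_output = sorted(MODELS, key=lambda m: m[2])

# Output price expressed as a multiple of the input price.
multiples = [(name, out / inp) for name, inp, out in MODELS]
lowest = min(mult for _, mult in multiples)
highest = max(mult for _, mult in multiples)

for rank, (name, inp, out) in enumerate(by_output, start=1):
    print(f"{rank:>2}. {name:<22} output ${out:.2f}  input ${inp:.3f}  ({out / inp:.1f}x)")
print(f"Output costs between {lowest:.1f}x and {highest:.1f}x the input price")
```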
Budget Tier: Models Under $1/1M Input
These models are ideal for high-volume, low-complexity tasks: classification, extraction, simple Q&A, FAQ chatbots, data labeling, and content moderation. They handle 80% of typical API workloads at a fraction of the cost.
| Model | Input / 1M | Output / 1M | Best For |
|---|---|---|---|
| Gemini 2.5 Flash-Lite | $0.075 | $0.30 | Simple classification, extraction |
| Mistral Small 4 | $0.10 | $0.30 | General budget tasks |
| DeepSeek V4 Flash | $0.14 | $0.28 | High-volume generation |
| GPT-4o mini | $0.15 | $0.60 | OpenAI ecosystem integration |
| Llama 4 Scout | $0.18 | $0.59 | Self-hosted, on-prem |
| GPT-5 mini | $0.25 | $2.00 | Smart budget with reasoning |
| DeepSeek V4 Pro | $0.435 | $0.87 | Budget code generation |
| Mistral Large 3 | $0.50 | $1.50 | Mid-quality at budget price |
GPT-5 mini is the smart budget pick. At $0.25/$2, it offers significantly better reasoning than other budget models. For pure cost, Gemini Flash-Lite and Mistral Small 4 are cheaper, but GPT-5 mini handles complex tasks that trip up other budget models. If you need both cheap and capable, GPT-5 mini is the sweet spot.
Mid Tier: Models $1-5/1M Input
The mid tier is where production applications live. These models balance quality and cost, handling complex reasoning, code generation, and multi-step analysis without the premium price tag.
| Model | Input / 1M | Output / 1M | Best For |
|---|---|---|---|
| Claude Haiku 4.5 | $1.00 | $5.00 | Fast, high-quality budget Anthropic |
| GPT-5 | $1.25 | $10.00 | Best mid-tier value overall |
| Gemini 2.5 Pro | $1.25 | $10.00 | Google ecosystem, long context |
| Grok 4.3 | $1.25 | $2.50 | Fast inference, social data |
| GPT-5.3 Codex | $1.75 | $14.00 | Code generation specialist |
| Gemini 3.1 Pro | $2.00 | $12.00 | Latest Google premium |
| GPT-4o | $2.50 | $10.00 | Vision, multimodal |
| Claude Sonnet 5 | $3.00 | $15.00 | High-quality analysis, writing |
| Claude Sonnet 4.6 | $3.00 | $15.00 | Proven Anthropic workhorse |
Premium Tier: Models $5+/1M Input
Premium models are for the hardest problems: complex multi-step reasoning, advanced coding, creative writing, research analysis, and tasks requiring deep domain knowledge. Use them sparingly, since most workloads do not need this level.
| Model | Input / 1M | Output / 1M | Best For |
|---|---|---|---|
| GPT-5.5 | $5.00 | $30.00 | Complex reasoning, multi-file code |
| Claude Opus 4.8 | $5.00 | $25.00 | Deep analysis, long-form writing |
| Claude Opus 4.7 | $5.00 | $25.00 | Proven premium Anthropic |
| Claude Fable 5 | $5.00 | $25.00 | Creative tasks, storytelling |
| Claude Mythos 5 | $5.00 | $25.00 | Abstract reasoning, research |
| GPT-5.5 Pro | $30.00 | $180.00 | Extreme reasoning (very expensive) |
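The three tiers are defined purely by input price, so they can be expressed as a small helper. This is a sketch with a hypothetical function name. The boundary handling is an assumption inferred from the tables: exactly $1.00 counts as Mid (Claude Haiku 4.5) and exactly $5.00 counts as Premium (GPT-5.5).

```python
def tier(input_price_per_1m: float) -> str:
    """Classify a model by its input price in USD per 1M tokens.

    Boundaries are inferred from the tier tables: $1.00 is Mid, $5.00 is Premium.
    """
    if input_price_per_1m < 1.00:
        return "Budget"
    if input_price_per_1m < 5.00:
        return "Mid"
    return "Premium"


# Input prices copied from the tier tables above.
for model, price in [
    ("Mistral Large 3", 0.50),
    ("Claude Haiku 4.5", 1.00),
    ("Claude Sonnet 5", 3.00),
    ("GPT-5.5", 5.00),
    ("GPT-5.5 Pro", 30.00),
]:
    print(f"{model:<18} ${price:>6.2f}  {tier(price)}")
```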
Use Case Recommendations
Not sure which tier to pick? Here is a recommendation based on your specific use case:
Chatbots and Customer Support
Budget tier recommended. Customer support bots handle simple questions 80% of the time. Use GPT-5 mini ($0.25/$2) for routine questions and route complex issues to a mid-tier model. This hybrid approach costs 80-90% less than using a premium model for everything.
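The savings from hybrid routing can be estimated with a back-of-the-envelope calculation. The sketch below rests on several assumptions that are not from this guide: every request uses 1,000 input and 500 output tokens, the 80% simple share maps directly to 80% of requests, GPT-5 is the mid-tier fallback, and GPT-5.5 is the premium baseline. Only the prices come from the tables in this guide.

```python
# (input, output) list prices in USD per 1M tokens, from the tables in this guide.
PRICES = {
    "GPT-5 mini": (0.25, 2.00),   # budget model for routine questions
    "GPT-5": (1.25, 10.00),       # assumed mid-tier fallback
    "GPT-5.5": (5.00, 30.00),     # assumed premium baseline
}

# Assumed workload shape (illustrative values, not measurements).
INPUT_TOKENS = 1_000
OUTPUT_TOKENS = 500
SIMPLE_SHARE = 0.80  # share of requests handled by the budget model


def request_cost(model):
    """Cost in USD of one request at list price for the assumed token counts."""
    input_price, output_price = PRICES[model]
    return (INPUT_TOKENS * input_price + OUTPUT_TOKENS * output_price) / 1_000_000


# Expected cost per request when routing by difficulty.
hybrid = (SIMPLE_SHARE * request_cost("GPT-5 mini")
          + (1 - SIMPLE_SHARE) * request_cost("GPT-5"))
premium_only = request_cost("GPT-5.5")
savings = 1 - hybrid / premium_only

print(f"Hybrid:       ${hybrid:.5f} per request")
print(f"Premium only: ${premium_only:.5f} per request")
print(f"Savings:      {savings:.1%}")
```

Under these assumptions the hybrid setup saves a little under 89%, which falls inside the 80-90% range. A different token mix or routing share will shift the result.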
Code Generation and Review
Mid tier recommended. Code generation needs reasoning capability but not premium. GPT-5 ($1.25/$10) handles most coding tasks well. For complex multi-file refactoring, upgrade to GPT-5.3 Codex ($1.75/$14) or Opus 4.8 ($5/$25) only when needed.
Content Writing and Marketing
Mid tier recommended. Blog posts, emails, and marketing copy work well with Claude Sonnet 5 ($3/$15) or GPT-5 ($1.25/$10). For long-form articles or creative content that needs a distinctive voice, Opus 4.8 ($5/$25) produces noticeably better output.
Data Extraction and Classification
Budget tier recommended. Structured extraction and classification are where budget models shine. Mistral Small 4 ($0.10/$0.30) or GPT-4o mini ($0.15/$0.60) handle these tasks at 95%+ accuracy for a fraction of premium costs. Save your budget for tasks that actually need intelligence.
Complex Reasoning and Research
Premium tier required. Multi-step reasoning, mathematical proofs, research analysis, and complex decision-making require premium models. GPT-5.5 ($5/$30) or Claude Opus 4.8 ($5/$25) are the only options that handle these tasks reliably. There is no budget shortcut here.
Translation and Summarization
Budget tier recommended. Translation and summarization are well-solved problems. GPT-4o mini ($0.15/$0.60) or DeepSeek V4 Flash ($0.14/$0.28) produce quality comparable to premium models for these specific tasks. Premium models such as GPT-5.5 ($5/$30) add marginal quality at roughly 30-100x the price.
Full Cost Comparison Table
Every model in one table, sorted by input cost. This is your complete reference for AI API pricing in July 2026.
| Model | Provider | Input / 1M | Output / 1M | Tier |
|---|---|---|---|---|
| Gemini 2.5 Flash-Lite | Google | $0.075 | $0.30 | Budget |
| Mistral Small 4 | Mistral | $0.10 | $0.30 | Budget |
| Gemini 2.5 Flash-Lite | Google | $0.10 | $0.40 | Budget |
| DeepSeek V4 Flash | DeepSeek | $0.14 | $0.28 | Budget |
| GPT-4o mini | OpenAI | $0.15 | $0.60 | Budget |
| Llama 4 Scout | Meta | $0.18 | $0.59 | Budget |
| Llama 4 Maverick | Meta | $0.20 | $0.60 | Budget |
| GPT-5 mini | OpenAI | $0.25 | $2.00 | Budget |
| DeepSeek V4 Pro | DeepSeek | $0.435 | $0.87 | Budget |
| Mistral Large 3 | Mistral | $0.50 | $1.50 | Budget |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 | Mid |
| GPT-5 | OpenAI | $1.25 | $10.00 | Mid |
| Gemini 2.5 Pro | Google | $1.25 | $10.00 | Mid |
| Grok 4.3 | xAI | $1.25 | $2.50 | Mid |
| GPT-5.3 Codex | OpenAI | $1.75 | $14.00 | Mid |
| Gemini 3.1 Pro | Google | $2.00 | $12.00 | Mid |
| GPT-4o | OpenAI | $2.50 | $10.00 | Mid |
| Command R+ | Cohere | $2.50 | $10.00 | Mid |
| Claude Sonnet 5 | Anthropic | $3.00 | $15.00 | Mid |
| Claude Sonnet 4.6 | Anthropic | $3.00 | $15.00 | Mid |
| GPT-5.5 | OpenAI | $5.00 | $30.00 | Premium |
| Claude Opus 4.8 | Anthropic | $5.00 | $25.00 | Premium |
| Claude Opus 4.7 | Anthropic | $5.00 | $25.00 | Premium |
| Claude Fable 5 | Anthropic | $5.00 | $25.00 | Premium |
| Claude Mythos 5 | Anthropic | $5.00 | $25.00 | Premium |
| GPT-5.5 Pro | OpenAI | $30.00 | $180.00 | Premium |
Interactive Cost Calculator
Enter your tokens and requests to see exactly what each model costs for your workload.
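The arithmetic behind a calculator like this is simple enough to sketch offline. The version below is a hypothetical stand-in, not the site's actual tool. It assumes a flat 30-day month, identical token counts for every request, and list prices with no caching or batch discounts. Prices are copied from the comparison table above for a handful of models.

```python
DAYS_PER_MONTH = 30  # assumption: flat 30-day month

# (input, output) list prices in USD per 1M tokens, from the comparison table.
PRICES = {
    "Mistral Small 4": (0.10, 0.30),
    "DeepSeek V4 Flash": (0.14, 0.28),
    "GPT-5 mini": (0.25, 2.00),
    "GPT-5": (1.25, 10.00),
    "Claude Opus 4.8": (5.00, 25.00),
    "GPT-5.5 Pro": (30.00, 180.00),
}


def daily_cost(model, input_tokens, output_tokens, requests_per_day):
    """Daily cost in USD, given per-request token counts and request volume."""
    input_price, output_price = PRICES[model]
    per_request = (input_tokens * input_price + output_tokens * output_price) / 1_000_000
    return per_request * requests_per_day


def monthly_report(input_tokens, output_tokens, requests_per_day):
    """Return (model, daily cost, monthly cost) rows sorted from cheapest to priciest."""
    rows = []
    for model in PRICES:
        day = daily_cost(model, input_tokens, output_tokens, requests_per_day)
        rows.append((model, day, day * DAYS_PER_MONTH))
    return sorted(rows, key=lambda row: row[1])


# Example workload: 1,000 input + 500 output tokens per request, 10,000 requests/day.
for model, day, month in monthly_report(1_000, 500, 10_000):
    print(f"{model:<20} ${day:>10.2f}/day  ${month:>12.2f}/month")
```

Swapping in your own token counts and request volume shows how quickly the ordering changes between input-heavy and output-heavy workloads.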
Key Takeaways
- Cheapest overall: Gemini 2.5 Flash-Lite at $0.075/$0.30 per 1M tokens. For output-heavy work, DeepSeek V4 Flash at $0.14/$0.28 is cheapest.
- Cheapest from OpenAI: GPT-4o mini at $0.15/$0.60. For smarter budget tasks, GPT-5 mini at $0.25/$2 is the sweet spot.
- Cheapest from Anthropic: Claude Haiku 4.5 at $1/$5. No budget-tier option — Anthropic focuses on quality over price.
- Best mid-tier value: GPT-5 at $1.25/$10 — near-premium quality at mid-tier pricing. Handles 90% of production workloads.
- Cheapest premium: Claude Opus 4.8 at $5/$25 — $5 cheaper per 1M output tokens than GPT-5.5. With 90% caching, it is the premium value pick.
- Avoid GPT-5.5 Pro unless necessary: At $30/$180, it costs 6x as much as GPT-5.5 ($5/$30), the next most expensive model. Only use it for extreme reasoning tasks.
- Match model to task: Classification and extraction need budget models. Code and analysis need mid-tier. Complex reasoning needs premium. Most developers overpay by using premium for budget-level tasks.