What is the cheapest LLM API in July 2026?

GPT-oss 20B is the cheapest at $0.08/$0.35 per million input/output tokens. For open-source models, Llama 3.1 8B via Together.ai costs $0.10/$0.10 per million tokens.

How much does GPT-5 cost in 2026?

GPT-5 costs $1.25 per million input tokens and $10.00 per million output tokens. GPT-5 mini is 5x cheaper at $0.25/$2.00 per million tokens.

How much does Claude cost in 2026?

Claude Sonnet 4.6 costs $3.00/$15.00 per million input/output tokens. Claude Haiku 4.5 costs $1.00/$5.00. Claude Opus 4.8 costs $5.00/$25.00.

Which LLM provider is cheapest overall?

DeepSeek and Google offer the cheapest models. DeepSeek V4 Flash costs $0.14/$0.28, and Gemini 2.5 Flash-Lite costs $0.10/$0.40 per million tokens.

What LLM models are being deprecated in July 2026?

Claude 4 Opus, Claude Sonnet 4, and DeepSeek V3 are all retired on June 15, 2026. Migrate to Claude Opus 4.8, Claude Sonnet 4.6, and DeepSeek V4 Flash respectively.

State of LLM API Pricing — June 2026: 59 Models Compared

Data verified Jun 7, 2026. Prices in USD per million tokens. This report is updated monthly. View the full interactive version with sortable tables, charts, and cost scenarios.