xAI Grok API Pricing Guide 2026: Grok 3 vs Grok 3 Mini

xAI's Grok models offer a unique combination of real-time knowledge and competitive pricing. Here's everything you need to know about Grok 3 and Grok 3 Mini API costs.

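Price Alert (May 2, 2026): Grok 3 pricing has increased 10x and is now $30.00/$150.00 per 1M tokens (input/output). The tables and cost breakdowns in this guide use the earlier $3.00/$15.00 rates. Read our May 2026 Pricing Shakeup for the full impact analysis.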

xAI Grok Model Lineup

xAI offers two API models, each targeting different use cases and budgets:

| Model | Input (per 1M tokens) | Output (per 1M tokens) | Context Window | Tier |
|---|---|---|---|---|
| Grok 3 | $3.00 | $15.00 | 128K | Mid |
| Grok 3 Mini | $0.30 | $0.50 | 128K | Budget |

Grok 3: The Flagship

Grok 3 is xAI's most capable model, designed for complex reasoning, coding, and general-purpose tasks. At $3.00/$15.00 per 1M tokens, it positions itself in the mid-tier alongside Claude Sonnet 4 ($3.00/$15.00) and GPT-4o ($2.50/$10.00).

What Makes Grok 3 Different

Real-time knowledge: Grok 3 has access to real-time information through X (Twitter), giving it an edge for current events and trending topics.

Unfiltered responses: Grok is known for fewer content restrictions compared to competitors, making it popular for creative and open-ended tasks.

X/Twitter integration: Native understanding of social media content, threads, and trends.

Grok 3 Mini: The Budget Option

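Grok 3 Mini is xAI's answer to GPT-4o mini and Claude Haiku. At $0.30/$0.50 per 1M tokens, it undercuts Claude Haiku 4.5 on both input and output and beats GPT-4o mini on output price, though GPT-4o mini has the lower input rate and DeepSeek V4 Flash is cheaper on both. It maintains solid performance for routine tasks.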

| Model | Input (per 1M tokens) | Output (per 1M tokens) | Context | Best For |
|---|---|---|---|---|
| Grok 3 Mini | $0.30 | $0.50 | 128K | Chatbots, classification, simple Q&A |
| GPT-4o mini | $0.15 | $0.60 | 128K | General budget tasks |
| Claude Haiku 4.5 | $0.80 | $4.00 | 200K | Fast classification, summarization |
| DeepSeek V4 Flash | $0.14 | $0.28 | 128K | Cheapest option overall |

Use Case Cost Breakdowns

1. Chatbot (10K requests/day)

Assuming 500 input tokens and 300 output tokens per request, running 30 days/month:

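| Model | Monthly Input Cost | Monthly Output Cost | Total Monthly |
|---|---|---|---|
| Grok 3 | $450.00 | $1,350.00 | $1,800.00 |
| Grok 3 Mini | $45.00 | $45.00 | $90.00 |
| GPT-4o | $375.00 | $900.00 | $1,275.00 |
| Claude Sonnet 4 | $450.00 | $1,350.00 | $1,800.00 |

These figures follow from 300,000 requests per month, which is 150M input tokens and 90M output tokens.

Verdict: Grok 3 Mini is the cheapest option at $90.00/month. Grok 3 matches Claude Sonnet 4 but is more expensive than GPT-4o.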
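The arithmetic behind a breakdown like this is easy to reproduce. The following is a minimal sketch, not an official calculator: the function name `monthly_cost` and the `RATES` dictionary are made up for illustration, and the per-1M-token rates are the ones quoted in this guide.

```python
# Rates in USD per 1M tokens (input, output), as quoted in this guide.
RATES = {
    "Grok 3": (3.00, 15.00),
    "Grok 3 Mini": (0.30, 0.50),
    "GPT-4o": (2.50, 10.00),
    "Claude Sonnet 4": (3.00, 15.00),
}

def monthly_cost(model, requests_per_day, input_tokens, output_tokens, days=30):
    """Return (input_cost, output_cost, total) in USD for one month."""
    input_rate, output_rate = RATES[model]
    requests = requests_per_day * days
    # Convert total tokens to millions before applying the per-1M rate.
    input_cost = requests * input_tokens / 1_000_000 * input_rate
    output_cost = requests * output_tokens / 1_000_000 * output_rate
    return input_cost, output_cost, input_cost + output_cost

if __name__ == "__main__":
    # Chatbot scenario: 10K requests/day, 500 input and 300 output tokens each.
    for model in RATES:
        i, o, t = monthly_cost(model, 10_000, 500, 300)
        print(f"{model}: input ${i:,.2f}, output ${o:,.2f}, total ${t:,.2f}")
```

Swapping in your own request volume and token counts is usually more informative than any fixed scenario, since the ranking between models can change with the input/output mix.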

2. Code Generation (500 requests/day)

Assuming 2,000 input tokens and 1,500 output tokens per request:

| Model | Monthly Input Cost | Monthly Output Cost | Total Monthly |
|---|---|---|---|
| Grok 3 | $90.00 | $337.50 | $427.50 |
| Grok 3 Mini | $9.00 | $11.25 | $20.25 |
| GPT-4o | $75.00 | $225.00 | $300.00 |
| DeepSeek V4 Pro | $65.40 | $196.20 | $261.60 |

Verdict: For code generation, Grok 3 is pricier than GPT-4o and DeepSeek V4 Pro. Consider Grok 3 Mini for simpler code tasks and Grok 3 only for complex reasoning.

3. Document Analysis (1,000 requests/day)

Assuming 3,000 input tokens and 500 output tokens per request:

| Model | Monthly Input Cost | Monthly Output Cost | Total Monthly |
|---|---|---|---|
| Grok 3 | $270.00 | $225.00 | $495.00 |
| Grok 3 Mini | $27.00 | $7.50 | $34.50 |
| GPT-4o | $225.00 | $150.00 | $375.00 |
| Gemini 2.5 Pro | $112.50 | $150.00 | $262.50 |

Verdict: Document analysis is input-heavy, making Grok 3's $3.00 input rate costly. Gemini 2.5 Pro offers better value for this use case.
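To see why the input rate matters more here than in the chatbot case, it helps to look at what fraction of each request's cost comes from input tokens. This is a small illustrative sketch; the helper name `input_share` is hypothetical, and the rates are Grok 3's $3.00/$15.00 from this guide.

```python
def input_share(input_tokens, output_tokens, input_rate, output_rate):
    """Fraction of per-request cost that comes from input tokens."""
    input_cost = input_tokens * input_rate
    output_cost = output_tokens * output_rate
    return input_cost / (input_cost + output_cost)

# Grok 3 rates from this guide: $3.00 input, $15.00 output per 1M tokens.
chatbot = input_share(500, 300, 3.00, 15.00)
documents = input_share(3000, 500, 3.00, 15.00)

print(f"Chatbot: {chatbot:.0%} of cost is input")             # 25%
print(f"Document analysis: {documents:.0%} of cost is input")  # 55%
```

When input accounts for more than half the bill, a provider with a lower input rate can come out ahead even if its output rate is similar.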

When to Choose xAI Grok

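Choose Grok 3 when: you need real-time knowledge of current events and trending topics, native handling of X/Twitter content, or fewer content restrictions for creative and open-ended tasks.

Choose Grok 3 Mini when: you run high-volume, low-complexity workloads such as chatbots, classification, and simple Q&A.

Consider alternatives when: cost dominates the decision. In the breakdowns above, GPT-4o and DeepSeek V4 Pro are cheaper for code generation, Gemini 2.5 Pro is cheaper for input-heavy document analysis, and DeepSeek V4 Flash is the cheapest budget model.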

Cost Optimization Tips for Grok

  1. Use Grok 3 Mini for routine tasks: It's 10x cheaper on input and 30x cheaper on output than Grok 3
  2. Implement model routing: Route simple queries to Grok 3 Mini, complex ones to Grok 3
  3. Set max_tokens limits: Grok 3's output costs $15 per 1M tokens, so cap response length to avoid runaway costs
  4. Batch similar requests: Process multiple documents in a single prompt when possible
  5. Monitor usage closely: xAI's usage dashboard may lag, so track costs independently
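Tips 2 and 3 can be sketched in a few lines. This is only an illustration under stated assumptions: the model identifiers `grok-3` and `grok-3-mini`, the keyword list, and the 4,000-character threshold are placeholders to tune for your own traffic, and no API call is made here.

```python
GROK_3 = "grok-3"            # assumed model identifier
GROK_3_MINI = "grok-3-mini"  # assumed model identifier

# Hypothetical keywords that suggest a request needs the larger model.
COMPLEX_HINTS = ("refactor", "prove", "debug", "architecture", "step by step")

def choose_model(prompt, long_prompt_chars=4000):
    """Crude router: long or hard-looking prompts go to Grok 3, the rest to Mini."""
    text = prompt.lower()
    if len(prompt) > long_prompt_chars or any(hint in text for hint in COMPLEX_HINTS):
        return GROK_3
    return GROK_3_MINI

def worst_case_output_cost(max_tokens, requests, output_rate_per_million):
    """Upper bound on output spend in USD if every response hits max_tokens."""
    return max_tokens * requests / 1_000_000 * output_rate_per_million

if __name__ == "__main__":
    print(choose_model("What are your opening hours?"))
    print(choose_model("Debug this stack trace and explain the root cause"))
    # 15,000 requests/month capped at 1,000 output tokens, at $15 per 1M tokens.
    print(worst_case_output_cost(1000, 15_000, 15.00))
```

A keyword heuristic is the simplest possible router. In practice you might route on task type or a lightweight classifier instead, but the budgeting logic stays the same: the max_tokens cap times request volume times the output rate bounds your output spend.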

The Bottom Line

xAI Grok occupies a unique niche in the LLM API market. Grok 3 matches Claude Sonnet 4 on pricing but differentiates with real-time knowledge and fewer restrictions. Grok 3 Mini is a solid budget option, though DeepSeek V4 Flash remains the cheapest overall.

The best strategy: use Grok 3 Mini for high-volume, low-complexity tasks and reserve Grok 3 for tasks where real-time knowledge or creative freedom matters. Combine with other providers for the best cost-to-quality ratio.