Cohere API Pricing
Complete pricing breakdown for every Cohere model — Command R+ and Command R
Why Cohere?
Cohere specializes in enterprise AI with a focus on retrieval-augmented generation (RAG) and search.
RAG-Optimized
Command R models are purpose-built for retrieval-augmented generation. They excel at grounding responses in your documents and data sources, making them ideal for enterprise knowledge bases.
Enterprise-Grade
Cohere offers on-premise deployment, fine-tuning, and enterprise SLAs. Their models are designed for production workloads with consistent, reliable outputs.
Competitive Pricing
Command R at $0.50/$1.50 per 1M tokens offers strong RAG capabilities at a competitive price point. Command R+ competes with GPT-4o at a similar price point.
Tool Use & Agents
Both Command R models support tool use and multi-step agent workflows natively, making them suitable for building AI agents and automated pipelines.
All Cohere Models
Pricing per 1M tokens, as of May 2026.
| Model | Tier | Input (per 1M tokens) | Output (per 1M tokens) | Context Window |
|---|---|---|---|---|
| Command R | Budget | $0.50 | $1.50 | 128K |
| Command R+ | Mid | $2.50 | $10.00 | 128K |
Which Cohere Model Should You Use?
Match your use case to the right model and tier.
RAG & Document Q&A
Cohere's flagship model for retrieval-augmented generation. Best accuracy when grounding answers in large document collections. $2.50/$10.00 per 1M tokens.
Chatbots & Support
Affordable conversational AI with strong instruction following. Handles FAQs and support queries at $0.15/$0.60 per 1M tokens — same price as GPT-4o mini.
Semantic Search
Excellent for search reranking and semantic matching. Process large result sets affordably with 128K context at budget pricing.
Content Generation
High-quality text generation for marketing copy, reports, and summaries. Strong reasoning at $2.50/$10.00 per 1M tokens.
Classification & Extraction
Budget-friendly for high-volume classification, entity extraction, and structured data processing. 128K context handles large inputs.
AI Agents & Tool Use
Native tool use support for multi-step agent workflows. Best for complex pipelines requiring multiple API calls and reasoning steps.
Cohere Cost Calculator
Estimate your monthly spend for any Cohere model.
Monthly Cost Estimate
Based on published Cohere API pricing. Actual costs may vary.
Cohere vs Competitors
How do Cohere's prices compare to OpenAI, Anthropic, and Google for equivalent model tiers?
| Tier | Cohere | Input / Output | OpenAI | Input / Output | Input / Output | |
|---|---|---|---|---|---|---|
| Budget | Command R | $0.15 / $0.60 | GPT-4o mini | $0.15 / $0.60 | Gemini 2.0 Flash | $0.10 / $0.40 |
| Mid-Range | Command R+ | $2.50 / $10.00 | GPT-4o | $2.50 / $10.00 | Gemini 2.5 Pro | $1.25 / $10.00 |
Prices per 1M tokens. Cohere Command R matches GPT-4o mini pricing exactly. Google Gemini models have larger context windows (up to 1M tokens).
See how Cohere models stack up against OpenAI, Anthropic, and more
Related Reading
Deep dives on Cohere pricing, RAG costs, and provider comparisons.
AI API Free Tiers Compared: What You Can Build for Free
Compare free tiers from OpenAI, Anthropic, Google, Mistral, and Cohere. See exactly what you can build without spending a dollar.
EmbeddingsEmbedding API Pricing: OpenAI vs Cohere vs Google (2026)
Compare embedding API pricing for RAG, semantic search, and classification. Cohere's embed models offer competitive pricing.
RAGThe True Cost of RAG: LLM Pricing for Retrieval-Augmented Generation
Break down the costs of RAG pipelines: embedding, vector search, and generation. Compare Cohere's Command R against alternatives.
ReferenceLLM API Pricing Cheat Sheet: Every Model, Every Provider
A complete pricing reference for every model across all providers, including Cohere Command R and Command R+.
Calculate Your Cohere Costs
Enter your usage, pick a model, and see exactly what you'll spend per month.
Try the APIpulse CalculatorGet notified when API prices change
No spam. Only pricing updates and new features. Unsubscribe anytime.