Cohere API Pricing

Complete pricing breakdown for every Cohere model — Command R+ and Command R

2
Models
$0.15–$10
Per 1M tokens
128K
Context window
Apr 2026
Pricing Verified

Why Cohere?

Cohere specializes in enterprise AI with a focus on retrieval-augmented generation (RAG) and search.

RAG-Optimized

Command R models are purpose-built for retrieval-augmented generation. They excel at grounding responses in your documents and data sources, making them ideal for enterprise knowledge bases.

Enterprise-Grade

Cohere offers on-premise deployment, fine-tuning, and enterprise SLAs. Their models are designed for production workloads with consistent, reliable outputs.

Competitive Pricing

Command R at $0.50/$1.50 per 1M tokens offers strong RAG capabilities at a competitive price point. Command R+ competes with GPT-4o at a similar price point.

Tool Use & Agents

Both Command R models support tool use and multi-step agent workflows natively, making them suitable for building AI agents and automated pipelines.

All Cohere Models

Pricing per 1M tokens, as of May 2026.

Model Tier Input (per 1M tokens) Output (per 1M tokens) Context Window
Command R Budget $0.50 $1.50 128K
Command R+ Mid $2.50 $10.00 128K

Which Cohere Model Should You Use?

Match your use case to the right model and tier.

RAG & Document Q&A

Command R+

Cohere's flagship model for retrieval-augmented generation. Best accuracy when grounding answers in large document collections. $2.50/$10.00 per 1M tokens.

Chatbots & Support

Command R

Affordable conversational AI with strong instruction following. Handles FAQs and support queries at $0.15/$0.60 per 1M tokens — same price as GPT-4o mini.

Semantic Search

Command R

Excellent for search reranking and semantic matching. Process large result sets affordably with 128K context at budget pricing.

Content Generation

Command R+

High-quality text generation for marketing copy, reports, and summaries. Strong reasoning at $2.50/$10.00 per 1M tokens.

Classification & Extraction

Command R

Budget-friendly for high-volume classification, entity extraction, and structured data processing. 128K context handles large inputs.

AI Agents & Tool Use

Command R+

Native tool use support for multi-step agent workflows. Best for complex pipelines requiring multiple API calls and reasoning steps.

Cohere Cost Calculator

Estimate your monthly spend for any Cohere model.

Monthly Cost Estimate

Input cost $0.00
Output cost $0.00
Total tokens/month 0
Monthly Total $0.00

Based on published Cohere API pricing. Actual costs may vary.

Cohere vs Competitors

How do Cohere's prices compare to OpenAI, Anthropic, and Google for equivalent model tiers?

Tier Cohere Input / Output OpenAI Input / Output Google Input / Output
Budget Command R $0.15 / $0.60 GPT-4o mini $0.15 / $0.60 Gemini 2.0 Flash $0.10 / $0.40
Mid-Range Command R+ $2.50 / $10.00 GPT-4o $2.50 / $10.00 Gemini 2.5 Pro $1.25 / $10.00

Prices per 1M tokens. Cohere Command R matches GPT-4o mini pricing exactly. Google Gemini models have larger context windows (up to 1M tokens).

Compare Cohere Models Side-by-Side

See how Cohere models stack up against OpenAI, Anthropic, and more

Calculate Your Cohere Costs

Enter your usage, pick a model, and see exactly what you'll spend per month.

Try the APIpulse Calculator
Report a Pricing Error