Cohere API Pricing

Complete pricing breakdown for every Cohere model: Command A, Command R+, and Command R

Per 1M tokens: $0.15–$10

Context window: 128K

Pricing verified: May 2026

Why Cohere?

Cohere specializes in enterprise AI with a focus on retrieval-augmented generation (RAG) and search.

RAG-Optimized

Command R models are purpose-built for retrieval-augmented generation. They excel at grounding responses in your documents and data sources, making them ideal for enterprise knowledge bases.

Enterprise-Grade

Cohere offers on-premise deployment, fine-tuning, and enterprise SLAs. Their models are designed for production workloads with consistent, reliable outputs.

Competitive Pricing

Command R at $0.15/$0.60 per 1M tokens (input/output) offers strong RAG capabilities at the same price as GPT-4o mini. Command R+ at $2.50/$10.00 matches GPT-4o's pricing.

Tool Use & Agents

Both Command R models support tool use and multi-step agent workflows natively, making them suitable for building AI agents and automated pipelines.

All Cohere Models

Pricing per 1M tokens, as of May 2026.

Model	Tier	Input (per 1M tokens)	Output (per 1M tokens)	Context Window
Command A	Mid	$2.50	$10.00	128K
Command R	Budget	$0.15	$0.60	128K
Command R+	Mid	$2.50	$10.00	128K

Which Cohere Model Should You Use?

Match your use case to the right model and tier.

RAG & Document Q&A

Command R+

Cohere's flagship model for retrieval-augmented generation. Best accuracy when grounding answers in large document collections. $2.50/$10.00 per 1M tokens.

Chatbots & Support

Command R

Affordable conversational AI with strong instruction following. Handles FAQs and support queries at $0.15/$0.60 per 1M tokens — same price as GPT-4o mini.

Semantic Search

Command R

Excellent for search reranking and semantic matching. Process large result sets affordably with 128K context at budget pricing.

Content Generation

Command R+

High-quality text generation for marketing copy, reports, and summaries. Strong reasoning at $2.50/$10.00 per 1M tokens.

Classification & Extraction

Command R

Budget-friendly for high-volume classification, entity extraction, and structured data processing. 128K context handles large inputs.

AI Agents & Tool Use

Command R+

Native tool use support for multi-step agent workflows. Best for complex pipelines requiring multiple API calls and reasoning steps.

Cohere Cost Calculator

Estimate your monthly spend for any Cohere model.

Inputs: model, input tokens per request, output tokens per request, requests per day, days per month.

Monthly Cost Estimate

Outputs: input cost, output cost, total tokens per month, monthly total.

Based on published Cohere API pricing. Actual costs may vary.
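The arithmetic behind an estimate like this can be sketched in a few lines of Python. This is a minimal sketch, not the calculator's actual implementation: the function name `estimate_monthly_cost`, the `MonthlyEstimate` type, and the example workload are all assumptions made for illustration. Prices are passed in explicitly, so any row of the pricing table can be plugged in.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass
class MonthlyEstimate:
    input_cost: float    # USD spent on input tokens per month
    output_cost: float   # USD spent on output tokens per month
    total_tokens: int    # input + output tokens per month
    total_cost: float    # input_cost + output_cost


def estimate_monthly_cost(
    input_price_per_1m: float,
    output_price_per_1m: float,
    input_tokens_per_request: int,
    output_tokens_per_request: int,
    requests_per_day: int,
    days_per_month: int = 30,
) -> MonthlyEstimate:
    """Hypothetical helper: monthly spend from per-1M-token prices and usage."""
    requests = requests_per_day * days_per_month
    input_tokens = input_tokens_per_request * requests
    output_tokens = output_tokens_per_request * requests

    input_cost = input_tokens / TOKENS_PER_MILLION * input_price_per_1m
    output_cost = output_tokens / TOKENS_PER_MILLION * output_price_per_1m

    return MonthlyEstimate(
        input_cost=input_cost,
        output_cost=output_cost,
        total_tokens=input_tokens + output_tokens,
        total_cost=input_cost + output_cost,
    )


# Assumed example workload: Command R+ at $2.50 / $10.00 per 1M tokens,
# 1,000 input and 500 output tokens per request, 1,000 requests/day, 30 days.
estimate = estimate_monthly_cost(2.50, 10.00, 1_000, 500, 1_000, 30)
print(f"Input cost:         ${estimate.input_cost:,.2f}")   # $75.00
print(f"Output cost:        ${estimate.output_cost:,.2f}")  # $150.00
print(f"Total tokens/month: {estimate.total_tokens:,}")     # 45,000,000
print(f"Monthly total:      ${estimate.total_cost:,.2f}")   # $225.00
```

Because output tokens cost four times as much as input tokens on Command R+, the 15M output tokens in this example account for two thirds of the bill even though they are only one third of the volume.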

Cohere vs Competitors

How do Cohere's prices compare to OpenAI and Google for equivalent model tiers?

Tier	Cohere Model	Cohere Input / Output	OpenAI Model	OpenAI Input / Output	Google Model	Google Input / Output
Budget	Command R	$0.15 / $0.60	GPT-4o mini	$0.15 / $0.60	Gemini 2.0 Flash	$0.10 / $0.40
Mid-Range	Command R+	$2.50 / $10.00	GPT-4o	$2.50 / $10.00	Gemini 2.5 Pro	$1.25 / $10.00

Prices per 1M tokens. Cohere Command R matches GPT-4o mini pricing exactly. Google Gemini models have larger context windows (up to 1M tokens).
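To see what these per-token differences mean for a real bill, the budget-tier row can be applied to a single workload. The snippet below is an illustrative sketch: the prices are copied from the table above, while the helper name `workload_cost` and the workload size (100M input tokens, 20M output tokens) are assumptions chosen only for the example.

```python
# Budget-tier prices in USD per 1M tokens (input, output), from the table above.
BUDGET_TIER = {
    "Command R": (0.15, 0.60),
    "GPT-4o mini": (0.15, 0.60),
    "Gemini 2.0 Flash": (0.10, 0.40),
}


def workload_cost(prices: tuple[float, float], input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: USD cost of a workload at the given per-1M prices."""
    input_price, output_price = prices
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Assumed workload: 100M input tokens and 20M output tokens.
costs = {
    model: workload_cost(prices, 100_000_000, 20_000_000)
    for model, prices in BUDGET_TIER.items()
}

# Print cheapest first.
for model, cost in sorted(costs.items(), key=lambda item: item[1]):
    print(f"{model}: ${cost:.2f}")
```

For this assumed workload, Command R and GPT-4o mini both come to $27.00 and Gemini 2.0 Flash to $18.00, so the listed price gap translates to about a one-third saving at the budget tier.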

