by Cohere

Command R+.

text open weights datacenter 104B params 128K ctx Transformer Quality 72.2
Cheapest input
$2.5/M
on Cohere API
Cheapest output
$10.0/M
on Cohere API
Fastest
32 tok/s
on OpenRouter
Smallest GPU
1× Nvidia A100
$0.48/hr
Capability snapshot

What it's best at.

General knowledge 74.6
Coding 70.7

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark Score Source
MMLU 74.6 official ↗
HumanEval 70.7 official ↗
Description

About Command R+.

Command R+ is Cohere's open-weight model optimised for retrieval-augmented generation (RAG) and tool use — 104B parameters, dense, trained with a strong emphasis on grounded generation (citing source documents inline) and structured tool calling. Available on Hugging Face under CC-BY-NC-4.0 (non-commercial research only; commercial use requires Cohere's API or enterprise license). Strong multilingual coverage (10+ languages). Particularly popular in enterprise RAG deployments because of its explicit citation feature.

Architecture

How it's built.

Architecture
Transformer
Knowledge cutoff
Mar 2024
182 days from cutoff to release.
Context window

How much it can remember.

128K tokens ≈ 96,000 English words
4K 32K 128K 1M
Max output per call: 4K tokens
Capabilities

What it can do.

· Vision input
· Audio input
· Video input
Function calling
Tool use
JSON mode
Streaming
Fine-tuning