Cohere For AI · Released August 30, 2024

Command R+ (08-2024)

Command R+ is Cohere's flagship open-weight release: a 104-billion-parameter dense model purpose-built for retrieval-augmented generation, tool use, and grounded enterprise workflows. The 08-2024 refresh improved instruction-following and refusal handling over the original April 2024 release.

What sets it apart

Most open-weight models treat RAG as an afterthought. Command R+ was trained with explicit document-citation tokens and a tool-call schema as first-class outputs. When given a set of documents, it produces grounded answers with inline citations to specific passages — without needing prompt engineering hacks.

What it's good at

Enterprise document QA, multi-step tool use, multilingual workflows (10 languages with strong coverage), and any task where you need the model to cite its sources. Performance on creative tasks is solid but not its strong suit.

Running it locally

Full FP16 weights are ~208 GB. Q4_K_M GGUF is ~63 GB. Practical for a multi-GPU workstation but not for a single consumer card; most users access it through Cohere's hosted API or a self-hosted vLLM deployment.

License

CC-BY-NC 4.0 — non-commercial only. For commercial use you must obtain a separate license from Cohere. This is the most restrictive license in the catalog; if commercial use matters, look at Llama 3.3 or Qwen2.5 instead.