Command A

Command A is an open-weights enterprise model from Cohere built for tool use, agentic workflows, retrieval-augmented generation (RAG), and multilingual tasks, with a 256K-token context window.

Tool Use
index.ts
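import { streamText } from 'ai'

// Start a streaming generation; the model is addressed by its 'provider/model' ID.
const result = streamText({
  model: 'cohere/command-a',
  prompt: 'Why is the sky blue?'
})

// Print each text chunk as it arrives.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}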

About Command A

Command A is a 111-billion-parameter open-weights language model released March 13, 2025. Its transformer stack interleaves two kinds of attention: three sliding-window attention layers (4,096-token window each) for every one global attention layer. The sliding-window layers keep attention local, while the global layer lets tokens interact across the full 256K-token context. This hybrid design handles both local context and long-range dependencies within a single request.
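To make the hybrid attention pattern concrete, here is a minimal sketch of which earlier tokens a given token can see in each layer type. It is an illustration only, not Cohere's implementation: the helper names (`attentionKind`, `canAttend`), the assumption that the global layer comes last in each group of four, the causal masking, and the choice to count the current token as part of the 4,096-token window are all assumptions made for this example.

```typescript
// Values taken from the description above.
const WINDOW_SIZE = 4096;
// Assumption: layers repeat in groups of four (three sliding-window + one global).
const LAYERS_PER_BLOCK = 4;

type AttentionKind = 'sliding' | 'global';

// Hypothetical helper: assumes the global layer is the last layer in each group.
function attentionKind(layerIndex: number): AttentionKind {
  return layerIndex % LAYERS_PER_BLOCK === LAYERS_PER_BLOCK - 1 ? 'global' : 'sliding';
}

// Hypothetical helper: can the token at queryPos attend to the token at keyPos?
// Assumes causal masking (no attention to later tokens). Sliding-window layers
// additionally restrict attention to the most recent WINDOW_SIZE positions,
// counting the current token itself.
function canAttend(layerIndex: number, queryPos: number, keyPos: number): boolean {
  if (keyPos > queryPos) return false;
  if (attentionKind(layerIndex) === 'global') return true;
  return queryPos - keyPos < WINDOW_SIZE;
}
```

Under these assumptions, a token at position 5,000 cannot see position 0 in a sliding-window layer (`canAttend(0, 5000, 0)` is `false`) but can see it in a global layer (`canAttend(3, 5000, 0)` is `true`), which is how long-range information still propagates through the stack.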

Tool use and multi-step ReAct agent behavior are core strengths. Command A delivers higher throughput than Command R+ 08-2024 on comparable setups. Structured output, citation generation, safety modes, and RAG are built-in API features rather than prompt-only patterns. The knowledge cutoff is June 2024.
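As a rough illustration of what a multi-step ReAct loop looks like, the sketch below alternates between a model step (choose a tool or give a final answer) and a tool observation that is fed back into the transcript. Everything here is hypothetical and for illustration only: the `Step`, `Model`, and `runReActLoop` names are not part of the Cohere API or the AI SDK, and the model is replaced by a scripted stand-in so the example runs without network access.

```typescript
// Hypothetical types for illustration; not the Cohere or AI SDK API.
type Step =
  | { kind: 'tool_call'; tool: string; input: string }
  | { kind: 'final'; answer: string };

type Model = (transcript: string[]) => Step;
type Tools = Map<string, (input: string) => string>;

// Reason/act loop: ask the model for a step, run the requested tool,
// append the observation, and repeat until a final answer or the step limit.
function runReActLoop(model: Model, tools: Tools, question: string, maxSteps = 5): string {
  const transcript = [`Question: ${question}`];
  for (let i = 0; i < maxSteps; i++) {
    const step = model(transcript);
    if (step.kind === 'final') return step.answer;
    const toolFn = tools.get(step.tool);
    const observation = toolFn ? toolFn(step.input) : `Unknown tool: ${step.tool}`;
    transcript.push(`Action: ${step.tool}(${step.input})`, `Observation: ${observation}`);
  }
  return 'Stopped: step limit reached';
}

// Scripted stand-in for a real model: calls the 'add' tool once,
// then answers using the observation it received.
const scriptedModel: Model = (transcript) => {
  const lastLine = transcript[transcript.length - 1] ?? '';
  return lastLine.startsWith('Observation: ')
    ? { kind: 'final', answer: `The total is ${lastLine.slice('Observation: '.length)}.` }
    : { kind: 'tool_call', tool: 'add', input: '2,3' };
};

const tools: Tools = new Map([
  ['add', (input: string) => String(input.split(',').map(Number).reduce((a, b) => a + b, 0))],
]);

const answer = runReActLoop(scriptedModel, tools, 'What is 2 + 3?');
```

Here `answer` ends up as `'The total is 5.'` after one tool call and one final step. In a real deployment the scripted function would be replaced by a call to the model, and the step limit guards against a loop that never reaches a final answer.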

Command A covers 23 languages: English, French, Spanish, Italian, German, Portuguese, Japanese, Korean, Arabic, Chinese (Mandarin), Russian, Polish, Turkish, Vietnamese, Dutch, Czech, Indonesian, Ukrainian, Romanian, Greek, Hindi, Hebrew, and Persian. This breadth supports multilingual enterprise deployments without separate regional models.

See https://docs.cohere.com/v2/docs/command-a for API details. Where the API enforces an output limit, each generation is typically capped at 8K tokens.