Command A is a 111-billion-parameter open-weights language model released March 13, 2025. It uses a transformer stack with three sliding-window attention layers (4,096-token window each) and one global attention layer. The global layer allows token interaction across the full context of 256K tokens. This hybrid attention design handles local context and long-range dependencies in one request.
Tool use and multi-step ReAct agent behavior are core strengths. Command A delivers higher throughput than Command R+ 08-2024 on comparable setups. Structured output, citation generation, safety modes, and RAG are built-in API features rather than prompt-only patterns. The knowledge cutoff is June 2024.
Command A covers 23 languages: English, French, Spanish, Italian, German, Portuguese, Japanese, Korean, Arabic, Chinese (Mandarin), Russian, Polish, Turkish, Vietnamese, Dutch, Czech, Indonesian, Ukrainian, Romanian, Greek, Hindi, Hebrew, and Persian. This breadth supports multilingual enterprise deployments without separate regional models.
See https://docs.cohere.com/v2/docs/command-a for API details. Typical output length is capped at 8K tokens per generation where the API enforces a limit.