Kimi K2 Instruct, released September 5, 2025, is a Mixture-of-Experts (MoE) language model from Moonshot AI.
Sparse expert routing at 32B activation. The full trillion parameters encode broad knowledge: programming languages, API conventions, domain facts, and tool-use patterns. At inference time, a routing mechanism selects roughly 32 billion parameters per token. Latency and compute cost stay comparable to a dense 32B model, while the knowledge base spans the entire trillion-parameter budget.
With 32B active parameters for reasoning depth and a full 1T parameter budget encoding broad tool-use and coding knowledge, K2 handles structured sequences of API calls, multi-step planning, and code synthesis.
Kimi K2 Instruct is available through AI Gateway at $0.57 per million input tokens and $2.3 per million output tokens.
AI Gateway routes K2 across novita, giving you automatic failover across multiple providers.