Kimi K2 Instruct

Kimi K2 Instruct is Moonshot AI's Mixture-of-Experts (MoE) language model with one trillion total parameters and 32 billion active per forward pass, a context window of 131.1K tokens, available through AI Gateway via Novita AI.

Tool Use

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'moonshotai/kimi-k2',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

About Kimi K2 Instruct

Kimi K2 Instruct, released July 11, 2025, is a Mixture-of-Experts (MoE) language model from Moonshot AI.

Sparse expert routing at 32B activation. The full trillion parameters encode broad knowledge: programming languages, API conventions, domain facts, and tool-use patterns. At inference time, a routing mechanism selects roughly 32 billion parameters per token. Latency and compute cost stay comparable to a dense 32B model, while the knowledge base spans the entire trillion-parameter budget.

With 32B active parameters for reasoning depth and a full 1T parameter budget encoding broad tool-use and coding knowledge, K2 handles structured sequences of API calls, multi-step planning, and code synthesis.

Kimi K2 Instruct is available through AI Gateway at $0.57 per million input tokens and $2.3 per million output tokens.

AI Gateway routes K2 across Novita AI, giving you automatic failover across multiple providers.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Kimi K2 Instruct

About Kimi K2 Instruct