Skip to content

Kimi K2 Instruct

Kimi K2 Instruct is Moonshot AI's Mixture-of-Experts (MoE) language model with one trillion total parameters and 32 billion active per forward pass, a context window of 131.1K tokens, available through AI Gateway via novita.

Tool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'moonshotai/kimi-k2',
prompt: 'Why is the sky blue?'
})

Frequently Asked Questions

  • How does sparse routing translate to cost savings?

    The full 1T parameters store broad knowledge, but only ~32B activate per token via the expert router. You pay compute proportional to a 32B dense model while drawing on knowledge encoded across the entire trillion-parameter budget.

  • Why does base K2 list many providers on AI Gateway?

    It was the first K2 variant adopted across providers, so routing across novita reflects earlier integration. Later checkpoints and variants can have narrower provider sets.

  • Is K2 text-only?

    Yes. Kimi K2 Instruct accepts and produces text. Multimodal capabilities are not part of this release.

  • What agentic patterns does K2 handle well?

    Structured multi-step sequences: invoke an API, parse the response, branch on results, call a second API, and synthesize a final output. The function-calling interface in AI Gateway maps directly to these workflows.

  • Can I bring my own provider credentials?

    Yes. AI Gateway supports Bring Your Own Key for providers where you hold a direct account. BYOK requests are excluded from ZDR coverage.