Question 1

How does sparse routing translate to cost savings?

Accepted Answer

The full 1T parameters store broad knowledge, but only ~32B activate per token via the expert router. You pay compute proportional to a 32B dense model while drawing on knowledge encoded across the entire trillion-parameter budget.

Question 2

Why does base K2 list many providers on AI Gateway?

Accepted Answer

It was the first K2 variant adopted across providers, so routing across Novita AI reflects earlier integration. Later checkpoints and variants can have narrower provider sets.

Question 3

Is K2 text-only?

Accepted Answer

Yes. Kimi K2 Instruct accepts and produces text. Multimodal capabilities are not part of this release.

Question 4

What agentic patterns does K2 handle well?

Accepted Answer

Structured multi-step sequences: invoke an API, parse the response, branch on results, call a second API, and synthesize a final output. The function-calling interface in AI Gateway maps directly to these workflows.

Question 5

Can I bring my own provider credentials?

Accepted Answer

Yes. AI Gateway supports Bring Your Own Key for providers where you hold a direct account. BYOK requests are excluded from ZDR coverage.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Kimi K2 Instruct

Frequently Asked Questions