Kimi K2 0905

moonshotai/kimi-k2-0905

Kimi K2 0905 has shown strong performance on agentic tasks thanks to its tool calling, reasoning abilities, and long context handling. But as a large parameter model (1T parameters), it’s also resource-intensive. Running it in production requires a highly optimized inference stack to avoid excessive latency.

import { streamText } from 'ai'

const result = streamText({
  model: 'moonshotai/kimi-k2-0905',
  prompt: 'Why is the sky blue?'
})

Playground

Try out Kimi K2 0905 by Moonshot AI. Usage is billed to your team at API rates. Free users get $5 of credits every 30 days, and you are considered a free user if you haven't made a payment.

Chat with

Providers

The AI Gateway supports routing requests across multiple AI providers. You can control provider preferences using the provider slugs available for copying with the buttons below. For more see the AI Gateway provider options documentation. By using the AI provider you acknowledge you reviewed and agree to their terms listed in the Legal section under the AI provider's name.

Provider

Context	Max Output	Latency	Throughput	Input	Output	Cache	Image Gen	Video Gen	Web Search	Capabilities	ZDR	Release Date

Legal:Terms

•

Privacy

131K

16K

1.1s

$0.60/M

$2.50/M

—

09/05/2025

Legal:Terms

•

Privacy

256K

128K

0.5s

57tps

$0.60/M

$2.50/M

—

09/05/2025

Legal:Terms

•

Privacy

262K

16K

0.1s

$1.00/M

$3.00/M

Read:$0.50/M

Write:—

—

09/05/2025

Legal:Terms

•

Privacy

256K

16K

1.7s

13tps

$0.60/M

$2.50/M

—

09/05/2025

More models by Moonshot AI

Model

Context	Max Output	Latency	Throughput	Input	Output	Cache	Image Gen	Video Gen	Web Search	Capabilities	Providers	ZDR	Release Date

131K

0.6s

63tps

$0.50/M

$2.00/M

—

09/05/2025

262K

0.3s

133tps

$0.60/M

$2.50/M

Read:$0.15/M

Write:—

—

11/06/2025

262K

0.8s

96tps

$1.15/M

$8.00/M

Read:$0.15/M

Write:—

—

11/06/2025

256K

16K

4.4s

57tps

$2.40/M

$10.00/M

—

09/05/2025

262K

0.3s

147tps

$0.50/M

$2.80/M

Read:$0.10/M

Write:—

—

01/26/2026

Playground

Try out Kimi K2 0905 by Moonshot AI. Usage is billed to your team at API rates. Free users get $5 of credits every 30 days, and you are considered a free user if you haven't made a payment.

Chat with

Providers

Provider

Context	Max Output	Latency	Throughput	Input	Output	Cache	Image Gen	Video Gen	Web Search	Capabilities	ZDR	Release Date

Legal:Terms

•

Privacy

131K

16K

1.1s

$0.60/M

$2.50/M

—

09/05/2025

Legal:Terms

•

Privacy

256K

128K

0.5s

57tps

$0.60/M

$2.50/M

—

09/05/2025

Legal:Terms

•

Privacy

262K

16K

0.1s

$1.00/M

$3.00/M

Read:$0.50/M

Write:—

—

09/05/2025

Legal:Terms

•

Privacy

256K

16K

1.7s

13tps

$0.60/M

$2.50/M

—

09/05/2025

More models by Moonshot AI

Model

Context	Max Output	Latency	Throughput	Input	Output	Cache	Image Gen	Video Gen	Web Search	Capabilities	Providers	ZDR	Release Date

131K

0.6s

63tps

$0.50/M

$2.00/M

—

09/05/2025

262K

0.3s

133tps

$0.60/M

$2.50/M

Read:$0.15/M

Write:—

—

11/06/2025

262K

0.8s

96tps

$1.15/M

$8.00/M

Read:$0.15/M

Write:—

—

11/06/2025

256K

16K

4.4s

57tps

$2.40/M

$10.00/M

—

09/05/2025

262K

0.3s

147tps

$0.50/M

$2.80/M

Read:$0.10/M

Write:—

—

01/26/2026

AI Cloud

Core Platform

Security

Company

Learn

Open Source

Use Cases

Tools

Users

Kimi K2 0905

Playground

Providers

More models by Moonshot AI

Playground

Providers

More models by Moonshot AI