Skip to content

KAT-Coder-Pro V1

KAT-Coder-Pro V1 is KwaiPilot's agentic coding model. It achieves a 73.4% resolve rate on SWE-Bench Verified with a context window of 256K tokens, parallel tool calling, and multi-turn support.

Reasoning
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'kwaipilot/kat-coder-pro-v1',
prompt: 'Why is the sky blue?'
})

Playground

Try out KAT-Coder-Pro V1 by KwaiPilot. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Novita AI
Legal:Terms
Privacy
256K
3.6s
93tps
$0.03/M$1.20/M
Read:$0.06/M
Write:
10/24/2025
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by KwaiPilot

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
256K
1.9s
103tps
$0.30/M$1.20/M
Read:$0.06/M
Write:
streamlake logo
03/27/2026

About KAT-Coder-Pro V1

KAT-Coder-Pro V1 is the first production release from KwaiPilot's KAT-Coder series, designed for agentic software engineering at repository scale. On SWE-Bench Verified, a standard benchmark for autonomous issue resolution, it achieves a 73.4% resolve rate.

The model's agentic design centers on two capabilities. Parallel tool calling lets the model issue multiple tool calls simultaneously instead of waiting sequentially. Multi-turn support enables sustained coding sessions. That efficiency gain matters for agent loops billed per token or per call.

KAT-Coder-Pro V1 covers eight task types: feature implementation, feature enhancement, bug fixing, refactoring, performance optimization, test case generation, code understanding, and configuration and deployment. The context window of 256K tokens lets it ingest large codebases, multi-file diffs, and extended conversation history in a single pass. Full technical details appear at https://novita.ai.

What To Consider When Choosing a Provider

  • Configuration: KAT-Coder-Pro V1 targets multi-turn agentic sessions. Factor context window consumption and session state into your integration planning. See https://novita.ai for methodology and benchmark details from KwaiPilot.
  • Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use KAT-Coder-Pro V1

Best For

  • Automated issue resolution: Pull request generation in real repositories end-to-end
  • Multi-file refactoring: Refactoring workflows that span many files in one session
  • Parallel tool pipelines: Agent pipelines where parallel tool calling reduces total completion time
  • Scale test generation: Test case generation across existing codebases

Consider Alternatives When

  • General reasoning needs: Writing or multimodal input sits outside a coding-tuned model's scope
  • Simple completions: A lighter model suffices with minimal context
  • LiveCodeBench benchmark: Competitive programming is the primary evaluation criterion

Conclusion

KAT-Coder-Pro V1 pairs a 73.4% SWE-Bench Verified resolve rate with parallel tool calling and multi-turn support. Use it for software engineering automation across real repository tasks. Route requests through AI Gateway for access via novita.

Frequently Asked Questions

  • What is KAT-Coder-Pro V1's SWE-Bench Verified score?

    73.4% resolve rate on SWE-Bench Verified, which tests autonomous resolution of real GitHub issues.

  • What does parallel tool calling mean in practice?

    The model issues multiple tool calls in a single inference step instead of waiting for each response sequentially. That cuts latency when several operations can run at once.

  • What is the context window size?

    KAT-Coder-Pro V1 has a context window of 256K tokens. You can fit large codebases, multi-file diffs, and extended conversation history in a single context.

  • What types of software engineering tasks does it support?

    Eight task types: feature implementation, feature enhancement, bug fixing, refactoring, performance optimization, test case generation, code understanding, and configuration and deployment.

  • What is the pricing for KAT-Coder-Pro V1?

    Current pricing is shown on this page. AI Gateway routes across providers, and rates may vary by provider.

  • How do I try KAT-Coder-Pro V1?

    Open the playground at https://ai-sdk.dev/playground/novita:kwaipilot/kat-coder-pro or call the model through AI Gateway with your provider credentials.