Skip to content

GLM 4.6

GLM 4.6 is Z.ai's coding-focused model released September 30, 2025, with enhanced performance on both benchmarks and real-world programming tasks. It features an expanded context window of 204.8K tokens for handling large codebases and complex agent workflows.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'zai/glm-4.6',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Z.ai
200K
4.9s
105tps
$0.60/M$2.20/M
Read:$0.11/M
Write:
+1
09/30/2025
DeepInfra
203K
0.4s
29tps
$0.45/M$1.90/M
Read:$0.11/M
Write:
09/30/2025
Novita AI
205K
1.3s
111tps
$0.60/M$2.20/M
Read:$0.11/M
Write:
+1
09/30/2025