GLM 5

GLM 5 is Z.ai's GLM-5 generation model, released February 12, 2026. It features multiple thinking modes, enhanced long-range planning and memory, and improved handling of complex multi-step agent tasks, and it supports agentic coding and structured data extraction workflows.

Reasoning, Tool Use, Implicit Caching
index.ts
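import { streamText } from 'ai'
const result = streamText({
  model: 'zai/glm-5',
  prompt: 'Why is the sky blue?',
})
// Print each text chunk as it arrives.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}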

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
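One way to arrive at a preference is to rank the listed providers by a measured metric. The sketch below is only an illustration: the `ProviderStats` type, the `rankByThroughput` helper, and the lowercase slugs are all hypothetical, and the real slugs should be copied from the provider list rather than taken from this example. The latency and throughput figures are the ones shown in the provider table on this page.

```typescript
// Hypothetical shape for one row of the provider table.
interface ProviderStats {
  slug: string // assumed slug; copy the real one from the provider list
  latencySec?: number // latency as listed, if reported
  throughputTps?: number // tokens per second, if reported
}

// Figures copied from the provider table; slugs are illustrative guesses.
const providers: ProviderStats[] = [
  { slug: 'zai', latencySec: 6.7, throughputTps: 68 },
  { slug: 'fireworks' }, // no latency or throughput listed
  { slug: 'deepinfra', latencySec: 1.1, throughputTps: 20 },
  { slug: 'parasail', latencySec: 0.4, throughputTps: 41 },
  { slug: 'bedrock', latencySec: 1.1, throughputTps: 74 },
  { slug: 'baseten', latencySec: 0.3, throughputTps: 155 },
  { slug: 'novita', latencySec: 1.7, throughputTps: 46 },
]

// Rank by throughput, highest first; providers with no figure sort last.
function rankByThroughput(rows: ProviderStats[]): string[] {
  return [...rows]
    .sort((a, b) => (b.throughputTps ?? -1) - (a.throughputTps ?? -1))
    .map((row) => row.slug)
}

const preferredOrder = rankByThroughput(providers)
console.log(preferredOrder)
```

The resulting array could then be supplied wherever the gateway accepts a provider preference; the exact option name is described in the docs and is not assumed here. Ranking by latency or price instead would only require a different comparator.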

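| Provider | Context | Latency | Throughput | Input | Output | Cache | Web Search | Per Query | Capabilities | ZDR | No Training | Release Date |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Z.ai | 203K | 6.7s | 68tps | $1.00/M | $3.20/M | Read: $0.2/M; Write: not listed | | | +1 | | | 02/12/2026 |
| Fireworks | 203K | | | $1.00/M | $3.20/M | Read: $0.2/M; Write: not listed | | | +1 | | | 02/12/2026 |
| DeepInfra | 203K | 1.1s | 20tps | $0.80/M | $2.56/M | Read: $0.16/M; Write: not listed | | | +1 | | | 02/12/2026 |
| Parasail | 203K | 0.4s | 41tps | $1.00/M | $3.20/M | | | | | | | 02/12/2026 |
| Amazon Bedrock | 203K | 1.1s | 74tps | $1.00/M | $3.20/M | | | | | | | 02/12/2026 |
| Baseten | 203K | 0.3s | 155tps | $0.95/M | $3.15/M | Read: $0.2/M; Write: not listed | | | +1 | | | 02/12/2026 |
| Novita AI | 203K | 1.7s | 46tps | $1.00/M | $3.20/M | Read: $0.2/M; Write: not listed | | | +1 | | | 02/12/2026 |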
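Because prices are quoted per million tokens, the cost of a single request follows from simple arithmetic. The sketch below is a rough estimator, not the gateway's billing logic: the `Pricing` type and `estimateCostUsd` function are hypothetical, and it assumes that cached input tokens are billed at the cache-read rate in place of the normal input rate.

```typescript
// Per-million-token prices in USD, as listed for a provider.
interface Pricing {
  inputPerM: number
  outputPerM: number
  cacheReadPerM?: number // only some providers list a cache-read price
}

// Estimate the USD cost of one request.
// cachedInputTokens is the part of inputTokens served from cache
// (assumed billing model: cached tokens use the cache-read rate).
function estimateCostUsd(
  pricing: Pricing,
  inputTokens: number,
  outputTokens: number,
  cachedInputTokens = 0,
): number {
  // Fall back to the normal input rate when no cache-read price is listed.
  const cacheRate = pricing.cacheReadPerM ?? pricing.inputPerM
  const freshInputTokens = inputTokens - cachedInputTokens
  const totalPerMillion =
    freshInputTokens * pricing.inputPerM +
    cachedInputTokens * cacheRate +
    outputTokens * pricing.outputPerM
  return totalPerMillion / 1_000_000
}

// Prices copied from the Z.ai and DeepInfra listings.
const zai: Pricing = { inputPerM: 1.0, outputPerM: 3.2, cacheReadPerM: 0.2 }
const deepInfra: Pricing = { inputPerM: 0.8, outputPerM: 2.56, cacheReadPerM: 0.16 }

console.log(estimateCostUsd(zai, 100_000, 10_000).toFixed(4))
console.log(estimateCostUsd(deepInfra, 100_000, 10_000).toFixed(4))
```

Under these assumptions, a request with 100,000 input tokens and 10,000 output tokens comes to roughly $0.132 at the Z.ai prices and roughly $0.106 at the DeepInfra prices; actual charges depend on how each provider counts and caches tokens.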