GLM 4.5 Air
GLM 4.5 Air is Z.ai's efficiency-focused model released July 28, 2025. It delivers fast inference for high-volume workloads while keeping reasoning and coding capability at reduced cost compared to the full GLM-4.5.
import { streamText } from 'ai'
const result = streamText({
  model: 'zai/glm-4.5-air',
  prompt: 'Why is the sky blue?',
})
Frequently Asked Questions
How does GLM 4.5 Air compare to the full GLM-4.5?
GLM 4.5 Air is the lightweight variant optimized for faster inference and lower cost. GLM-4.5 provides deeper reasoning capability at higher per-token cost. Both share the same API surface and context window.
What is GLM 4.5 Air best suited for?
High-volume tasks where speed and cost matter: classification, extraction, summarization, and conversational applications. For deep reasoning tasks, consider the full GLM-4.5 or GLM-5.
What is the context window for GLM 4.5 Air?
128K tokens, matching the full GLM-4.5 model.
How do I switch between GLM 4.5 Air and GLM-4.5?
Change the model identifier in your API call. Both models share the same API interface, so no other integration changes are needed.
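As a minimal sketch of that switch, the helper below builds the options object for a call such as streamText and varies only the model field. The identifier 'zai/glm-4.5-air' comes from the example on this page; 'zai/glm-4.5' is assumed by analogy and should be checked against the model list. The names MODELS, Variant, and buildRequest are hypothetical, introduced only for illustration.

```typescript
// 'zai/glm-4.5-air' is taken from this page's example.
// 'zai/glm-4.5' is an ASSUMED identifier for the full model; verify before use.
const MODELS = {
  air: 'zai/glm-4.5-air',
  full: 'zai/glm-4.5',
} as const;

type Variant = keyof typeof MODELS;

// Hypothetical helper: builds the options for a text-generation call.
// Only the `model` field differs between the two variants.
function buildRequest(variant: Variant, prompt: string): { model: string; prompt: string } {
  return { model: MODELS[variant], prompt };
}
```

Passing buildRequest('air', ...) or buildRequest('full', ...) to the same call site is then the only change needed to move between the two models.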
How do I authenticate with GLM 4.5 Air through AI Gateway?
AI Gateway provides a single API key that works across providers. Set it in your environment and use the model identifier to route requests. No separate Z.ai account is required, though bring-your-own-key (BYOK) is supported.
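One way to fail fast when the key is missing is to check the environment at startup. This is a sketch under an assumption: the variable name AI_GATEWAY_API_KEY is not stated on this page and should be confirmed against the AI Gateway documentation. The function name requireGatewayKey is hypothetical.

```typescript
// Hypothetical startup check. The variable name AI_GATEWAY_API_KEY is an
// ASSUMPTION; confirm the exact name in the AI Gateway documentation.
function requireGatewayKey(env: Record<string, string | undefined>): string {
  const key = env['AI_GATEWAY_API_KEY'];
  if (key === undefined || key.trim() === '') {
    throw new Error('AI_GATEWAY_API_KEY is not set');
  }
  return key;
}
```

In a Node.js application this would typically be called once as requireGatewayKey(process.env) before any requests are made.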
Is GLM 4.5 Air suitable for agentic workflows?
Yes, for agent steps that prioritize speed over deep reasoning. Route planning-heavy steps to GLM-4.5 or GLM-5, and use GLM 4.5 Air for the faster execution steps.
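The routing idea can be sketched as a small function that picks a model per step. The step kinds, the AgentStep type, and the function names are hypothetical. 'zai/glm-4.5' is an assumed identifier for the full model; only 'zai/glm-4.5-air' appears on this page.

```typescript
// Hypothetical step classification for an agent loop.
type StepKind = 'plan' | 'execute';

interface AgentStep {
  kind: StepKind;
  prompt: string;
}

// Planning steps go to the full model (ASSUMED id 'zai/glm-4.5');
// execution steps go to GLM 4.5 Air (id taken from this page).
function modelForStep(kind: StepKind): string {
  return kind === 'plan' ? 'zai/glm-4.5' : 'zai/glm-4.5-air';
}

// Attach a model identifier to each step without changing step order.
function routeSteps(steps: AgentStep[]): Array<AgentStep & { model: string }> {
  return steps.map((step) => ({ ...step, model: modelForStep(step.kind) }));
}
```

Each routed step carries the model identifier to use for its request, so the agent loop itself stays unchanged. How a step is classified as planning or execution is left to the application.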
What is the pricing for GLM 4.5 Air?
Check the pricing panel on this page for today's numbers. AI Gateway tracks rates across every provider that serves GLM 4.5 Air.