GLM 4.5 Air

GLM 4.5 Air is Z.ai's efficiency-focused model, released July 28, 2025. It delivers fast inference for high-volume workloads and retains reasoning and coding capability at a lower cost than the full GLM-4.5.

Reasoning, Tool Use, Implicit Caching
index.ts
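import { streamText } from 'ai'

// Stream a completion from GLM 4.5 Air through AI Gateway.
const result = streamText({
  model: 'zai/glm-4.5-air',
  prompt: 'Why is the sky blue?'
})

// Print each text chunk as it arrives.
for await (const chunk of result.textStream) process.stdout.write(chunk)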

Frequently Asked Questions

  • How does GLM 4.5 Air compare to the full GLM-4.5?

    GLM 4.5 Air is the lightweight variant optimized for faster inference and lower cost. GLM-4.5 provides deeper reasoning capability at higher per-token cost. Both share the same API surface and context window.

  • What is GLM 4.5 Air best suited for?

    High-volume tasks where speed and cost matter: classification, extraction, summarization, and conversational applications. For deep reasoning tasks, consider the full GLM-4.5 or GLM-5.

  • What is the context window for GLM 4.5 Air?

    128K tokens, matching the full GLM-4.5 model.

  • How do I switch between GLM 4.5 Air and GLM-4.5?

    Change the model identifier in your API call. Both models share the same API interface, so no other integration changes are needed.

  • How do I authenticate with GLM 4.5 Air through AI Gateway?

    AI Gateway provides a unified API key. Configure it in your environment and use the model identifier to route requests. No separate Z.ai account is required, though bringing your own key (BYOK) is supported.

  • Is GLM 4.5 Air suitable for agentic workflows?

    Yes, for agent steps that prioritize speed over deep reasoning. Route planning-heavy steps to GLM-4.5 or GLM-5, and use GLM 4.5 Air for the faster execution steps.

  • What is the pricing for GLM 4.5 Air?

    Check the pricing panel on this page for today's numbers. AI Gateway tracks rates across every provider that serves GLM 4.5 Air.
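
The answers above on switching models and on agentic workflows both come down to choosing a model identifier per request. A minimal sketch of that routing, under stated assumptions: `StepKind` and `pickModel` are hypothetical names invented for illustration, and `'zai/glm-4.5'` is an assumed identifier inferred by analogy with `'zai/glm-4.5-air'`, so confirm it against the model list before relying on it.

```typescript
// Hypothetical step categories for an agent loop; not part of any SDK.
type StepKind = 'plan' | 'execute'

// 'zai/glm-4.5-air' is the identifier used in the example on this page.
// 'zai/glm-4.5' is an ASSUMED identifier, inferred by analogy; verify before use.
const FAST_MODEL = 'zai/glm-4.5-air'
const DEEP_MODEL = 'zai/glm-4.5'

// Return the model identifier for a step: the full model for planning,
// the Air variant for everything else.
function pickModel(kind: StepKind): string {
  return kind === 'plan' ? DEEP_MODEL : FAST_MODEL
}

// Example run: plan once, then execute twice.
const steps: StepKind[] = ['plan', 'execute', 'execute']
const models = steps.map(pickModel)
console.log(models)
```

The returned string can be passed as the `model` value in the same kind of call shown in `index.ts`. Because both models share one API interface, nothing else in the request should need to change.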