GLM 4.5 Air
by Z.ai
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean.
Specifications
Technical details and pricing.
Benchmarks
12 benchmark scores from Artificial Analysis.
Composite Indices
Intelligence, Coding, Math
Standard Benchmarks
Academic and industry benchmarks
Frequently Asked Questions
What is GLM 4.5 Air good for?
Use GLM 4.5 Air for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does GLM 4.5 Air cost?
Pricing is based on usage. Current rates are $0.49/1M tokens for input and $1.90/1M tokens for output.
Can I try GLM 4.5 Air for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does GLM 4.5 Air support images or audio?
GLM 4.5 Air focuses on text-based tasks.
Similar Models
Other models you might want to explore.
Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.