Z.ai Models
Z.ai logo

GLM 4.5 Air

by Z.ai

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean.

Chat with GLM 4.5 Air
Input Price$0.49/1M tokens
Output Price$1.90/1M tokens
Intelligence26.2
Coding26.3

Specifications

Technical details and pricing.

ProviderZ.ai
Context Window131,072 tokens
Release DateJul 28, 2025
ModalitiesText

Benchmarks

12 benchmark scores from Artificial Analysis.

GPQA78.2%
MMLU Pro83.5%
HLE12.2%
LiveCodeBench73.8%
MATH 50097.9%
AIME 202573.7%
AIME87.3%
SciCode34.8%
LCR48.3%
IFBench44.1%
Tau243.0%
TerminalBench Hard22.0%

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Frequently Asked Questions

What is GLM 4.5 Air good for?

Use GLM 4.5 Air for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does GLM 4.5 Air cost?

Pricing is based on usage. Current rates are $0.49/1M tokens for input and $1.90/1M tokens for output.

Can I try GLM 4.5 Air for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does GLM 4.5 Air support images or audio?

GLM 4.5 Air focuses on text-based tasks.

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.