Z.ai Models
Z.ai logo

GLM 4.7 Flash

by Z.ai

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

Chat with GLM 4.7 Flash
Input Price$0.07/1M tokens
Output Price$0.40/1M tokens
Intelligence30.1
Coding25.9

Specifications

Technical details and pricing.

ProviderZ.ai
Context Window202,752 tokens
Release DateJan 19, 2026
ModalitiesText

Benchmarks

7 benchmark scores from Artificial Analysis.

GPQA58.1%
HLE7.1%
SciCode33.7%
LCR35.0%
IFBench60.8%
Tau298.8%
TerminalBench Hard22.0%

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Frequently Asked Questions

What is GLM 4.7 Flash good for?

Use GLM 4.7 Flash for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does GLM 4.7 Flash cost?

Pricing is based on usage. Current rates are $0.07/1M tokens for input and $0.40/1M tokens for output.

Can I try GLM 4.7 Flash for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does GLM 4.7 Flash support images or audio?

GLM 4.7 Flash focuses on text-based tasks.

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.