GLM 4.7 Flash

Name: GLM 4.7 Flash
Brand: Z.ai
Price: 0.065 USD

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

Input Price: $0.07/1M tokens

Output Price: $0.40/1M tokens

Intelligence: 30.1

Coding: 25.9

Specifications

Technical details and pricing.

Provider: Z.ai

Context Window: 202,752 tokens

Release Date: Jan 19, 2026

Modalities: Text

Benchmarks

7 benchmark scores from Artificial Analysis.

GPQA: 58.1%

HLE: 7.1%

SciCode: 33.7%

LCR: 35.0%

IFBench: 60.8%

Tau2: 98.8%

TerminalBench Hard: 22.0%

Frequently Asked Questions

What is GLM 4.7 Flash good for?

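GLM 4.7 Flash is optimized for agentic coding: writing code, long-horizon task planning, and tool collaboration. It also handles everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.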

How much does GLM 4.7 Flash cost?

Pricing is based on usage. Current rates are $0.07/1M tokens for input and $0.40/1M tokens for output.
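To make the usage-based pricing concrete, here is a minimal sketch that estimates the cost of a single request from its token counts. The two rates come from the listing above; the function name `estimate_cost_usd` and the example token counts are illustrative assumptions, and actual billing may differ (for example through rounding or provider-specific rules).

```python
from decimal import Decimal

# Rates copied from the listing above (USD per 1M tokens).
INPUT_USD_PER_MTOK = Decimal("0.07")
OUTPUT_USD_PER_MTOK = Decimal("0.40")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: estimated usage-based cost of one request, in USD."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_USD_PER_MTOK
             + Decimal(output_tokens) * OUTPUT_USD_PER_MTOK)
    return total / TOKENS_PER_MTOK


# Example (assumed workload): a 10,000-token prompt with a 2,000-token reply.
cost = estimate_cost_usd(input_tokens=10_000, output_tokens=2_000)
print(f"${cost:.4f}")  # prints "$0.0015"
```

Decimal arithmetic is used here so that very small per-request amounts do not pick up binary floating-point noise when summed over many requests.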

Can I try GLM 4.7 Flash for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does GLM 4.7 Flash support images or audio?

No. GLM 4.7 Flash is a text-only model; its only listed modality is Text.

Similar Models

Other models you might want to explore.

GLM 4.7

Z.ai

GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution.

GLM 4.6

Z.ai

Compared with GLM-4.5, this generation brings several key improvements, including a longer context window: it has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks.

GLM 4 32B

Z.ai

GLM 4 32B is a cost-effective foundation language model.

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.