GLM 4.6
by Z.ai
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.
Specifications
Technical details and pricing.
Benchmarks
10 benchmark scores from Artificial Analysis.
Composite Indices
Intelligence, Coding, Math
Standard Benchmarks
Academic and industry benchmarks
Frequently Asked Questions
What is GLM 4.6 good for?
Use GLM 4.6 for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does GLM 4.6 cost?
Pricing is based on usage. Current rates are $0.57/1M tokens for input and $2.20/1M tokens for output.
Can I try GLM 4.6 for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does GLM 4.6 support images or audio?
GLM 4.6 focuses on text-based tasks.
Similar Models
Other models you might want to explore.
Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.