
GLM-4.6 (Reasoning)

Z AI · GLM · Open Weight · MIT · Commercial OK

Description

GLM-4.6 is the latest version of Z.ai's flagship model and brings significant improvements over GLM-4.5. Key features include a 200K-token context window (expanded from 128K), stronger coding performance with better real-world results in Claude Code, Cline, Roo Code, and Kilo Code, advanced reasoning with tool use during inference, stronger agent capabilities, and refined writing aligned with human preferences. GLM-4.6 is competitive with DeepSeek-V3.2-Exp and Claude Sonnet 4, reaching near parity with Claude Sonnet 4 (48.6% win rate) on CC-Bench real-world coding tasks.

Release Date: 2025-09-30
Parameters: 357.0B
Context Length: 205K
Modalities: image, text, video

Capability Radar

general: 45
coding: 44
reasoning: 85
science (est.): 51
agents: 40
multimodal: 20

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain            #Rank   Score   Source
Agents & Tools    84      43.0    LS
Code Ranking      111     58.0    AA
General Ranking   135     61.0    AA
Math Reasoning    54      87.0    AA
Science           122     58.0    AA

Benchmark Scores (LLM Stats)

Agents

BrowseComp: 45.1% (SR)
Terminal-Bench: 40.5% (SR)

Biology

GPQA: 81.0% (SR)

Code

SWE-Bench Verified: 68.0% (SR)

General

LiveCodeBench v6: 82.8% (SR)

Math

AIME 2025: 93.9% (SR)
Humanity's Last Exam: 17.2% (SR)

AA Evaluation Indices

Math Index: 86.0
Intelligence Index: 32.5
Coding Index: 29.5
AIME 25: 0.9
MMLU Pro: 0.8
GPQA: 0.8
Tau2: 0.7
LiveCodeBench: 0.7
LCR: 0.5
IFBench: 0.4
SciCode: 0.4
TerminalBench Hard: 0.3
HLE: 0.1

LLM Stats Category Scores

Biology: 80
Chemistry: 80
General: 80
Physics: 80
Frontend Development: 70
Math: 60
Reasoning: 60
Code: 50
Search: 50
Agents: 40
Vision: 20

Pricing

Input Price: $0.55 / 1M tokens
Output Price: $2.20 / 1M tokens
Blended Price (3:1): $0.963 / 1M tokens
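
The blended figure can be reproduced from the two listed prices. The sketch below assumes that "3:1" means three input tokens for every output token, which this page does not state explicitly; the function name and its parameters are illustrative only.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumption: the 3:1 ratio is input tokens to output tokens.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Listed prices in USD per 1M tokens.
price = blended_price(0.55, 2.20)
print(f"${price:.4f} / 1M tokens")
```

Under that assumption the result is $0.9625, which is consistent with the listed $0.963 after rounding to three decimals.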

Speed

Tokens/sec: 37.2 tokens/s
Time to First Token: 0.82s
Time to Answer: 54.62s
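
For a rough sense of how the throughput and first-token figures combine, a simple linear estimate can be sketched as below. This is an assumed model (time to first token plus output tokens divided by throughput), not the methodology behind the listed Time to Answer, and it ignores queueing, network variance, and any hidden reasoning tokens. The function name is hypothetical.

```python
def estimate_response_seconds(output_tokens: int,
                              tokens_per_second: float = 37.2,
                              time_to_first_token: float = 0.82) -> float:
    """Rough wall-clock estimate for generating `output_tokens` tokens.

    Assumption: latency = time to first token + tokens / throughput.
    Defaults are the speed figures listed on this page.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return time_to_first_token + output_tokens / tokens_per_second


# Example: a 372-token response at the listed speed.
print(f"{estimate_response_seconds(372):.2f} s")
```

Because reasoning models emit many tokens before the final answer, the token count passed in should include reasoning tokens if the goal is to approximate end-to-end time.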

Available Providers

(LS internal units)
Provider    Input Price   Output Price
Fireworks   550K          2.2M
DeepInfra   600K          2.0M
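
The provider prices are given in LS internal units, which this page does not define. One reading that fits the data is that a unit is one millionth of a dollar per 1M tokens: Fireworks at 550K and 2.2M would then match the $0.55 and $2.2 prices listed under Pricing. The sketch below converts under that assumption only; the helper names and the `units_per_dollar` default are hypothetical.

```python
_SUFFIXES = {"K": 1_000, "M": 1_000_000}


def parse_units(text: str) -> float:
    """Parse strings like '550K' or '2.2M' into a plain number."""
    text = text.strip()
    suffix = text[-1].upper()
    if suffix in _SUFFIXES:
        return float(text[:-1]) * _SUFFIXES[suffix]
    return float(text)


def units_to_usd_per_million(text: str, units_per_dollar: float = 1_000_000) -> float:
    """Convert LS internal units to USD per 1M tokens.

    Assumption (not confirmed by the page): 1,000,000 units = $1.
    """
    return parse_units(text) / units_per_dollar


providers = {"Fireworks": ("550K", "2.2M"), "DeepInfra": ("600K", "2.0M")}
for name, (inp, out) in providers.items():
    print(name,
          f"input ${units_to_usd_per_million(inp):.2f}",
          f"output ${units_to_usd_per_million(out):.2f}")
```

If the unit definition turns out to be different, only `units_per_dollar` needs to change.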

External Sources