GLM-4.6 (Reasoning)
Z AI · GLM · Open Weight · MIT · Commercial OK
Description
GLM-4.6 is the latest version of Z.ai's flagship model and brings significant improvements over GLM-4.5. Key features include a 200K-token context window (expanded from 128K), stronger coding performance with better real-world results in Claude Code, Cline, Roo Code, and Kilo Code, advanced reasoning with tool use during inference, stronger agent capabilities, and refined writing aligned with human preferences. GLM-4.6 performs competitively with DeepSeek-V3.2-Exp and Claude Sonnet 4, reaching near parity with Claude Sonnet 4 (48.6% win rate) on CC-Bench real-world coding tasks.
Release date
2025-09-30
Parameters
357.0B
Context length
205K
Supported modalities
image, text, video
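Capability radar chart
General: 45
Coding: 44
Reasoning: 85
Science: 51 (estimated)
Agents: 40
Multimodal: 20
When no dedicated science benchmark is available, the Science score is estimated using reasoning ability as a proxy.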
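Leaderboard ranking
Benchmark scores (LLM Stats)
Agents
BrowseComp: 45.1% (self-reported)
Terminal-Bench: 40.5% (self-reported)
Biology
GPQA: 81.0% (self-reported)
Code
SWE-Bench Verified: 68.0% (self-reported)
General
LiveCodeBench v6: 82.8% (self-reported)
Math
AIME 2025: 93.9% (self-reported)
Humanity's Last Exam: 17.2% (self-reported)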
AA evaluation indices
Math Index: 86.0
Intelligence Index: 32.5
Coding Index: 29.5
AIME 25: 0.9
MMLU-Pro: 0.8
GPQA: 0.8
Tau2: 0.7
LiveCodeBench: 0.7
LCR: 0.5
IFBench: 0.4
SciCode: 0.4
TerminalBench Hard: 0.3
HLE: 0.1
LLM Stats category scores
Biology: 80
Chemistry: 80
General: 80
Physics: 80
Frontend Development: 70
Math: 60
Reasoning: 60
Code: 50
Search: 50
Agents: 40
Vision: 20
Pricing
Input price: $0.55 / 1M tokens
Output price: $2.20 / 1M tokens
Blended price (3:1): $0.963 / 1M tokens
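The blended figure appears to be a weighted average of the input and output prices at a 3:1 input-to-output token ratio. The sketch below reproduces it under that assumption. The function name `blended_price` and its parameters are illustrative and not part of any listed API.

```python
from decimal import Decimal, ROUND_HALF_UP


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for an input:output token ratio.

    Assumption: "blended (3:1)" means 3 input tokens for every 1 output token.
    Prices are passed as strings so Decimal keeps them exact.
    """
    inp = Decimal(input_price)
    out = Decimal(output_price)
    total_parts = Decimal(input_parts + output_parts)
    return (inp * input_parts + out * output_parts) / total_parts


# Listed prices in USD per 1M tokens
blended = blended_price("0.55", "2.2")

# Round to three decimals, as the listing does
rounded = blended.quantize(Decimal("0.001"), rounding=ROUND_HALF_UP)
print(f"${rounded} / 1M tokens")
```

With the listed prices this gives (3 × 0.55 + 1 × 2.20) / 4 = 0.9625, which rounds to the $0.963 shown above.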
Speed
Tokens per second: 37.2 tokens/s
Time to first token: 0.82 s
Time to first answer: 54.62 s
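These figures can be combined into a rough end-to-end latency estimate. The sketch below assumes that "time to first answer" covers everything before the first answer token, including any reasoning phase, and that answer tokens then stream at the listed throughput. Both are simplifying assumptions, and `estimated_response_seconds` is a hypothetical helper.

```python
def estimated_response_seconds(
    answer_tokens: int,
    first_answer_latency_s: float = 54.62,  # listed time to first answer
    tokens_per_second: float = 37.2,        # listed output throughput
) -> float:
    """Rough total time to receive a complete answer of `answer_tokens` tokens.

    Assumption: generation after the first answer token proceeds at a
    constant `tokens_per_second`; real throughput varies by provider and load.
    """
    if answer_tokens < 0:
        raise ValueError("answer_tokens must be non-negative")
    return first_answer_latency_s + answer_tokens / tokens_per_second


# Example: a 372-token answer adds about 10 s of streaming time
total = estimated_response_seconds(372)
print(f"{total:.1f} s")
```

Under these assumptions, the wait before the answer starts dominates the total for short answers.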
Available providers
(LS internal pricing units)

| Provider | Input price | Output price |
|---|---|---|
| Fireworks | 550K | 2.2M |
| DeepInfra | 600K | 2.0M |