跳转到主要内容

GLM-4.6 (Reasoning)

Z AIGLMOpen WeightMIT · Commercial OK

描述

GLM-4.6 is the latest version of Z.ai's flagship model, bringing significant improvements over GLM-4.5. Key features include: 200K token context window (expanded from 128K), superior coding performance with better real-world application in Claude Code/Cline/Roo Code/Kilo Code, advanced reasoning with tool use during inference, stronger agent capabilities, and refined writing aligned with human preferences. GLM-4.6 achieves competitive performance with DeepSeek-V3.2-Exp and Claude Sonnet 4, reaching near parity with Claude Sonnet 4 (48.6% win rate) on CC-Bench real-world coding tasks.

发布日期
2025-09-30
参数规模
357.0B
上下文长度
205K
支持模态
image, text, video

能力雷达图

45
general
44
coding
85
reasoning
51
science估算
40
agents
20
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
智能体与工具84
43.0
LS
代码能力榜111
58.0
AA
通用能力榜135
61.0
AA
数学推理54
87.0
AA
科学能力122
58.0
AA

基准测试分数 (LLM Stats)

Agents

BrowseComp45.1%自报
Terminal-Bench40.5%自报

Biology

GPQA81.0%自报

Code

SWE-Bench Verified68.0%自报

General

LiveCodeBench v682.8%自报

Math

AIME 202593.9%自报
Humanity's Last Exam17.2%自报

AA 评测指数

Math Index
86.0
Intelligence Index
32.5
Coding Index
29.5
Aime 25
0.9
Mmlu Pro
0.8
Gpqa
0.8
Tau2
0.7
Livecodebench
0.7
Lcr
0.5
Ifbench
0.4
Scicode
0.4
Terminalbench Hard
0.3
Hle
0.1

LLM Stats 分类评分

Biology
80
Chemistry
80
General
80
Physics
80
Frontend Development
70
Math
60
Reasoning
60
Code
50
Search
50
Agents
40
Vision
20

定价

输入价格$0.55 / 1M tokens
输出价格$2.2 / 1M tokens
混合价格(3:1)$0.963 / 1M tokens

速度

Tokens/秒37.2 tokens/s
首Token延迟0.82s
首回答延迟54.62s

可用提供商

(LS 内部计价单位)
提供商输入价格输出价格
Fireworks550K2.2M
DeepInfra600K2.0M

外部链接