跳轉到主要內容

GLM-4.6 (Reasoning)

Z AIGLMOpen WeightMIT · Commercial OK

描述

GLM-4.6 is the latest version of Z.ai's flagship model, bringing significant improvements over GLM-4.5. Key features include: 200K token context window (expanded from 128K), superior coding performance with better real-world application in Claude Code/Cline/Roo Code/Kilo Code, advanced reasoning with tool use during inference, stronger agent capabilities, and refined writing aligned with human preferences. GLM-4.6 achieves competitive performance with DeepSeek-V3.2-Exp and Claude Sonnet 4, reaching near parity with Claude Sonnet 4 (48.6% win rate) on CC-Bench real-world coding tasks.

發布日期
2025-09-30
參數規模
357.0B
上下文長度
205K
支援模態
image, text, video

能力雷達圖

45
general
44
coding
85
reasoning
51
science估算
40
agents
20
multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域#排名分數來源
智能体与工具84
43.0
LS
代码能力榜111
58.0
AA
通用能力榜135
61.0
AA
数学推理54
87.0
AA
科学能力122
58.0
AA

基準測試分數 (LLM Stats)

Agents

BrowseComp45.1%自報
Terminal-Bench40.5%自報

Biology

GPQA81.0%自報

Code

SWE-Bench Verified68.0%自報

General

LiveCodeBench v682.8%自報

Math

AIME 202593.9%自報
Humanity's Last Exam17.2%自報

AA 評測指數

Math Index
86.0
Intelligence Index
32.5
Coding Index
29.5
Aime 25
0.9
Mmlu Pro
0.8
Gpqa
0.8
Tau2
0.7
Livecodebench
0.7
Lcr
0.5
Ifbench
0.4
Scicode
0.4
Terminalbench Hard
0.3
Hle
0.1

LLM Stats 分類評分

Biology
80
Chemistry
80
General
80
Physics
80
Frontend Development
70
Math
60
Reasoning
60
Code
50
Search
50
Agents
40
Vision
20

定價

輸入價格$0.55 / 1M tokens
輸出價格$2.2 / 1M tokens
混合價格(3:1)$0.963 / 1M tokens

速度

Tokens/秒37.2 tokens/s
首Token延遲0.82s
首回答延遲54.62s

可用提供商

(LS 內部計價單位)
提供商輸入價格輸出價格
Fireworks550K2.2M
DeepInfra600K2.0M

外部連結