跳转到主要内容

GLM-4.7 (Reasoning)

Z AIGLMOpen WeightMIT · Commercial OK

描述

GLM 4.7 is a coding‑centric model that thinks before acting, preserves its reasoning across turns, and lets you control thinking per request for speed or accuracy. It upgrades agentic workflows with stronger multi‑step tool use, better terminal and multilingual coding, and a noticeable jump in UI output quality for modern, clean webpages and slides. You can use it in popular coding agents, call it via the Z.ai API, and even run it locally with public weights on HuggingFace and ModelScope using vLLM or SGLang.

发布日期
2025-12-22
参数规模
358.0B
上下文长度
203K
支持模态
text

能力雷达图

53
general
56
coding
93
reasoning
59
science估算
60
agents
0
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
智能体与工具59
55.0
LS
代码能力榜57
71.0
AA
通用能力榜39
81.0
AA
数学推理13
96.0
AA
推理能力55
67.0
LS
科学能力44
74.0
AA

基准测试分数 (LLM Stats)

Agents

Tau-bench87.4%自报
BrowseComp52.0%自报
Terminal-Bench 2.041.0%自报
Terminal-Bench33.3%自报

Biology

GPQA85.7%自报

Code

SWE-Bench Verified73.8%自报
SWE-bench Multilingual66.7%自报

Finance

MMLU-Pro84.3%自报

General

LiveCodeBench v684.9%自报

Math

AIME 202595.7%自报
IMO-AnswerBench82.0%自报
Humanity's Last Exam42.8%自报

Reasoning

BrowseComp-zh66.6%自报

AA 评测指数

Math Index
95.0
Intelligence Index
42.1
Coding Index
36.3
Tau2
1.0
Aime 25
0.9
Livecodebench
0.9
Gpqa
0.9
Mmlu Pro
0.9
Ifbench
0.7
Lcr
0.6
Scicode
0.5
Terminalbench Hard
0.3
Hle
0.3

LLM Stats 分类评分

Biology
90
Chemistry
90
General
90
Physics
90
Finance
80
Healthcare
80
Language
80
Legal
80
Math
80
Frontend Development
70
Reasoning
70
Tool Calling
60
Search
60
Agents
50
Code
50
Vision
40

定价

输入价格$0.6 / 1M tokens
输出价格$2.2 / 1M tokens
混合价格(3:1)$1 / 1M tokens

速度

Tokens/秒91.5 tokens/s
首Token延迟0.90s
首回答延迟22.74s

可用提供商

(LS 内部计价单位)
提供商输入价格输出价格
Novita600K2.2M
Fireworks600K2.2M

外部链接