跳轉到主要內容

GLM-4.7 (Reasoning)

Z AIGLMOpen WeightMIT · Commercial OK

描述

GLM 4.7 is a coding‑centric model that thinks before acting, preserves its reasoning across turns, and lets you control thinking per request for speed or accuracy. It upgrades agentic workflows with stronger multi‑step tool use, better terminal and multilingual coding, and a noticeable jump in UI output quality for modern, clean webpages and slides. You can use it in popular coding agents, call it via the Z.ai API, and even run it locally with public weights on HuggingFace and ModelScope using vLLM or SGLang.

發布日期
2025-12-22
參數規模
358.0B
上下文長度
203K
支援模態
text

能力雷達圖

53
general
56
coding
93
reasoning
59
science估算
60
agents
0
multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域#排名分數來源
智能体与工具59
55.0
LS
代码能力榜57
71.0
AA
通用能力榜39
81.0
AA
数学推理13
96.0
AA
推理能力55
67.0
LS
科学能力44
74.0
AA

基準測試分數 (LLM Stats)

Agents

Tau-bench87.4%自報
BrowseComp52.0%自報
Terminal-Bench 2.041.0%自報
Terminal-Bench33.3%自報

Biology

GPQA85.7%自報

Code

SWE-Bench Verified73.8%自報
SWE-bench Multilingual66.7%自報

Finance

MMLU-Pro84.3%自報

General

LiveCodeBench v684.9%自報

Math

AIME 202595.7%自報
IMO-AnswerBench82.0%自報
Humanity's Last Exam42.8%自報

Reasoning

BrowseComp-zh66.6%自報

AA 評測指數

Math Index
95.0
Intelligence Index
42.1
Coding Index
36.3
Tau2
1.0
Aime 25
0.9
Livecodebench
0.9
Gpqa
0.9
Mmlu Pro
0.9
Ifbench
0.7
Lcr
0.6
Scicode
0.5
Terminalbench Hard
0.3
Hle
0.3

LLM Stats 分類評分

Biology
90
Chemistry
90
General
90
Physics
90
Finance
80
Healthcare
80
Language
80
Legal
80
Math
80
Frontend Development
70
Reasoning
70
Tool Calling
60
Search
60
Agents
50
Code
50
Vision
40

定價

輸入價格$0.6 / 1M tokens
輸出價格$2.2 / 1M tokens
混合價格(3:1)$1 / 1M tokens

速度

Tokens/秒91.5 tokens/s
首Token延遲0.90s
首回答延遲22.74s

可用提供商

(LS 內部計價單位)
提供商輸入價格輸出價格
Novita600K2.2M
Fireworks600K2.2M

外部連結