跳转到主要内容

GLM-4.7-Flash (Non-reasoning)

Z AIGLMOpen WeightMIT · Commercial OK

描述

GLM-4.7-Flash is a high-speed, cost-efficient variant of GLM-4.7 optimized for fast inference and lower latency. It retains the coding-centric capabilities of GLM-4.7 including thinking before acting, preserved reasoning across turns, and per-request thinking control for speed or accuracy trade-offs. Ideal for applications requiring quick responses while maintaining strong performance on coding, agentic workflows, and general reasoning tasks.

发布日期
2026-01-19
参数规模
30.0B
上下文长度
203K
支持模态
text

能力雷达图

18
general
13
coding
45
reasoning
30
science估算
80
agents
0
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
智能体与工具30
64.0
LS
代码能力榜375
16.0
AA
通用能力榜195
51.0
AA
科学能力354
31.0
AA

基准测试分数 (LLM Stats)

Agents

Tau-bench79.5%自报
BrowseComp42.8%自报

Biology

GPQA75.2%自报

Code

SWE-Bench Verified59.2%自报

Math

AIME 202591.6%自报
Humanity's Last Exam14.4%自报

AA 评测指数

Intelligence Index
22.1
Coding Index
11.0
Tau2
0.9
Ifbench
0.5
Gpqa
0.5
Scicode
0.3
Lcr
0.1
Hle
0.0
Terminalbench Hard
0.0

LLM Stats 分类评分

Tool Calling
80
Biology
80
Chemistry
80
General
80
Physics
80
Agents
60
Code
60
Frontend Development
60
Reasoning
60
Math
50
Search
40
Vision
10

定价

输入价格$0.07 / 1M tokens
输出价格$0.4 / 1M tokens
混合价格(3:1)$0.153 / 1M tokens

速度

Tokens/秒94.6 tokens/s
首Token延迟0.89s
首回答延迟0.89s

可用提供商

(LS 内部计价单位)
提供商输入价格输出价格
ZAI70K400K

外部链接