GLM-4.7-Flash (Non-reasoning)
Z AI · GLM · Open Weight · MIT · Commercial OK
Description
GLM-4.7-Flash is a high-speed, cost-efficient variant of GLM-4.7 optimized for fast inference and lower latency. It retains the coding-centric capabilities of GLM-4.7, including thinking before acting, preserved reasoning across turns, and per-request thinking control to trade speed against accuracy. It is suited to applications that need quick responses while maintaining strong performance on coding, agentic workflows, and general reasoning tasks.
Release date
2026-01-19
Parameters
30.0B
Context length
203K
Supported modalities
text
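Capability radar chart
general: 18
coding: 13
reasoning: 45
science (estimated): 30
agents: 80
multimodal: 0
Science is estimated from the reasoning score as a proxy when no dedicated science benchmark is available.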
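Leaderboard rankings
Benchmark scores (LLM Stats)

| Category | Benchmark | Score | Source |
|---|---|---|---|
| Agents | Tau-bench | 79.5% | self-reported |
| Agents | BrowseComp | 42.8% | self-reported |
| Biology | GPQA | 75.2% | self-reported |
| Code | SWE-Bench Verified | 59.2% | self-reported |
| Math | AIME 2025 | 91.6% | self-reported |
| Math | Humanity's Last Exam | 14.4% | self-reported |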
AA evaluation indices
Intelligence Index: 22.1
Coding Index: 11.0
Tau2: 0.9
IFBench: 0.5
GPQA: 0.5
SciCode: 0.3
LCR: 0.1
HLE: 0.0
Terminal-Bench Hard: 0.0
LLM Stats category scores
Tool Calling: 80
Biology: 80
Chemistry: 80
General: 80
Physics: 80
Agents: 60
Code: 60
Frontend Development: 60
Reasoning: 60
Math: 50
Search: 40
Vision: 10
Pricing
Input price: $0.07 / 1M tokens
Output price: $0.4 / 1M tokens
Blended price (3:1): $0.153 / 1M tokens
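The blended figure above appears to be a weighted average of the input and output prices, assuming 3 input tokens for every 1 output token. That reading is an assumption, since the listing does not define the ratio, but the arithmetic matches: (3 × 0.07 + 0.40) / 4 = 0.1525, which rounds to the listed $0.153. A minimal sketch of that calculation, plus a per-request cost estimate, follows. The function names `blended_price` and `request_cost` are hypothetical helpers, not part of any provider API.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes `ratio` input tokens are consumed for every 1 output token
    (an interpretation of the "3:1" label, not confirmed by the listing).
    """
    return (ratio * input_price + output_price) / (ratio + 1.0)


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in USD of a single request, given prices per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


INPUT_PRICE = 0.07   # USD per 1M input tokens, from the listing
OUTPUT_PRICE = 0.40  # USD per 1M output tokens, from the listing

if __name__ == "__main__":
    print(f"blended: ${blended_price(INPUT_PRICE, OUTPUT_PRICE):.4f} / 1M tokens")
    print(f"3K in + 1K out: ${request_cost(3_000, 1_000, INPUT_PRICE, OUTPUT_PRICE):.6f}")
```

A workload with a different input-to-output mix can pass its own `ratio`; output-heavy workloads will land closer to the output price.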
Speed
Tokens per second: 94.6 tokens/s
Time to first token: 0.89s
Time to first answer: 0.89s
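The speed figures above can be combined into a rough estimate of how long a streamed response takes end to end. The sketch below assumes a simple model: a fixed delay until the first token, then generation at a constant rate. Real latency varies with load, prompt length, and provider, so treat the result as a ballpark only. The function name `estimated_response_seconds` is a hypothetical helper.

```python
def estimated_response_seconds(output_tokens: int,
                               ttft_seconds: float = 0.89,
                               tokens_per_second: float = 94.6) -> float:
    """Approximate wall-clock time for a response of `output_tokens` tokens.

    Assumes a fixed time to first token followed by a constant generation
    rate; the defaults are the figures from the listing.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return ttft_seconds + output_tokens / tokens_per_second


if __name__ == "__main__":
    for n in (100, 500, 2_000):
        print(f"{n} tokens: about {estimated_response_seconds(n):.1f}s")
```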
Available providers
(LS internal pricing units)

| Provider | Input price | Output price |
|---|---|---|
| ZAI | 70K | 400K |