GLM-4.7-Flash (Non-reasoning)
Z AIGLMOpen WeightMIT · Commercial OK
Description
GLM-4.7-Flash is a high-speed, cost-efficient variant of GLM-4.7 optimized for fast inference and lower latency. It retains the coding-centric capabilities of GLM-4.7 including thinking before acting, preserved reasoning across turns, and per-request thinking control for speed or accuracy trade-offs. Ideal for applications requiring quick responses while maintaining strong performance on coding, agentic workflows, and general reasoning tasks.
Release Date
2026-01-19
Parameters
30.0B
Context Length
203K
Modalities
text
Capability Radar
18
general
13
coding
45
reasoning
30
scienceest.
80
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 30 | 64.0 | LS |
| Code Ranking | 375 | 16.0 | AA |
| General Ranking | 195 | 51.0 | AA |
| Science | 354 | 31.0 | AA |
Benchmark Scores (LLM Stats)
Agents
Tau-bench
79.5%SR
BrowseComp
42.8%SR
Biology
GPQA
75.2%SR
Code
SWE-Bench Verified
59.2%SR
Math
AIME 2025
91.6%SR
Humanity's Last Exam
14.4%SR
AA Evaluation Indices
Intelligence Index22.1
Coding Index11.0
Tau20.9
Ifbench0.5
Gpqa0.5
Scicode0.3
Lcr0.1
Hle0.0
Terminalbench Hard0.0
LLM Stats Category Scores
Tool Calling80
Biology80
Chemistry80
General80
Physics80
Agents60
Code60
Frontend Development60
Reasoning60
Math50
Search40
Vision10
Pricing
Input Price$0.07 / 1M tokens
Output Price$0.4 / 1M tokens
Blended Price (3:1)$0.153 / 1M tokens
Speed
Tokens/sec94.6 tokens/s
Time to First Token0.89s
Time to Answer0.89s
Available Providers
(LS internal units)| Provider | Input Price | Output Price |
|---|---|---|
| ZAI | 70K | 400K |