GLM-4.7-Flash (Non-reasoning)
Z AIGLMOpen WeightMIT · Commercial OK
Descripción
GLM-4.7-Flash is a high-speed, cost-efficient variant of GLM-4.7 optimized for fast inference and lower latency. It retains the coding-centric capabilities of GLM-4.7 including thinking before acting, preserved reasoning across turns, and per-request thinking control for speed or accuracy trade-offs. Ideal for applications requiring quick responses while maintaining strong performance on coding, agentic workflows, and general reasoning tasks.
Fecha de lanzamiento
2026-01-19
Parámetros
30.0B
Longitud del contexto
203K
Modalidades
text
Radar de capacidades
18
general
13
coding
45
reasoning
30
scienceest.
80
agents
0
multimodal
Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.
Rankings
| Dominio | #Posición | Puntuación | Fuente |
|---|---|---|---|
| Agents & Tools | 30 | 64.0 | LS |
| Code Ranking | 375 | 16.0 | AA |
| General Ranking | 195 | 51.0 | AA |
| Science | 354 | 31.0 | AA |
Puntuaciones de benchmarks (LLM Stats)
Agents
Tau-bench
79.5%Aut.
BrowseComp
42.8%Aut.
Biology
GPQA
75.2%Aut.
Code
SWE-Bench Verified
59.2%Aut.
Math
AIME 2025
91.6%Aut.
Humanity's Last Exam
14.4%Aut.
Índices de evaluación AA
Intelligence Index22.1
Coding Index11.0
Tau20.9
Ifbench0.5
Gpqa0.5
Scicode0.3
Lcr0.1
Hle0.0
Terminalbench Hard0.0
Puntuaciones por categoría LLM Stats
Tool Calling80
Biology80
Chemistry80
General80
Physics80
Agents60
Code60
Frontend Development60
Reasoning60
Math50
Search40
Vision10
Precios
Precio de entrada$0.07 / 1M tokens
Precio de salida$0.4 / 1M tokens
Precio mixto (3:1)$0.153 / 1M tokens
Velocidad
Tokens/seg94.6 tokens/s
Retraso del primer token0.89s
Tiempo hasta la respuesta0.89s
Proveedores disponibles
(Unidades internas LS)| Proveedor | Precio de entrada | Precio de salida |
|---|---|---|
| ZAI | 70K | 400K |