Passer au contenu principal

GLM-4.6 (Reasoning)

Z AIGLMOpen WeightMIT · Commercial OK

Description

GLM-4.6 is the latest version of Z.ai's flagship model, bringing significant improvements over GLM-4.5. Key features include: 200K token context window (expanded from 128K), superior coding performance with better real-world application in Claude Code/Cline/Roo Code/Kilo Code, advanced reasoning with tool use during inference, stronger agent capabilities, and refined writing aligned with human preferences. GLM-4.6 achieves competitive performance with DeepSeek-V3.2-Exp and Claude Sonnet 4, reaching near parity with Claude Sonnet 4 (48.6% win rate) on CC-Bench real-world coding tasks.

Date de sortie
2025-09-30
Paramètres
357.0B
Longueur du contexte
205K
Modalités
image, text, video

Radar de capacités

45
general
44
coding
85
reasoning
51
scienceest.
40
agents
20
multimodal

Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.

Classements

Domaine#RangScoreSource
Agents & Tools84
43.0
LS
Code Ranking111
58.0
AA
General Ranking135
61.0
AA
Math Reasoning54
87.0
AA
Science122
58.0
AA

Scores de benchmarks (LLM Stats)

Agents

BrowseComp45.1%Aut.
Terminal-Bench40.5%Aut.

Biology

GPQA81.0%Aut.

Code

SWE-Bench Verified68.0%Aut.

General

LiveCodeBench v682.8%Aut.

Math

AIME 202593.9%Aut.
Humanity's Last Exam17.2%Aut.

Indices d'évaluation AA

Math Index
86.0
Intelligence Index
32.5
Coding Index
29.5
Aime 25
0.9
Mmlu Pro
0.8
Gpqa
0.8
Tau2
0.7
Livecodebench
0.7
Lcr
0.5
Ifbench
0.4
Scicode
0.4
Terminalbench Hard
0.3
Hle
0.1

Scores par catégorie LLM Stats

Biology
80
Chemistry
80
General
80
Physics
80
Frontend Development
70
Math
60
Reasoning
60
Code
50
Search
50
Agents
40
Vision
20

Tarification

Prix d'entrée$0.55 / 1M tokens
Prix de sortie$2.2 / 1M tokens
Prix mixte (3:1)$0.963 / 1M tokens

Vitesse

Tokens/sec37.2 tokens/s
Délai du premier token0.82s
Temps de réponse54.62s

Fournisseurs disponibles

(Unités internes LS)
FournisseurPrix d'entréePrix de sortie
Fireworks550K2.2M
DeepInfra600K2.0M

Sources externes