GLM-4.5 (Reasoning)

Z AIGLMOpen WeightMIT · Commercial OK

विवरण

GLM-4.5 is an Agentic, Reasoning, and Coding (ARC) foundation model designed for intelligent agents, featuring 355 billion total parameters with 32 billion active parameters using MoE architecture. Trained on 23T tokens through multi-stage training, it is a hybrid reasoning model that provides two modes: thinking mode for complex reasoning and tool usage, and non-thinking mode for immediate responses. The model unifies agentic, reasoning, and coding capabilities with 128K context length support. It achieves exceptional performance with a score of 63.2 across 12 industry-standard benchmarks, placing 3rd among all proprietary and open-source models. Released under MIT open-source license allowing commercial use and secondary development.

रिलीज़ तिथि

2025-07-28

पैरामीटर

355.0B

संदर्भ लंबाई

131K

मोडैलिटीज़

text

क्षमता रडार

general

coding

reasoning

scienceअनुमानित

agents

multimodal

समर्पित विज्ञान बेंचमार्क उपलब्ध न होने पर Science तर्क प्रॉक्सी का उपयोग करके अनुमान लगाता है।

रैंकिंग

डोमेन	#रैंक	स्कोर	स्रोत
Agents & Tools	58	55.0	LS
Code Ranking	125	54.0	AA
General Ranking	187	52.0	AA
Math Reasoning	76	82.0	AA
Science	141	55.0	AA

बेंचमार्क स्कोर (LLM Stats)

Agents

BFCL-v3

77.8%स्वयं

Terminal-Bench

37.5%स्वयं

BrowseComp

26.4%स्वयं

Biology

GPQA

79.1%स्वयं

SciCode

41.7%स्वयं

Code

LiveCodeBench

72.9%स्वयं

SWE-Bench Verified

64.2%स्वयं

Communication

TAU-bench Retail

79.7%स्वयं

TAU-bench Airline

60.4%स्वयं

Finance

MMLU-Pro

84.6%स्वयं

General

AA-Index

67.7%स्वयं

Math

MATH-500

98.2%स्वयं

AIME 2024

91.0%स्वयं

Humanity's Last Exam

14.4%स्वयं

AA मूल्यांकन सूचकांक

Math Index

73.7

Intelligence Index

26.4

Coding Index

26.3

Math 500

1.0

Aime

0.9

Mmlu Pro

0.8

Gpqa

0.8

Livecodebench

0.7

Aime 25

0.7

Lcr

0.5

Ifbench

0.4

Tau2

0.4

Scicode

0.3

Terminalbench Hard

0.2

Hle

0.1

LLM Stats श्रेणी स्कोर

Structured Output

Finance

General

Healthcare

Language

Legal

Tool Calling

Communication

Math

Biology

Chemistry

Frontend Development

Physics

Reasoning

Agents

Code

Vision

मूल्य निर्धारण

इनपुट मूल्य$0.6 / 1M tokens

आउटपुट मूल्य$2.2 / 1M tokens

मिश्रित मूल्य (3:1)$1 / 1M tokens

गति

टोकन/सेकंड42.4 tokens/s

पहले टोकन में देरी1.03s

पहले उत्तर में देरी48.20s

उपलब्ध प्रदाता

(LS आंतरिक इकाइयाँ)

कोई प्रदाता डेटा उपलब्ध नहीं

बाहरी लिंक

LLM Stats