GLM-4.5-Air
Z AI · GLM · Open Weight · MIT · Commercial OK
Description
GLM-4.5-Air is a more compact variant of GLM-4.5 designed for efficient Agentic, Reasoning, and Coding (ARC) applications. It uses a Mixture-of-Experts (MoE) architecture with 106 billion total parameters, of which 12 billion are active. Like GLM-4.5, it is a hybrid reasoning model: a thinking mode handles complex reasoning and tool use, and a non-thinking mode gives immediate responses. Despite its compact design, GLM-4.5-Air scores 59.8 across 12 industry-standard benchmarks, ranking 6th overall while maintaining high efficiency. It supports a 128K context length and is released under the MIT open-source license, which allows commercial use.
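To make the MoE figures in the description concrete, the sketch below computes the share of parameters that are active, using only the 106 billion and 12 billion figures quoted above. The function and constant names are illustrative, and the result is a rough ratio, not a measured compute cost.

```python
# Figures taken from the description; names are illustrative only.
TOTAL_PARAMS_B = 106.0   # total parameters, in billions
ACTIVE_PARAMS_B = 12.0   # active parameters, in billions


def active_fraction(active_b: float, total_b: float) -> float:
    """Return the share of parameters that are active (0.0 to 1.0)."""
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    return active_b / total_b


fraction = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
print(f"Active share: {fraction:.1%}")
```

Roughly one ninth of the weights are active, which is the basis for the efficiency claim in the description.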
Release date
2025-07-28
Parameters
106.0B
Context length
131K
Modality
text
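The description cites a 128K context length while the context length field lists 131K. A likely explanation, assumed here and not stated on the page, is that "128K" uses a binary K (1,024 tokens) and "131K" is the same count rounded in decimal thousands. The helper name below is hypothetical.

```python
def binary_k_to_tokens(k: int) -> int:
    """Convert a context size given in binary K (1K = 1024 tokens) to tokens."""
    return k * 1024


tokens = binary_k_to_tokens(128)          # 131072 tokens
decimal_thousands = round(tokens / 1000)  # 131, matching the "131K" field

print(tokens, decimal_thousands)
```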
Capability radar
General: 38
Coding: 40
Reasoning: 79
Science (estimated): 45
Agents: 70
Multimodal: 0
When no dedicated science benchmark is available, Science is estimated using a reasoning proxy.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 89 | 38.0 | LS |
| Code Ranking | 159 | 49.0 | AA |
| General Ranking | 216 | 48.0 | AA |
| Math Reasoning | 73 | 83.0 | AA |
| Science | 211 | 47.0 | AA |
Benchmark scores (LLM Stats)
Agents
BFCL-v3: 76.4% (self-reported)
Terminal-Bench: 30.0% (self-reported)
BrowseComp: 21.3% (self-reported)
Biology
GPQA: 75.0% (self-reported)
SciCode: 37.3% (self-reported)
Code
LiveCodeBench: 70.7% (self-reported)
SWE-Bench Verified: 57.6% (self-reported)
Communication
TAU-bench Retail: 77.9% (self-reported)
TAU-bench Airline: 60.8% (self-reported)
Finance
MMLU-Pro: 81.4% (self-reported)
General
AA-Index: 64.8% (self-reported)
Math
MATH-500: 98.1% (self-reported)
AIME 2024: 89.4% (self-reported)
Humanity's Last Exam: 10.6% (self-reported)
AA evaluation indices
Math Index: 80.7
Coding Index: 23.8
Intelligence Index: 23.2
MATH-500: 1.0
MMLU-Pro: 0.8
AIME 25: 0.8
GPQA: 0.7
LiveCodeBench: 0.7
AIME: 0.7
Tau2: 0.5
LCR: 0.4
IFBench: 0.4
SciCode: 0.3
TerminalBench Hard: 0.2
HLE: 0.1
LLM Stats category scores
Structured Output: 80
Finance: 80
Healthcare: 80
Language: 80
Legal: 80
Tool Calling: 70
Communication: 70
General: 70
Biology: 60
Chemistry: 60
Frontend Development: 60
Math: 60
Physics: 60
Reasoning: 60
Code: 50
Agents: 40
Search: 20
Vision: 10
Pricing
Input price: $0.17 / 1M tokens
Output price: $0.98 / 1M tokens
Blended price (3:1): $0.372 / 1M tokens
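The blended figure appears to be a weighted average of the input and output prices at a 3:1 input-to-output token ratio. The sketch below reproduces it under that assumption and adds a hypothetical `request_cost` helper for estimating a single request. Both function names are illustrative and are not part of any provider API.

```python
INPUT_PRICE = 0.17   # USD per 1M input tokens (from the listing)
OUTPUT_PRICE = 0.98  # USD per 1M output tokens (from the listing)


def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output ratio."""
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed per-token prices."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)  # about 0.3725, listed as $0.372
print(f"Blended price: ${blended:.4f} / 1M tokens")
```

Workloads with a different input-to-output ratio can pass other weights; output-heavy reasoning traffic will cost noticeably more per token than the 3:1 figure suggests.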
Speed
Tokens per second: 84.4 tokens/s
First-token latency: 1.39s
First-response latency: 25.10s
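As a rough way to read these numbers together, the sketch below estimates wall-clock time for a streamed completion as first-token latency plus output tokens divided by throughput. This simple model is an assumption: it treats 84.4 tokens/s as a steady output rate and ignores any thinking-mode tokens generated before the visible answer, which may be what the longer first-response figure reflects.

```python
FIRST_TOKEN_LATENCY_S = 1.39  # seconds until the first token (from the listing)
TOKENS_PER_SECOND = 84.4      # output throughput (from the listing)


def estimate_generation_seconds(output_tokens: int,
                                first_token_latency_s: float = FIRST_TOKEN_LATENCY_S,
                                tokens_per_second: float = TOKENS_PER_SECOND) -> float:
    """Estimate total seconds to stream `output_tokens` tokens."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_token_latency_s + output_tokens / tokens_per_second


estimate = estimate_generation_seconds(844)
print(f"Estimated time for 844 tokens: {estimate:.2f}s")
```

Under these assumptions an 844-token reply takes a little over 11 seconds; real latency varies by provider and load.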
Available providers
(LS internal units) No provider data available.