Kimi K2
Description
Kimi K2-Instruct-0905 is the latest and most capable version of Kimi K2, achieving state-of-the-art performance in frontier knowledge, math, and coding among non-thinking models. It is a Mixture-of-Experts model with 32 billion activated parameters out of 1 trillion total parameters, optimized for agentic tasks. Key features include enhanced agentic coding intelligence, a context length extended to 256K tokens, and a hybrid architecture trained with the MuonClip optimizer on 15.5T tokens. The model scores 65.8% on SWE-bench Verified (single attempt), 47.3% on SWE-bench Multilingual, and 70.6% on Tau2-retail for tool use. It is a reflex-grade model without long thinking, designed to act directly and carry out complex tasks.
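To put the two parameter counts in the description side by side, the short sketch below computes what share of the model is activated. It uses only the 32 billion and 1 trillion figures quoted above; the constant and function names are illustrative, and it assumes "activated parameters" means the parameters used for a single token, as is usual for Mixture-of-Experts models.

```python
# Figures quoted in the description; names below are illustrative only.
TOTAL_PARAMS = 1_000_000_000_000   # 1 trillion total parameters
ACTIVE_PARAMS = 32_000_000_000     # 32 billion activated parameters


def active_fraction(active: int, total: int) -> float:
    """Return the share of all parameters that are activated."""
    return active / total


fraction = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
print(f"Activated share of parameters: {fraction:.1%}")  # prints 3.2%
```

Under that assumption, only about 3.2% of the weights take part in any one forward step, which is why a model of this total size can still be served at the cost of a much smaller dense model.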
Capability Radar
When no dedicated science benchmark is available, the Science score is estimated using a reasoning proxy.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 98 | 25.0 | LS |
| Code Ranking | 174 | 46.0 | AA |
| General Ranking | 172 | 55.0 | AA |
| Math Reasoning | 123 | 69.0 | AA |
| Reasoning | 47 | 69.0 | LS |
| Science | 181 | 50.0 | AA |
Benchmark Scores (LLM Stats)
Categories: Agents, Biology, Chemistry, Code, Communication, Factuality, Finance, General, Math, Reasoning
AA Evaluation Index
LLM Stats Category Scores
Pricing
Speed
Available Providers
(LS internal units) No provider data is available.