Kimi K2
Description
Kimi K2-Instruct-0905 is the latest and most capable version of Kimi K2, achieving state-of-the-art performance in frontier knowledge, math, and coding among non-thinking models. This Mixture-of-Experts model has 32 billion activated parameters out of 1 trillion total parameters and is optimized for agentic tasks. Key features include enhanced agentic coding intelligence, a context length extended to 256K tokens, and a hybrid architecture trained with the MuonClip optimizer on 15.5T tokens. The model scores 65.8% on SWE-bench Verified (single attempt), 47.3% on SWE-bench Multilingual, and 70.6% on Tau2-retail for tool use. It is a reflex-grade model without long thinking, designed to act on and execute complex tasks directly.
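As a rough illustration of what the Mixture-of-Experts figures in the description imply, the sketch below computes the share of parameters that is activated per token. The function name `active_fraction` and the constants are hypothetical names introduced here for illustration; only the 32 billion and 1 trillion figures come from the description.

```python
def active_fraction(activated_params: float, total_params: float) -> float:
    """Return the share of parameters that is activated per token."""
    if total_params <= 0:
        raise ValueError("total_params must be positive")
    return activated_params / total_params


# Figures quoted in the description; the names are illustrative only.
ACTIVATED_PARAMS = 32e9  # 32 billion activated parameters
TOTAL_PARAMS = 1e12      # 1 trillion total parameters

fraction = active_fraction(ACTIVATED_PARAMS, TOTAL_PARAMS)
print(f"{fraction:.1%} of parameters are activated per token")
```

Under these figures, roughly 3.2% of the model's parameters are activated for any given token, which is why the per-token compute is much lower than the total parameter count suggests.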
Capability Radar
When no dedicated science benchmark is available, the Science score is estimated using a reasoning proxy.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 98 | 25.0 | LS |
| Code Ranking | 174 | 46.0 | AA |
| General Ranking | 172 | 55.0 | AA |
| Math Reasoning | 123 | 69.0 | AA |
| Reasoning | 47 | 69.0 | LS |
| Science | 181 | 50.0 | AA |
Benchmark Scores (LLM Stats)
Categories: Agents, Biology, Chemistry, Code, Communication, Factuality, Finance, General, Math, Reasoning
AA Evaluation Index
LLM Stats Category Scores
Pricing
Speed
Available Providers
(LS internal units) No provider data available