Kimi K2
Описание
Kimi K2-Instruct-0905 is the latest, most capable version of Kimi K2, achieving state-of-the-art performance in frontier knowledge, math, and coding among non-thinking models. This Mixture-of-Experts model features 32 billion activated parameters and 1 trillion total parameters, meticulously optimized for agentic tasks. Key features include enhanced agentic coding intelligence, extended context length to 256K tokens, and a hybrid architecture trained with MuonClip optimizer on 15.5T tokens. The model achieves 65.8% on SWE-bench Verified (single attempt), 47.3% on SWE-bench Multilingual, and excels at tool use with 70.6% on Tau2-retail. It is a reflex-grade model without long thinking, designed to act and execute complex tasks seamlessly.
Радар способностей
Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.
Рейтинги
| Домен | #Место | Оценка | Источник |
|---|---|---|---|
| Agents & Tools | 98 | 25.0 | LS |
| Code Ranking | 174 | 46.0 | AA |
| General Ranking | 172 | 55.0 | AA |
| Math Reasoning | 123 | 69.0 | AA |
| Reasoning | 47 | 69.0 | LS |
| Science | 181 | 50.0 | AA |
Оценки бенчмарков (LLM Stats)
Agents
Biology
Chemistry
Code
Communication
Factuality
Finance
General
Math
Reasoning
Индексы оценки AA
Оценки категорий LLM Stats
Цены
Скорость
Доступные провайдеры
(Внутренние единицы LS)Нет данных провайдеров