LongCat-Flash-Chat

MeituanオープンウエイトMIT · 商用利用可

説明

LongCat-Flash-Chat is Meituan's first open-source foundation model, a 560B parameter Mixture-of-Experts (MoE) model that dynamically activates 18.6B-31.3B parameters (~27B average) based on contextual demands. It features Zero-Computation Experts for efficient routing and supports 128K context. Optimized for conversational and agentic tasks, it shows competitive performance across reasoning, coding, instruction following, and domain benchmarks with particular strengths in tool use and complex multi-step interactions. Achieves over 100 tokens per second on H800 GPUs.

リリース日

2025-08-29

パラメータ

560.0B

コンテキスト長

—

モダリティ

text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Agents

Terminal-Bench

39.5%自己申告

Biology

GPQA

73.2%自己申告

Code

HumanEval

88.4%自己申告

SWE-Bench Verified

60.4%自己申告

LiveCodeBench

48.0%自己申告

Communication

Tau2 Telecom

73.7%自己申告

Tau2 Retail

71.3%自己申告

Tau2 Airline

58.0%自己申告

Finance

MMLU

89.7%自己申告

MMLU-Pro

82.7%自己申告

General

IFEval

89.6%自己申告

CMMLU

84.3%自己申告

Math

MATH-500

96.4%自己申告

DROP

79.1%自己申告

AIME 2025

61.3%自己申告

Reasoning

ZebraLogic

89.3%自己申告

AA評価指数

AA評価データがありません

LLM Statsカテゴリスコア

Instruction Following

Language

Legal

Structured Output

Finance

Healthcare

Math

General

Physics

Reasoning

Biology

Chemistry

Communication

Tool Calling

Frontend Development

Code

Agents

価格設定

価格データがありません

速度

速度データがありません

プロバイダー価格ランキング

プロバイダーデータがありません

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
エージェント能力	104	40.0	LS
推論	11	89.0	LS