跳轉到主要內容

LongCat-Flash-Chat

MeituanOpen WeightMIT · Commercial OK

描述

LongCat-Flash-Chat is Meituan's first open-source foundation model, a 560B parameter Mixture-of-Experts (MoE) model that dynamically activates 18.6B-31.3B parameters (~27B average) based on contextual demands. It features Zero-Computation Experts for efficient routing and supports 128K context. Optimized for conversational and agentic tasks, it shows competitive performance across reasoning, coding, instruction following, and domain benchmarks with particular strengths in tool use and complex multi-step interactions. Achieves over 100 tokens per second on H800 GPUs.

發布日期
2025-08-29
參數規模
560.0B
上下文長度
支援模態
text

能力雷達圖

80
general
60
coding
80
reasoning
60
science估算
70
agents
0
multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域#排名分數來源
智能体与工具86
40.0
LS
推理能力11
89.0
LS

基準測試分數 (LLM Stats)

Agents

Terminal-Bench39.5%自報

Biology

GPQA73.2%自報

Code

HumanEval88.4%自報
SWE-Bench Verified60.4%自報
LiveCodeBench48.0%自報

Communication

Tau2 Telecom73.7%自報
Tau2 Retail71.3%自報
Tau2 Airline58.0%自報

Finance

MMLU89.7%自報
MMLU-Pro82.7%自報

General

IFEval89.6%自報
CMMLU84.3%自報

Math

MATH-50096.4%自報
DROP79.1%自報
AIME 202561.3%自報

Reasoning

ZebraLogic89.3%自報

AA 評測指數

暫無 AA 評測資料

LLM Stats 分類評分

Structured Output
90
Finance
90
Healthcare
90
Instruction Following
90
Language
90
Legal
90
General
80
Math
80
Tool Calling
70
Biology
70
Chemistry
70
Communication
70
Physics
70
Reasoning
70
Code
60
Frontend Development
60
Agents
40

定價

輸入價格$0.3 / 1M tokens
輸出價格$1.2 / 1M tokens
混合價格(3:1)$0.525 / 1M tokens

速度

暫無速度資料

可用提供商

(LS 內部計價單位)
提供商輸入價格輸出價格
Meituan300K1.2M

外部連結