跳转到主要内容

LongCat-Flash-Chat

Meituan开源权重MIT · 商用许可

描述

LongCat-Flash-Chat is Meituan's first open-source foundation model, a 560B parameter Mixture-of-Experts (MoE) model that dynamically activates 18.6B-31.3B parameters (~27B average) based on contextual demands. It features Zero-Computation Experts for efficient routing and supports 128K context. Optimized for conversational and agentic tasks, it shows competitive performance across reasoning, coding, instruction following, and domain benchmarks with particular strengths in tool use and complex multi-step interactions. Achieves over 100 tokens per second on H800 GPUs.

发布日期
2025-08-29
参数规模
560.0B
上下文长度
支持模态
text

能力雷达图

80
general
60
coding
80
reasoning
60
science估算
70
agents
0
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
智能体能力模型榜104
40.0
LS
推理能力11
89.0
LS

基准测试分数 (LLM Stats)

Agents

Terminal-Bench39.5%自报

Biology

GPQA73.2%自报

Code

HumanEval88.4%自报
SWE-Bench Verified60.4%自报
LiveCodeBench48.0%自报

Communication

Tau2 Telecom73.7%自报
Tau2 Retail71.3%自报
Tau2 Airline58.0%自报

Finance

MMLU89.7%自报
MMLU-Pro82.7%自报

General

IFEval89.6%自报
CMMLU84.3%自报

Math

MATH-50096.4%自报
DROP79.1%自报
AIME 202561.3%自报

Reasoning

ZebraLogic89.3%自报

AA 评测指数

暂无 AA 评测数据

LLM Stats 分类评分

Instruction Following
90
Language
90
Legal
90
Structured Output
90
Finance
90
Healthcare
90
Math
80
General
80
Physics
70
Reasoning
70
Biology
70
Chemistry
70
Communication
70
Tool Calling
70
Frontend Development
60
Code
60
Agents
40

定价

暂无定价数据

速度

暂无速度数据

供应商价格排行

暂无提供商数据

外部链接