
LongCat-Flash-Chat

Meituan · Open Weight · MIT · Commercial OK

Description

LongCat-Flash-Chat is Meituan's first open-source foundation model: a 560B-parameter Mixture-of-Experts (MoE) model that dynamically activates 18.6B to 31.3B parameters per token (about 27B on average) depending on contextual demands. It features Zero-Computation Experts for efficient routing and supports a 128K context. Optimized for conversational and agentic tasks, it shows competitive performance across reasoning, coding, instruction-following, and domain benchmarks, with particular strengths in tool use and complex multi-step interactions. It achieves over 100 tokens per second on H800 GPUs.
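The variable activation count described above can be illustrated with a toy router. This is a minimal sketch, not Meituan's implementation: it assumes a zero-computation expert behaves like an identity map that adds no FFN cost, and the function names, expert counts, and scores are all hypothetical. The last helper only restates the arithmetic behind the 18.6B to 31.3B range quoted in the description.

```python
def route_token(scores, top_k, num_ffn_experts):
    """Select the top_k experts for one token.

    Experts 0..num_ffn_experts-1 are ordinary FFN experts; any higher index
    is treated as a zero-computation expert (assumed here to be an identity map).
    """
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    chosen = ranked[:top_k]
    ffn = [i for i in chosen if i < num_ffn_experts]
    zero = [i for i in chosen if i >= num_ffn_experts]
    return ffn, zero


def activated_params(ffn_count, always_on_params, params_per_ffn_expert):
    """Parameters touched for one token: always-on weights plus chosen FFN experts."""
    return always_on_params + ffn_count * params_per_ffn_expert


def activation_fraction(active_billions, total_billions):
    """Share of the total parameter count that is active for a token."""
    return active_billions / total_billions


if __name__ == "__main__":
    # Toy layer (hypothetical sizes): 4 FFN experts (0-3), 2 zero-computation experts (4-5).
    easy_token = [0.1, 0.2, 0.1, 0.3, 0.9, 0.8]
    hard_token = [0.9, 0.8, 0.1, 0.3, 0.2, 0.1]
    for name, scores in [("easy", easy_token), ("hard", hard_token)]:
        ffn, zero = route_token(scores, top_k=2, num_ffn_experts=4)
        print(name, "FFN experts:", ffn, "zero-computation:", zero)

    # Figures from the description: 18.6B to 31.3B active out of 560B total.
    for active in (18.6, 27.0, 31.3):
        print(f"{active}B of 560B = {activation_fraction(active, 560.0):.1%}")
```

In this toy setup, a token whose top scores land on zero-computation experts touches fewer FFN parameters than one routed to ordinary experts, which is the sense in which the active parameter count varies per token. Using the description's figures, the active share works out to roughly 3.3% to 5.6% of the 560B total, about 4.8% on average.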

Release Date: 2025-08-29
Parameters: 560.0B
Context Length: 128K (as stated in the description)
Modalities: text

Capability Radar

general          80
coding           60
reasoning        80
science (est.)   60
agents           70
multimodal       0

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain           Rank   Score   Source
Agents & Tools   #86    40.0    LS
Reasoning        #11    89.0    LS

Benchmark Scores (LLM Stats)

Agents

Terminal-Bench       39.5%   SR

Biology

GPQA                 73.2%   SR

Code

HumanEval            88.4%   SR
SWE-Bench Verified   60.4%   SR
LiveCodeBench        48.0%   SR

Communication

Tau2 Telecom         73.7%   SR
Tau2 Retail          71.3%   SR
Tau2 Airline         58.0%   SR

Finance

MMLU                 89.7%   SR
MMLU-Pro             82.7%   SR

General

IFEval               89.6%   SR
CMMLU                84.3%   SR

Math

MATH-500             96.4%   SR
DROP                 79.1%   SR
AIME 2025            61.3%   SR

Reasoning

ZebraLogic           89.3%   SR

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Structured Output       90
Finance                 90
Healthcare              90
Instruction Following   90
Language                90
Legal                   90
General                 80
Math                    80
Tool Calling            70
Biology                 70
Chemistry               70
Communication           70
Physics                 70
Reasoning               70
Code                    60
Frontend Development    60
Agents                  40

Pricing

Input Price: $0.3 / 1M tokens
Output Price: $1.2 / 1M tokens
Blended Price (3:1): $0.525 / 1M tokens
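The blended figure is consistent with a weighted average of the input and output prices at three input tokens per output token. The sketch below reproduces that arithmetic and estimates the cost of a single request; reading "3:1" as the input:output token ratio is an assumption, and the function names are illustrative rather than part of any provider API.

```python
def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for a given input:output token mix.

    Assumes the "3:1" label means 3 input tokens for every output token.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Dollar cost of one request, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Listed prices: $0.3 input and $1.2 output per 1M tokens.
    print(f"blended: ${blended_price(0.3, 1.2):.3f} / 1M tokens")
    # Hypothetical request: 3,000 prompt tokens and 1,000 completion tokens.
    print(f"request: ${request_cost(3_000, 1_000, 0.3, 1.2):.4f}")
```

With the listed prices this gives (3 × 0.3 + 1 × 1.2) / 4 = 0.525, matching the blended price shown above.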

Speed

No speed data available

Available Providers

(LS internal units)
Provider   Input Price   Output Price
Meituan    300K          1.2M

External Sources