LongCat-Flash-Chat

Meituan · Open Weight · MIT · Commercial OK

Description

LongCat-Flash-Chat is Meituan's first open-source foundation model: a 560B-parameter Mixture-of-Experts (MoE) model that dynamically activates 18.6B to 31.3B parameters (~27B on average) depending on contextual demands. It uses Zero-Computation Experts for efficient routing and supports a 128K context window. Optimized for conversational and agentic tasks, it performs competitively on reasoning, coding, instruction-following, and domain benchmarks, with particular strengths in tool use and complex multi-step interactions. It achieves over 100 tokens per second on H800 GPUs.
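The idea of a variable active-parameter count can be illustrated with a toy router. This is a minimal sketch, not Meituan's implementation: it assumes that zero-computation experts behave as identity placeholders that a token can be routed to at no FFN cost, so the number of real experts selected (and therefore the parameters touched) varies per input. The function names `route_token` and `active_params`, the expert counts, and the parameter sizes are all hypothetical.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of router logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def route_token(logits, n_ffn_experts, top_k):
    """Select top_k experts for one token.

    Indices below n_ffn_experts are ordinary FFN experts; the remaining
    indices are ASSUMED to be zero-computation (identity) experts.
    Returns the chosen indices and how many of them are real FFN experts.
    """
    probs = softmax(logits)
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    chosen = ranked[:top_k]
    n_real = sum(1 for i in chosen if i < n_ffn_experts)
    return chosen, n_real

def active_params(n_real, shared_params, params_per_expert):
    """Parameters touched for one token under this toy accounting."""
    return shared_params + n_real * params_per_expert

# Hypothetical toy sizes, NOT LongCat-Flash-Chat's real configuration.
# Experts 0-1 are FFN experts, experts 2-3 are zero-computation experts.
logits = [0.1, 2.0, 3.0, 1.0]
chosen, n_real = route_token(logits, n_ffn_experts=2, top_k=2)
print(chosen, n_real, active_params(n_real, shared_params=10, params_per_expert=5))

# Average active share implied by the figures in the description (~27B of 560B).
avg_active_fraction = 27 / 560
print(f"{avg_active_fraction:.1%}")
```

In this toy run one of the two selected experts is a zero-computation expert, so only one FFN expert contributes to the active count. Using the figures from the description, the average active share works out to roughly 4.8% of the total parameters.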

Release Date

2025-08-29

Parameters

560.0B

Context Length

128K (per the description above)

Modalities

text

Capability Radar

Axes: general, coding, reasoning, science (est.), agents, multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
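The fallback described in the note can be sketched as follows. This is an illustrative guess at the logic, not the site's actual code; the function name `science_score` and the dictionary keys are hypothetical.

```python
def science_score(scores):
    """Return (value, is_estimated) for the science axis.

    Uses a dedicated science score when present; otherwise falls back to
    the reasoning score and flags the result as an estimate.
    """
    if scores.get("science") is not None:
        return scores["science"], False
    return scores.get("reasoning"), True

# With only a reasoning score available, the science axis is estimated from it.
print(science_score({"reasoning": 89.0}))
```

The boolean flag corresponds to the "est." marker on the science axis.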

Rankings

Domain	#Rank	Score	Source
Agentic Capability	104	40.0	LS
Reasoning	11	89.0	LS

Benchmark Scores (LLM Stats)

Category	Benchmark	Score	Source
Agents	Terminal-Bench	39.5%	SR
Biology	GPQA	73.2%	SR
Code	HumanEval	88.4%	SR
Code	SWE-Bench Verified	60.4%	SR
Code	LiveCodeBench	48.0%	SR
Communication	Tau2 Telecom	73.7%	SR
Communication	Tau2 Retail	71.3%	SR
Communication	Tau2 Airline	58.0%	SR
Finance	MMLU	89.7%	SR
Finance	MMLU-Pro	82.7%	SR
General	IFEval	89.6%	SR
General	CMMLU	84.3%	SR
Math	MATH-500	96.4%	SR
Math	DROP	79.1%	SR
Math	AIME 2025	61.3%	SR
Reasoning	ZebraLogic	89.3%	SR

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Chart categories (score values not recoverable): Instruction Following, Language, Legal, Structured Output, Finance, Healthcare, Math, General, Physics, Reasoning, Biology, Chemistry, Communication, Tool Calling, Frontend Development, Code, Agents

Pricing

No pricing data available

Speed

No speed data available

Provider Price Ranking

No provider data available

External Sources

LLM Stats · Artificial Analysis