LongCat Flash Thinking
LongCat Flash Thinking is Meituan's 560B MoE reasoning model. It combines Lean4 formal proof capability, agentic tool use, and an ARC-AGI score of 50.3 in a single architecture.
```typescript
import { streamText } from 'ai'

const result = streamText({
  model: 'meituan/longcat-flash-thinking',
  prompt: 'Why is the sky blue?',
})
```
Playground
Try out LongCat Flash Thinking by Meituan. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
About LongCat Flash Thinking
LongCat Flash Thinking unifies deep thinking, tool calling, and formal mathematical reasoning in a single architecture. The reasoning process itself can invoke tools mid-thought rather than thinking first and calling tools as a separate phase.
The Agentic Reasoning Framework implements dual-path inference. The model evaluates each task and autonomously chooses between direct reasoning and tool-augmented reasoning based on complexity. Callers don't configure this routing. The model applies tool invocation where it helps and direct reasoning where it doesn't, without caller overhead.
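From the caller's side, tool-augmented reasoning only requires registering tools; the model decides per request whether to invoke them mid-reasoning. A minimal sketch using the AI SDK's `tools` option — the `calculator` tool, its schema, and its handler are hypothetical, the option is named `inputSchema` in AI SDK 5 (older releases use `parameters`), and a live call requires the `ai` and `zod` packages plus AI Gateway credentials:

```typescript
import { streamText, tool } from 'ai'
import { z } from 'zod'

// Hypothetical tool: the model may call it mid-reasoning, or answer directly.
const result = streamText({
  model: 'meituan/longcat-flash-thinking',
  prompt: 'What is 37! divided by 35!?',
  tools: {
    calculator: tool({
      description: 'Evaluate a basic arithmetic expression',
      inputSchema: z.object({ expression: z.string() }),
      execute: async ({ expression }) => {
        // Stand-in handler: wire this to a real evaluator in production.
        return `result of ${expression}`
      },
    }),
  },
})
```

With tools registered, the dual-path routing decides per request whether to call `calculator` during thinking or reason directly; no extra routing configuration is needed.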
Benchmarks at release: ARC-AGI 50.3 (abstract pattern reasoning), LiveCodeBench 79.4 (competitive programming), τ²-Bench 74.0 (agentic tool use, as reported), and MiniF2F-test 67.6 pass@1 (formal mathematical proof via Lean4). Meituan also reported a 64.5% token-efficiency improvement in agentic tool-use settings while retaining 90% task accuracy. For methodology and updates, see the technical post.
Providers
Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
Provider listings report the following metrics; visit the docs for more info:
- P50 throughput on live AI Gateway traffic, in tokens per second (TPS).
- P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds.
- Direct request success rate, on AI Gateway overall and per provider.
What To Consider When Choosing a Provider
- Configuration: Flash Thinking's extended reasoning traces increase response latency and per-response token consumption compared to Flash Chat. Configure request timeouts and per-session cost budgets accordingly.
- Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
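Given the longer reasoning traces, it helps to bound each call explicitly. One pattern is `AbortSignal.timeout` (Node 17.3+), whose signal can be passed to `streamText({ ..., abortSignal })` in the AI SDK; the sketch below demonstrates the abort behavior against a stand-in long-running call, and the millisecond values are illustrative, not recommendations:

```typescript
// Stand-in for a long-running model call; a real call would be
// streamText({ model: 'meituan/longcat-flash-thinking', abortSignal: signal, ... }).
function slowCall(signal: AbortSignal): Promise<string> {
  return new Promise((resolve, reject) => {
    const timer = setTimeout(() => resolve('done'), 10_000)
    signal.addEventListener('abort', () => {
      clearTimeout(timer)
      reject(new Error('request timed out'))
    })
  })
}

async function main() {
  try {
    // Illustrative 100 ms budget; reasoning workloads warrant far longer.
    await slowCall(AbortSignal.timeout(100))
  } catch (err) {
    console.log((err as Error).message) // "request timed out"
  }
}
main()
```

Pair the wall-clock cap with a per-response token cap (the AI SDK exposes `maxOutputTokens`) to keep per-session cost budgets predictable.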
When to Use LongCat Flash Thinking
Best For
- Complex STEM reasoning: Physics, mathematics, and algorithmic problems where deliberative chain-of-thought improves accuracy over direct response
- Formal proof workflows: Lean4 verification where proof generation and verification need to integrate
- Competitive programming: Algorithmic challenges where LiveCodeBench-level performance is the benchmark
- Autonomous tool selection: Agentic workflows where dual-path inference mid-reasoning matters
- Traceable reasoning chains: Research tasks that need explicit reasoning over opaque responses
Consider Alternatives When
- Low-latency conversations: Conversational speed and low per-response latency are the priority (LongCat Flash Chat is the high-throughput direct-response variant)
- Simple instruction following: Tasks don't benefit from extended deliberation overhead
- Multimodal input required: You need image, audio, or video input alongside text
Conclusion
LongCat Flash Thinking combines multi-domain reasoning breadth with Lean4 integration for formally verified mathematical proofs. For workloads that require reasoning depth, formal verification, and autonomous tool-augmented thinking, it is the reasoning-focused option in the LongCat-Flash family. See Meituan's release notes.
Frequently Asked Questions
What does Lean4 formal proof capability enable in practice?
Lean4 is a proof assistant: mathematical claims are stated in its formal language and machine-checked. LongCat Flash Thinking scores 67.6 pass@1 on MiniF2F-test when generating Lean4 proofs, meaning it produces formally verifiable proofs rather than informal natural-language arguments. This matters for theorem proving, formal verification, and rigorous mathematical research.
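For concreteness, MiniF2F-style tasks ask the model to close a formally stated goal; the toy example below (far simpler than actual benchmark problems, which are olympiad-level) shows the shape of a Lean4 statement and a machine-checkable proof:

```lean
-- Toy goal: the sum of two even naturals is even.
-- Lean's kernel verifies the proof term; there is nothing to trust informally.
theorem even_add_even (a b : Nat)
    (ha : ∃ k, a = 2 * k) (hb : ∃ k, b = 2 * k) :
    ∃ k, a + b = 2 * k := by
  cases ha with
  | intro m hm =>
    cases hb with
    | intro n hn =>
      exact ⟨m + n, by rw [hm, hn, Nat.mul_add]⟩
```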
What is the Agentic Reasoning Framework's dual-path inference?
The model autonomously decides whether each task benefits from direct reasoning or tool invocation during the thinking process. Callers don't configure this routing. Meituan reported a 64.5% token efficiency gain in agent tool-use settings while retaining 90% task accuracy.
What are the key benchmark scores for LongCat Flash Thinking?
ARC-AGI: 50.3; LiveCodeBench: 79.4; τ²-Bench: 74.0 (reported at release); MiniF2F-test: 67.6 pass@1 on formal mathematical proof. Full tables are in the technical post.
Is LongCat Flash Thinking open-source?
Yes. Weights and licensing are published alongside Meituan's technical post.