Skip to content

MiniMax M2.1

MiniMax M2.1 is MiniMax's second-generation model, focused on coding accuracy, tool use, instruction following, and long-horizon planning. It supports a context window of 204.8K tokens and a max output of 131.1K tokens per request.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'minimax/minimax-m2.1',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
MiniMax
205K
4.9s
485tps
$0.30/M$1.20/M
Read:$0.03/M
Write:$0.38/M
+1
10/27/2025
Novita AI
205K
1.2s
130tps
$0.30/M$1.20/M
Read:$0.03/M
Write:
+1
10/27/2025
Amazon Bedrock
205K
0.5s
132tps
$0.30/M$1.20/M
Read:$0.15/M
Write:
+1
10/27/2025