MiniMax M2.1

MiniMax M2.1 is MiniMax's second-generation model, focused on coding accuracy, tool use, instruction following, and long-horizon planning. It supports a context window of 204.8K tokens and a max output of 131.1K tokens per request.

ReasoningTool UseImplicit Caching

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'minimax/minimax-m2.1',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	ZDR	No Training	Release Date

MiniMax

205K

0.9s

52tps

$0.30/M

$1.20/M

Read:$0.03/M

Write:$0.38/M

—

12/23/2025

Novita AI

205K

1.6s

75tps

$0.30/M

$1.20/M

Read:$0.03/M

Write:—

—

12/23/2025

Amazon Bedrock

205K

0.8s

94tps

$0.30/M

$1.20/M

Read:$0.15/M

Write:—

—

12/23/2025

Agent Stack

Core Platform

Tools

Learn

Build

Explore

MiniMax M2.1

Providers