MiniMax M2.5

MiniMax M2.5 is a third-generation agentic model from MiniMax that handles full-stack development across Web, Android, iOS, Windows, and Mac platforms. It supports a context window of 1M tokens, a max output of 196K tokens, and completes tasks about 37% faster than M2.1. Your use is subject to MiniMax's Terms and Privacy Policy.

Capabilities: Reasoning, Tool Use, Implicit Caching

Use with AI Gateway

index.ts

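import { streamText } from 'ai'

// Start a streaming request for MiniMax M2.5, identified by its model slug.
const result = streamText({
  model: 'minimax/minimax-m2.5',
  prompt: 'Why is the sky blue?'
})

// Print the response as it streams in.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}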

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
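As a rough sketch of what setting a provider preference might look like: the helper below builds a `providerOptions` object with an ordered list of provider slugs. It assumes the gateway reads an `order` array under `providerOptions.gateway`; the helper name `preferProviders` and the slugs `'parasail'` and `'minimax'` are illustrative assumptions, so copy the real slugs from the provider list.

```typescript
// Shape of the options object this sketch assumes the gateway accepts.
type GatewayProviderOptions = { gateway: { order: string[] } }

// Hypothetical helper: builds a provider preference from slugs, in priority order.
function preferProviders(...slugs: string[]): GatewayProviderOptions {
  if (slugs.length === 0) {
    throw new Error('at least one provider slug is required')
  }
  // Drop repeated slugs while keeping the first occurrence of each.
  const order = slugs.filter((slug, i) => slugs.indexOf(slug) === i)
  return { gateway: { order } }
}

// Assumed slugs, for illustration only.
const providerOptions = preferProviders('parasail', 'minimax')
console.log(JSON.stringify(providerOptions))

// Intended use with the request from index.ts:
// streamText({ model: 'minimax/minimax-m2.5', prompt: '...', providerOptions })
```

Keeping the preference in one small helper means the slug order is defined once and can be reused across requests.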

Provider	Context	Max Output	Latency	Throughput	Input	Output	Cache	Web Search	Capabilities	ZDR	No Training	Release Date
MiniMax (Legal: Terms • Privacy)	205K	131K	0.9s	70tps	$0.30/M	$1.20/M	Read: $0.03/M, Write: $0.38/M	—				02/12/2026
Nebius (Legal: Terms • Privacy)	196K		0.9s	81tps	$0.30/M	$1.20/M		—				02/12/2026
Parasail (Legal: Terms • Privacy)	197K	131K	0.6s	125tps	$0.30/M	$1.20/M		—				02/12/2026
DeepInfra (Legal: Terms • Privacy)	197K	131K	20.7s	10tps	$0.27/M	$0.95/M	Read: $0.03/M, Write: —	—				02/12/2026
Bedrock (Legal: Terms • Privacy)			0.7s	73tps	$0.30/M	$1.20/M		—				02/12/2026
Blackbox AI (Legal: Terms • Privacy)	128K				$0.07/M	$0.57/M		—				02/12/2026
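To make the per-million-token prices concrete, here is a minimal sketch that estimates the cost of a single request per provider. The input and output prices are copied from the provider listing above; the names `pricing` and `estimateCost` are illustrative, and the estimate ignores cache read/write pricing.

```typescript
// USD per one million tokens, taken from the provider listing above.
interface Pricing {
  inputPerM: number
  outputPerM: number
}

const pricing: Record<string, Pricing> = {
  MiniMax: { inputPerM: 0.3, outputPerM: 1.2 },
  Nebius: { inputPerM: 0.3, outputPerM: 1.2 },
  Parasail: { inputPerM: 0.3, outputPerM: 1.2 },
  DeepInfra: { inputPerM: 0.27, outputPerM: 0.95 },
  Bedrock: { inputPerM: 0.3, outputPerM: 1.2 },
  'Blackbox AI': { inputPerM: 0.07, outputPerM: 0.57 }
}

// Hypothetical helper: cost in USD for one request, without cache discounts.
function estimateCost(provider: string, inputTokens: number, outputTokens: number): number {
  const p = pricing[provider]
  if (!p) {
    throw new Error(`unknown provider: ${provider}`)
  }
  return (inputTokens / 1e6) * p.inputPerM + (outputTokens / 1e6) * p.outputPerM
}

// Example: 100,000 input tokens and 20,000 output tokens on MiniMax.
console.log(estimateCost('MiniMax', 100000, 20000).toFixed(4))
```

For the example request this works out to roughly $0.054 on MiniMax (0.1 × $0.30 + 0.02 × $1.20).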
