MiniMax M3
MiniMax M3 is MiniMax's first model with a 1M tokens context window and native multimodal input. It targets software engineering, terminal-based tool use, and agentic web browsing, with a max output of 1M tokens per request.
import { streamText } from 'ai'
const result = streamText({ model: 'minimax/minimax-m3', prompt: 'Why is the sky blue?'})Frequently Asked Questions
What is MiniMax Sparse Attention?
MSA is the attention variant behind MiniMax M3. It splits the key-value cache into blocks and pre-filters which blocks contribute to each query, which keeps compute manageable at the 1M tokens context length.
What input types does MiniMax M3 accept?
Text, image, and video input. Output is text. Multimodality is native to MiniMax M3 rather than added through a separate vision adapter.
What is the context window for MiniMax M3?
MiniMax M3 supports a context window of 1M tokens and a max output of 1M tokens per request.
How does MiniMax M3 compare to M2.7?
M2.7 focuses on multi-agent orchestration, dynamic tool search, and text-only enterprise workflows. MiniMax M3 extends the series with native multimodal input, the 1M tokens context window, and the MSA architecture for long-context efficiency.
Is there a faster variant of MiniMax M3?
Yes. Select
minimax/minimax-m3-highspeedwhere your provider exposes it. The highspeed variant targets higher throughput with the same output behavior.Does MiniMax M3 support automatic prompt caching?
Yes. Automatic prompt caching is enabled by default, which reduces effective cost on repeated context patterns. $0.06 per million cached input tokens applies where the provider exposes a cached rate.
How do I access MiniMax M3 through the AI SDK?
Set the model identifier to
minimax/minimax-m3in your AI SDK configuration. AI Gateway routes the request across the providers serving MiniMax M3 with configurable failover.Is Zero Data Retention available for MiniMax M3?
Zero Data Retention is not currently available for this model. Zero Data Retention is offered on a per-provider basis. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.