Question 1

What is MiniMax Sparse Attention?

Accepted Answer

MSA is the attention variant behind MiniMax M3. It splits the key-value cache into blocks and pre-filters which blocks contribute to each query, which keeps compute manageable at the 1M tokens context length.

Question 2

What input types does MiniMax M3 accept?

Accepted Answer

Text, image, and video input. Output is text. Multimodality is native to MiniMax M3 rather than added through a separate vision adapter.

Question 3

What is the context window for MiniMax M3?

Accepted Answer

MiniMax M3 supports a context window of 1M tokens and a max output of 1M tokens per request.

Question 4

How does MiniMax M3 compare to M2.7?

Accepted Answer

M2.7 focuses on multi-agent orchestration, dynamic tool search, and text-only enterprise workflows. MiniMax M3 extends the series with native multimodal input, the 1M tokens context window, and the MSA architecture for long-context efficiency.

Question 5

Is there a faster variant of MiniMax M3?

Accepted Answer

Yes. Select `minimax/minimax-m3-highspeed` where your provider exposes it. The highspeed variant targets higher throughput with the same output behavior.

Question 6

Does MiniMax M3 support automatic prompt caching?

Accepted Answer

Yes. Automatic prompt caching is enabled by default, which reduces effective cost on repeated context patterns. $0.06 per million cached input tokens applies where the provider exposes a cached rate.

Question 7

How do I access MiniMax M3 through the AI SDK?

Accepted Answer

Set the model identifier to `minimax/minimax-m3` in your AI SDK configuration. AI Gateway routes the request across the providers serving MiniMax M3 with configurable failover.

Question 8

Is Zero Data Retention available for MiniMax M3?

Accepted Answer

Yes, Zero Data Retention is available for this model. Zero Data Retention is offered on a per-provider basis. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

MiniMax M3

Frequently Asked Questions