Skip to content

MiniMax M3

MiniMax M3 is MiniMax's first model with a 1M tokens context window and native multimodal input. It targets software engineering, terminal-based tool use, and agentic web browsing, with a max output of 1M tokens per request.

ReasoningTool UseVision (Image)File InputImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'minimax/minimax-m3',
prompt: 'Why is the sky blue?'
})

Frequently Asked Questions

  • What is MiniMax Sparse Attention?

    MSA is the attention variant behind MiniMax M3. It splits the key-value cache into blocks and pre-filters which blocks contribute to each query, which keeps compute manageable at the 1M tokens context length.

  • What input types does MiniMax M3 accept?

    Text, image, and video input. Output is text. Multimodality is native to MiniMax M3 rather than added through a separate vision adapter.

  • What is the context window for MiniMax M3?

    MiniMax M3 supports a context window of 1M tokens and a max output of 1M tokens per request.

  • How does MiniMax M3 compare to M2.7?

    M2.7 focuses on multi-agent orchestration, dynamic tool search, and text-only enterprise workflows. MiniMax M3 extends the series with native multimodal input, the 1M tokens context window, and the MSA architecture for long-context efficiency.

  • Is there a faster variant of MiniMax M3?

    Yes. Select minimax/minimax-m3-highspeed where your provider exposes it. The highspeed variant targets higher throughput with the same output behavior.

  • Does MiniMax M3 support automatic prompt caching?

    Yes. Automatic prompt caching is enabled by default, which reduces effective cost on repeated context patterns. $0.06 per million cached input tokens applies where the provider exposes a cached rate.

  • How do I access MiniMax M3 through the AI SDK?

    Set the model identifier to minimax/minimax-m3 in your AI SDK configuration. AI Gateway routes the request across the providers serving MiniMax M3 with configurable failover.

  • Is Zero Data Retention available for MiniMax M3?

    Zero Data Retention is not currently available for this model. Zero Data Retention is offered on a per-provider basis. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.