MiMo V2 Pro
MiMo V2 Pro is the Pro variant in Xiaomi's MiMo v2 family with over 1T total parameters and 42B active, built for math, code, and multi-step reasoning within a context window of 1M tokens. It uses a hybrid attention architecture for long-context processing.
import { streamText } from 'ai'
const result = streamText({ model: 'xiaomi/mimo-v2-pro', prompt: 'Why is the sky blue?'})Frequently Asked Questions
How does MiMo V2 Pro differ from MiMo v2 Flash?
It's the Pro tier. MiMo V2 Pro targets harder reasoning, math, and code than Flash, with higher per-token cost and somewhat lower throughput than Flash.
What architecture does MiMo V2 Pro use?
A Mixture-of-Experts (MoE) setup: each forward pass activates a subset of parameters, which keeps inference cost manageable while the full parameter count holds broader knowledge.
What's the context window for MiMo V2 Pro?
1M tokens. Hybrid sliding window attention reduces KV-cache use so long-context runs stay practical.
How do I authenticate requests to MiMo V2 Pro through AI Gateway?
Add your API key in AI Gateway project settings. Use
xiaomi/mimo-v2-proin API calls. AI Gateway routes, retries, and fails over across xiaomi.What does MiMo V2 Pro cost?
See the pricing section on this page for today's rates. AI Gateway exposes each provider's pricing for MiMo V2 Pro.
Can I route between MiMo V2 Pro and the Flash variant automatically?
Yes. AI Gateway supports fallback and routing. You can send hard requests to MiMo V2 Pro and fall back to Flash for simpler tasks to control cost.
What tasks is MiMo V2 Pro best suited for?
Multi-step reasoning, code generation, math, and long-context analysis. For short or simple jobs, Flash is usually cheaper.
Is MiMo V2 Pro available under an open-source license?
Yes. The MiMo v2 line is under the MIT license, which allows commercial use, modification, and redistribution.