Question 1

How does MiMo V2 Pro differ from MiMo v2 Flash?

Accepted Answer

It's the Pro tier. MiMo V2 Pro targets harder reasoning, math, and code than Flash, with higher per-token cost and somewhat lower throughput than Flash.

Question 2

What architecture does MiMo V2 Pro use?

Accepted Answer

A Mixture-of-Experts (MoE) setup: each forward pass activates a subset of parameters, which keeps inference cost manageable while the full parameter count holds broader knowledge.

Question 3

What's the context window for MiMo V2 Pro?

Accepted Answer

1M tokens. Hybrid sliding window attention reduces KV-cache use so long-context runs stay practical.

Question 4

How do I authenticate requests to MiMo V2 Pro through AI Gateway?

Accepted Answer

Add your API key in AI Gateway project settings. Use `xiaomi/mimo-v2-pro` in API calls. AI Gateway routes, retries, and fails over across `xiaomi`.

Question 5

What does MiMo V2 Pro cost?

Accepted Answer

See the pricing section on this page for today's rates. AI Gateway exposes each provider's pricing for MiMo V2 Pro.

Question 6

Can I route between MiMo V2 Pro and the Flash variant automatically?

Accepted Answer

Yes. AI Gateway supports fallback and routing. You can send hard requests to MiMo V2 Pro and fall back to Flash for simpler tasks to control cost.

Question 7

What tasks is MiMo V2 Pro best suited for?

Accepted Answer

Multi-step reasoning, code generation, math, and long-context analysis. For short or simple jobs, Flash is usually cheaper.

Question 8

Is MiMo V2 Pro available under an open-source license?

Accepted Answer

Yes. The MiMo v2 line is under the MIT license, which allows commercial use, modification, and redistribution.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

MiMo V2 Pro

Frequently Asked Questions