MiMo V2 Flash

xiaomi/mimo-v2-flash

Xiaomi MiMo-V2-Flash is a proprietary MoE model developed by Xiaomi, designed for extreme inference efficiency with 309B total parameters (15B active). By incorporating an innovative Hybrid attention architecture and multi-layer MTP inference acceleration, it ranks among the top 2 global open-source models across multiple Agent benchmarks.

ReasoningTool Use

import { streamText } from 'ai'

const result = streamText({
  model: 'xiaomi/mimo-v2-flash',
  prompt: 'Why is the sky blue?'
})

Playground

Try out MiMo V2 Flash by Xiaomi. Usage is billed to your team at API rates. Free users get $5 of credits every 30 days, and you are considered a free user if you haven't made a payment.

Chat with

Providers

The AI Gateway supports routing requests across multiple AI providers. You can control provider preferences using the provider slugs available for copying with the buttons below. For more see the AI Gateway provider options documentation. By using the AI provider you acknowledge you reviewed and agree to their terms listed in the Legal section under the AI provider's name.

Provider

Context	Max Output	Latency	Throughput	Input	Output	Cache	Image Gen	Video Gen	Web Search	Capabilities	Release Date

Legal:Terms

•

Privacy

262K

32K

1.9s

139tps

$0.10/M

$0.30/M

Read:$0.02/M

Write:—

—

Dec 17, 2025

Legal:Terms

•

Privacy

262K

32K

1.7s

17tps

$0.09/M

$0.29/M

—

Dec 17, 2025

AI Cloud

Core Platform

Security

Company

Learn

Open Source

Use Cases

Tools

Users

MiMo V2 Flash

Playground

Providers

Playground

Providers