MiMo V2 Flash
MiMo V2 Flash is Xiaomi's MiMo v2 Flash MoE reasoning model with 309B total parameters and 15B active per forward pass, using hybrid attention and multi-token prediction for inference efficiency. It supports a context window of 262.1K tokens at $0.1 per million input tokens and $0.3 per million output tokens.
import { streamText } from 'ai'
const result = streamText({ model: 'xiaomi/mimo-v2-flash', prompt: 'Why is the sky blue?'})