MiniMax M2.1 Lightning
MiniMax M2.1 Lightning is the throughput-optimized variant of MiniMax-M2.1. It supports a context window of 204.8K tokens and a max output of 131.1K tokens per request.
import { streamText } from 'ai'
const result = streamText({ model: 'minimax/minimax-m2.1-lightning', prompt: 'Why is the sky blue?'})