
Pixtral 12B 2409

Pixtral 12B 2409 is a natively multimodal model with a 400M-parameter vision encoder and a 128K-token context window. It processes images at their native resolution and supports multiple images per request.

Tool Use, Vision (Image)
index.ts
import { streamText } from 'ai'

const result = streamText({
  model: 'mistral/pixtral-12b',
  prompt: 'Why is the sky blue?'
})
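
The snippet above sends a text-only prompt. Since the model supports multiple images per request, the following is a minimal sketch of how such a request could be assembled. It assumes the AI SDK's user-message format, where `content` is an array of `{ type: 'text' }` and `{ type: 'image' }` parts. The helper name `buildMultiImageMessage` and the example.com URLs are hypothetical and used only for illustration.

```typescript
// Part shapes mirror the AI SDK's multimodal user-message format (assumption:
// image parts accept a URL string; binary data or URL objects may also work).
type TextPart = { type: 'text'; text: string };
type ImagePart = { type: 'image'; image: string };
type UserMessage = { role: 'user'; content: Array<TextPart | ImagePart> };

// Hypothetical helper: one user message holding a prompt followed by N images.
function buildMultiImageMessage(prompt: string, imageUrls: string[]): UserMessage {
  const imageParts = imageUrls.map(
    (url): ImagePart => ({ type: 'image', image: url })
  );
  return {
    role: 'user',
    content: [{ type: 'text', text: prompt }, ...imageParts],
  };
}

// Placeholder image URLs, not real assets.
const message = buildMultiImageMessage('What differs between these two charts?', [
  'https://example.com/chart-a.png',
  'https://example.com/chart-b.png',
]);
```

The resulting message would then be passed as `messages: [message]` in place of `prompt` in the `streamText` call. Keeping the text part first and the images after it is a design choice of this sketch, not a requirement stated in the source.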