Skip to content

Grok 2 Vision

Grok 2 vision model excels in vision-based tasks, delivering state-of-the-art performance in visual math reasoning (MathVista) and document-based question answering (DocVQA). It can process a wide variety of visual information including documents, diagrams, charts, screenshots, and photographs.

Tool UseVision (Image)
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'xai/grok-2-vision',
prompt: 'Why is the sky blue?'
})

Playground

Try out Grok 2 Vision by xAI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.