Llama 3.2 90B Vision Instruct
Llama 3.2 90B Vision Instruct is the largest and most capable vision-language model in Meta's Llama 3.2 release. It combines large-scale language generation with image reasoning over a 128K-token context window, and it can analyze images that contain several interacting elements, such as charts and graphs.
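import { streamText } from 'ai'
// Start a streaming generation, then print each text chunk as it arrives.
const result = streamText({ model: 'meta/llama-3.2-90b', prompt: 'Why is the sky blue?' })
for await (const chunk of result.textStream) process.stdout.write(chunk)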
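The snippet above sends a text-only prompt. Since this is a vision model, the sketch below shows one way an image question might be shaped as a user message with a text part and an image part. The helper name `buildImagePrompt`, the local types, and the example URL are hypothetical and for illustration only; the part shapes are assumed to match the AI SDK's message format.

```typescript
// Hypothetical local types mirroring the text and image parts of a user message.
type TextPart = { type: 'text'; text: string }
type ImagePart = { type: 'image'; image: URL }
type UserMessage = { role: 'user'; content: Array<TextPart | ImagePart> }

// Hypothetical helper: pairs a question with one image in a single user message.
function buildImagePrompt(question: string, imageUrl: string): UserMessage[] {
  return [
    {
      role: 'user',
      content: [
        { type: 'text', text: question },
        // new URL() throws a TypeError if the string is not a valid URL.
        { type: 'image', image: new URL(imageUrl) },
      ],
    },
  ]
}

// Placeholder URL for illustration only.
const messages = buildImagePrompt(
  'What does this chart show?',
  'https://example.com/chart.png',
)

// Assumed usage: pass `messages` in place of `prompt`, e.g.
// streamText({ model: 'meta/llama-3.2-90b', messages })
```

Keeping the message construction in a small pure function makes it easy to check the payload shape before any request is sent.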