Llama 3.2 90B Vision Instruct

Llama 3.2 90B Vision Instruct is Meta's highest-capability vision-language model at the Llama 3.2 launch. It pairs large-scale language generation with image reasoning, a context window of 128K tokens, and support for complex multi-element visual analysis.

Tool UseVision (Image)

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'meta/llama-3.2-90b',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

Latency24 hours

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Llama 3.2 90B Vision Instruct