Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision Instruct is Meta's entry point for vision-language capability in the Llama 3.2 family. This 11B parameter model adds image understanding through a cross-attention adapter, making it an accessible starting point for multimodal applications.

Tool UseVision (Image)

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'meta/llama-3.2-11b',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Uptime Status Similar FAQ

Playground

Try out Llama 3.2 11B Vision Instruct by Meta. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Qwen 3 32B

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Llama 3.2 11B Vision Instruct

Playground