Skip to content

Llama 3.3 70B Instruct

Llama 3.3 70B Instruct is Meta's refined text-only model. It targets 405B-class results at 70B serving cost, with improved instruction following and multilingual capability.

Tool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'meta/llama-3.3-70b',
prompt: 'Why is the sky blue?'
})
Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.