Skip to content

Llama 4 Scout 17B 16E Instruct

Llama 4 Scout 17B 16E Instruct is a natively multimodal Mixture of Experts (MoE) model with a context window of 131.1K tokens, purpose-built for processing entire codebases, multi-document corpora, and extended user activity logs in a single inference call.

Tool UseVision (Image)
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'meta/llama-4-scout',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
DeepInfra
Legal:Terms
Privacy
131K
0.3s
38tps
$0.08/M$0.30/M
04/05/2025
Groq
Legal:Terms
Privacy
131K
0.4s
$0.11/M$0.34/M
04/05/2025
Amazon Bedrock
Legal:Terms
Privacy
128K
0.2s
176tps
$0.17/M$0.66/M
04/05/2025