GPT OSS 20B

GPT OSS 20B is OpenAI's smaller open-weight model with roughly 21 billion total parameters and 3.6 billion active per token, designed for low-latency, agentic, and on-device workloads.

Reasoning, Tool Use
index.ts
import { streamText } from 'ai';

// Start a streaming text generation against GPT OSS 20B.
const result = streamText({
  model: 'openai/gpt-oss-20b',
  prompt: 'Why is the sky blue?',
});
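The `index.ts` snippet creates a stream but does not read from it. The following is a minimal sketch of one way to gather streamed text into a single string. The helper name `collectText` and the stand-in generator `fakeStream` are hypothetical and are not part of the AI SDK; they are used here so the sketch runs without network access or credentials.

```typescript
// Collects every chunk from an async iterable of strings into one string.
async function collectText(chunks: AsyncIterable<string>): Promise<string> {
  let text = '';
  for await (const chunk of chunks) {
    text += chunk;
  }
  return text;
}

// Hypothetical stand-in for a model stream, so the sketch is self-contained.
async function* fakeStream(): AsyncGenerator<string> {
  yield 'Rayleigh ';
  yield 'scattering.';
}

// Log the assembled text once the stream is exhausted.
collectText(fakeStream()).then((text) => console.log(text));
```

With the AI SDK, the same helper should accept `result.textStream`, which the SDK exposes as an async iterable of text chunks, for example `const answer = await collectText(result.textStream)`.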

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

| Provider | Legal | Context | Latency | Throughput | Input | Output | Cache | Release Date |
|---|---|---|---|---|---|---|---|---|
| Amazon Bedrock | Terms, Privacy | 128K | 0.4 s | 219 tps | $0.07/M | $0.30/M | - | 08/05/2025 |
| Fireworks | Terms, Privacy | 128K | 0.9 s | 65 tps | $0.07/M | $0.30/M | Read: $0.04/M | 08/05/2025 |
| Groq | Terms, Privacy | 131K | 0.3 s | - | $0.07/M | $0.30/M | Read: $0.04/M | 08/05/2025 |
| DeepInfra | Terms, Privacy | 131K | 0.4 s | 39 tps | $0.03/M | $0.14/M | - | 08/05/2025 |
| Together AI | Terms, Privacy | 131K | 0.4 s | 115 tps | $0.05/M | $0.20/M | - | 08/05/2025 |
| Novita AI | Terms, Privacy | 131K | 0.6 s | 90 tps | $0.04/M | $0.15/M | - | 08/05/2025 |
| Parasail | Terms, Privacy | 131K | 0.5 s | 118 tps | $0.04/M | $0.20/M | - | 08/05/2025 |
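
To compare the listed prices for a concrete workload, the per-million-token rates can be turned into a cost estimate. The sketch below copies the input and output prices from the provider listing and ranks providers by estimated cost. The names `PRICING`, `estimateCostUsd`, and `rankByCost` are hypothetical, the token counts are an illustrative workload, and cache-read pricing is deliberately ignored.

```typescript
interface ProviderPricing {
  provider: string;
  inputPerMillion: number;  // USD per 1M input tokens
  outputPerMillion: number; // USD per 1M output tokens
}

// Input/output prices as listed for GPT OSS 20B.
const PRICING: ProviderPricing[] = [
  { provider: 'Amazon Bedrock', inputPerMillion: 0.07, outputPerMillion: 0.30 },
  { provider: 'Fireworks', inputPerMillion: 0.07, outputPerMillion: 0.30 },
  { provider: 'Groq', inputPerMillion: 0.07, outputPerMillion: 0.30 },
  { provider: 'DeepInfra', inputPerMillion: 0.03, outputPerMillion: 0.14 },
  { provider: 'Together AI', inputPerMillion: 0.05, outputPerMillion: 0.20 },
  { provider: 'Novita AI', inputPerMillion: 0.04, outputPerMillion: 0.15 },
  { provider: 'Parasail', inputPerMillion: 0.04, outputPerMillion: 0.20 },
];

// Estimated cost in USD for a given number of input and output tokens.
function estimateCostUsd(
  pricing: ProviderPricing,
  inputTokens: number,
  outputTokens: number,
): number {
  return (
    (inputTokens * pricing.inputPerMillion +
      outputTokens * pricing.outputPerMillion) /
    1e6
  );
}

// Providers sorted from cheapest to most expensive for the given workload.
function rankByCost(
  inputTokens: number,
  outputTokens: number,
): { provider: string; costUsd: number }[] {
  return PRICING.map((p) => ({
    provider: p.provider,
    costUsd: estimateCostUsd(p, inputTokens, outputTokens),
  })).sort((a, b) => a.costUsd - b.costUsd);
}

// Illustrative workload: 2M input tokens and 500K output tokens.
for (const { provider, costUsd } of rankByCost(2_000_000, 500_000)) {
  console.log(`${provider}: $${costUsd.toFixed(3)}`);
}
```

Price is only one axis. The same ranking approach could be applied to the latency or throughput columns when responsiveness matters more than cost.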