Skip to content
Find…
F
blog – AI Gateway – Vercel
blog
blog
AI Gateway
Model List
Model List
Give Feedback
Overview
Deployments
Logs
Analytics
Speed Insights
Observability
Firewall
CDN
Domains
Integrations
Storage
Flags
Agent
AI Gateway
Sandboxes
Workflows
Usage
Support
Settings
AI Gateway
Overview
Quick Start
Model List
API Keys
Bring Your Own Key
Templates
Leaderboards
Playground
Documentation
Account Settings
Feedback
Theme
Select a display theme:
system
light
dark
Home Page
Changelog
Help
Docs
Log Out
Platform Status
Loading status…
Browse Models
A catalog of models to help you build AI features for your Vercel project.
All
Text
Code
Image
Video
Embed
Rerank
All Use Cases
All Providers
Sort by
Release Date
Columns
(13)
Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Released
moo
nshotai/k
imi-k2.6
262K
3.6s
38tps
$0.95/M
$4.00/M
Read:
$0.16/M
Write:
—
—
—
04/20/2026
ali
baba/qwen-3.6-max
-preview
240K
2.4s
71tps
$1.30/M
$7.80/M
Read:
$0.26/M
Write:
$1.63/M
—
—
04/20/2026
ant
hropic/claude-
opus-4.7
1M
1.5s
99tps
$5.00/M
$25.00/M
Read:
$0.5/M
Write:
$6.25/M
$10/K
+ input costs
—
04/16/2026
byt
edance/seed
ance-2.0
—
—
04/14/2026
byt
edance/seedance-
2.0-fast
—
—
04/14/2026
zai/glm-5.1
203K
1.5s
34tps
$1.40/M
$4.40/M
Read:
$0.26/M
Write:
—
—
—
04/07/2026
ali
baba/qwen
3.6-plus
1M
0.6s
94tps
$0.50/M
$3.00/M
Read:
$0.1/M
Write:
$0.63/M
—
—
04/02/2026
goo
gle/gemma-
4-31b-it
262K
0.6s
13tps
$0.14/M
$0.40/M
—
—
04/02/2026
goo
gle/gemma-4-26
b-a4b-it
262K
0.6s
48tps
$0.13/M
$0.40/M
—
—
04/02/2026
arc
ee-ai/trinity-large-
thinking
262K
0.3s
238tps
$0.25/M
$0.90/M
—
—
04/01/2026
zai
/glm-
5v-turbo
200K
5.2s
112tps
$1.20/M
$4.00/M
Read:
$0.24/M
Write:
—
—
—
04/01/2026
kwa
ipilot/kat-code
r-pro-v2
256K
3.2s
102tps
$0.30/M
$1.20/M
Read:
$0.06/M
Write:
—
—
—
03/27/2026
min
imax/mini
max-m2.7
205K
0.4s
107tps
$0.30/M
$1.20/M
Read:
$0.06/M
Write:
$0.38/M
—
—
+1
03/18/2026
min
imax/minimax-m2.7-h
ighspeed
205K
1.5s
52tps
$0.60/M
$2.40/M
Read:
$0.06/M
Write:
$0.38/M
—
—
03/18/2026
xia
omi/mim
o-v2-pro
1M
2.7s
52tps
$1.00/M
$3.00/M
Read:
$0.2/M
Write:
—
—
—
03/18/2026
nvi
dia/nemotron-3-super-1
20b-a12b
256K
1.8s
135tps
$0.15/M
$0.65/M
—
—
03/18/2026
ope
nai/gpt-
5.4-mini
400K
1.3s
137tps
$0.75/M
$4.50/M
Read:
$0.07/M
Write:
—
$10.00/K
+ input costs
—
03/17/2026
ope
nai/gpt-
5.4-nano
400K
0.6s
40tps
$0.20/M
$1.25/M
Read:
$0.02/M
Write:
—
$10.00/K
+ input costs
—
03/17/2026
zai
/glm
-5-turbo
203K
4.3s
60tps
$1.20/M
$4.00/M
Read:
$0.24/M
Write:
—
—
—
03/15/2026
xai
/grok-4.20-reason
ing-beta
2M
0.6s
164tps
$2.00/M
$6.00/M
Read:
$0.2/M
Write:
—
—
—
03/11/2026
xai
/grok-4.20-non-reason
ing-beta
2M
0.4s
93tps
$2.00/M
$6.00/M
Read:
$0.2/M
Write:
—
—
—
03/11/2026
xai
/grok-4.20-multi-ag
ent-beta
2M
4.2s
1519tps
$2.00/M
$6.00/M
Read:
$0.2/M
Write:
—
—
—
03/11/2026
goo
gle/gemini-emb
edding-2
$0.20/M
+3 more
—
—
03/10/2026
xai
/grok-4.20-r
easoning
2M
1.0s
135tps
$2.00/M
$6.00/M
Read:
$0.2/M
Write:
—
—
—
03/09/2026
xai
/grok-4.20-non-r
easoning
2M
1.7s
95tps
$2.00/M
$6.00/M
Read:
$0.2/M
Write:
—
—
—
03/09/2026
xai
/grok-4.20-mul
ti-agent
2M
4.0s
1974tps
$2.00/M
$6.00/M
Read:
$0.2/M
Write:
—
—
—
03/09/2026
ope
nai
/gpt-5.4
1.1M
1.2s
52tps
$2.50/M
$15.00/M
Read:
$0.25/M
Write:
—
$10.00/K
+ input costs
—
03/05/2026
ope
nai/gpt
-5.4-pro
1.1M
$30.00/M
$180.00/M
$10/K
+ input costs
—
03/05/2026
goo
gle/gemini-3.1-flash-lite
-preview
1M
1.1s
293tps
$0.25/M
$1.50/M
Read:
$0.03/M
Write:
—
$14.00/K
+ input costs
—
03/03/2026
ope
nai/gpt-
5.3-chat
128K
0.7s
64tps
$1.75/M
$14.00/M
Read:
$0.17/M
Write:
—
$10.00/K
+ input costs
—
03/03/2026