Skip to content

Step 3.7 Flash

StepFun’s flagship multimodal reasoning model. Powered by a 198B-parameter / 11B-activation sparse MoE architecture, with native support for image and video understanding.

ReasoningVision (Image)Tool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'stepfun/step-3.7-flash',
prompt: 'Why is the sky blue?'
})

More models by StepFun

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
262K
0.2s
49tps
$0.09/M$0.30/M
Read:
Write:$0.02/M
deepinfra logo
01/29/2026