Step 3.7 Flash

Step 3.7 Flash is StepFun's flagship multimodal reasoning model, a sparse MoE with 11B active parameters and native image and video understanding. It supports a context window of 256K tokens and a max output of 256K tokens per request.

Implicit Caching, Reasoning, Tool Use, Vision (Image)

index.ts

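import { streamText } from 'ai'

// Request a streamed completion from Step 3.7 Flash.
const result = streamText({
  model: 'stepfun/step-3.7-flash',
  prompt: 'Why is the sky blue?'
})

// Print the response text as it arrives.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}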


More models by StepFun

Model	Context	Latency	Throughput	Input	Output	Cache Read	Cache Write	Web Search	Release Date
stepfun/step-3.5-flash	262K	1.3s	150 tps	$0.09/M	$0.30/M	$0.02/M	—	—	01/29/2026
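
To make the per-million-token prices concrete, here is a minimal sketch that estimates the cost of one request to stepfun/step-3.5-flash from the figures listed above. The `Usage` shape and the `estimateCostUsd` helper are hypothetical names introduced for illustration. The sketch also assumes that cached input tokens are billed at the cache read rate rather than the regular input rate, which this page does not state explicitly.

```typescript
// Prices listed above for stepfun/step-3.5-flash, in USD per million tokens.
const PRICE_PER_MILLION = {
  input: 0.09,
  output: 0.30,
  cacheRead: 0.02
} as const

// Hypothetical usage record; field names are illustrative only.
interface Usage {
  inputTokens: number // total prompt tokens, including cached ones
  cachedInputTokens: number // portion of inputTokens served from the cache
  outputTokens: number
}

// Assumption: cached input tokens are charged at the cache read rate
// instead of the regular input rate.
function estimateCostUsd(usage: Usage): number {
  const uncachedInputTokens = usage.inputTokens - usage.cachedInputTokens
  const totalPerMillion =
    uncachedInputTokens * PRICE_PER_MILLION.input +
    usage.cachedInputTokens * PRICE_PER_MILLION.cacheRead +
    usage.outputTokens * PRICE_PER_MILLION.output
  return totalPerMillion / 1_000_000
}

// Example: a 200K-token prompt with 150K tokens cached and a 4K-token reply.
const example = estimateCostUsd({
  inputTokens: 200_000,
  cachedInputTokens: 150_000,
  outputTokens: 4_000
})
console.log(`Estimated cost: $${example.toFixed(4)}`)
```

Treat the result as a rough estimate: actual billing may differ by provider and does not account for any per-query or web search charges.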
