A new programming model for durable execution

Vercel Workflows is the easiest way to write long-running, durable, reliable, and observable agents and backends.

The gap between prototypes and production-ready systems is huge. Code that's trivial to run locally falls apart the moment it needs to handle failures, restarts, and real traffic.

Framework-defined infrastructure solved this for web applications. When you deploy, Vercel infers the right configuration from the app itself. Workflows extends that model to long-running systems. Instead of managing a separate codebase for orchestration, durable workflows are an extension of your application code.

Since launching in beta in October 2025, Workflows has processed over 100 million runs and over 500 million steps across more than 1,500 customers, with more than 200K npm downloads every week.

Today, Vercel Workflows is generally available.

The built-in observability dashboard for Workflow SDK on Vercel.

Built for agents, backends, and long-running workloads

Workflows is built for any workload that doesn’t fit in a single request.

Agents: Deep integration with the AI SDK enables durable agents that run indefinitely, maintain state, call tools, and handle external events or interruptions. AI SDK v7 is taking this further with WorkflowAgent.

Backends: Workflow SDK proved this programming model in TypeScript codebases, and we're bringing it to a new language. The Workflow Python SDK is now in beta.

Long-running workloads: Workflows can be used for any function that needs to execute reliably, including multi-step onboarding flows, payment processing, ETL pipelines, or any backend work that would otherwise require you to wire up your own queues and retry logic.

How it works

Shipping a reliable long-running process to production typically means splitting your code across queues, workers, status tables, retry logic, and monitoring. Dedicated orchestration services add yet another layer with long-lived background processes you run in Kubernetes, scale horizontally, and dedicate engineering time to keep healthy. They are distributed systems you pay for on top of your core application compute.

Workflows eliminates the orchestrator entirely. All coordination runs in your application code, not a separate service. The infrastructure is built on three components:

Event log: records every step input, output, stream chunk, sleep, hook, and error in a run. It is the single source of truth for execution state and history.
Your functions on Fluid compute: each step runs as its own function invocation on Fluid compute. The workflow library inside each function handles dequeueing, state loading, encryption, execution, and handoff to the next step.
Vercel Queues: each function enqueues the next step automatically. Queues run automatically on Vercel, in your own Postgres, or in-memory locally.

Because there is no separate orchestration service, you only pay for the compute your steps actually use when functions are running.
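
To make the role of the event log concrete, here is a minimal in-memory sketch of replay-based durability. It illustrates the general technique and is not the Workflow SDK's implementation: EventLog, runStep, and createSiteWorkflow are hypothetical names, and a real log would be persisted and encrypted rather than held in an array.

```typescript
// Illustrative model only: an in-memory event log that lets a workflow
// function be re-invoked after a crash without repeating finished steps.
type StepEvent = { stepId: string; output: unknown };

class EventLog {
  private events: StepEvent[] = [];

  find(stepId: string): StepEvent | undefined {
    return this.events.find((e) => e.stepId === stepId);
  }

  append(event: StepEvent): void {
    this.events.push(event);
  }

  get length(): number {
    return this.events.length;
  }
}

// Run a step once; on later invocations return the recorded output instead.
async function runStep<T>(
  log: EventLog,
  stepId: string,
  fn: () => Promise<T>,
): Promise<T> {
  const recorded = log.find(stepId);
  if (recorded) return recorded.output as T;
  const output = await fn();
  log.append({ stepId, output });
  return output;
}

// Counts real executions so the replay behavior is observable.
const executions: Record<string, number> = {};
function track(name: string): void {
  executions[name] = (executions[name] ?? 0) + 1;
}

// Hypothetical three-step workflow; failAtBuild simulates a crash in the last step.
async function createSiteWorkflow(
  log: EventLog,
  failAtBuild: boolean,
): Promise<string> {
  const profile = await runStep(log, "fetchUserProfile", async () => {
    track("fetchUserProfile");
    return { name: "Ada" };
  });
  const plan = await runStep(log, "generateSitePlan", async () => {
    track("generateSitePlan");
    return `plan for ${profile.name}`;
  });
  return runStep(log, "buildSite", async () => {
    track("buildSite");
    if (failAtBuild) throw new Error("simulated crash");
    return `site built from ${plan}`;
  });
}
```

Calling createSiteWorkflow once with failAtBuild set to true and then again with false, against the same log, re-executes only the failed step. The first two steps are answered from the log.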

The programming model: your code is the orchestrator

Workflows lets you write long-running functions in TypeScript or Python using normal control flow and a small API surface.

In TypeScript you create a workflow with "use workflow", and isolate units of work with "use step". Workflows handles everything underneath: queues, retries, step isolation, observability, durable state, and streaming.

// db, callModel, and provisionSite stand in for your application's own helpers.

export async function createSite(input: { userId: string }) {
  "use workflow"
  // Each awaited call below runs as its own durable step.
  const profile = await fetchUserProfile(input.userId)
  const plan = await generateSitePlan(profile)
  const site = await buildSite(plan)
  return site
}

async function fetchUserProfile(userId: string) {
  "use step"
  return db.user.findUnique({ where: { id: userId } })
}

async function generateSitePlan(profile: unknown) {
  "use step"
  return callModel({ prompt: `Generate a site plan for ${JSON.stringify(profile)}` })
}

async function buildSite(plan: unknown) {
  "use step"
  return provisionSite(plan)
}

A workflow that creates a site in three durable steps. "use workflow" marks the function as a workflow, and "use step" isolates each unit of work with automatic retries, persistence, and observability.

At first glance, this looks like one function calling another, and that is exactly the point. Each step gets isolation, retries, persistence, observability, and durable continuation automatically. The orchestration lives in the application code, not in a separate system.

A Next.js app with the Workflow SDK installed runs the same way locally as it does in production at scale, with the same code, real guarantees, and no separate orchestration tooling to configure.

Vercel Workflows in action: Guillermo's infinite chess game

One of the best examples from beta testing is Guillermo's infinite chess game. It continuously pits models against each other, feeding them the current board state, validating moves, rendering the game, and running matches indefinitely, turn after turn.

Each chess match is a workflow run, and when a game ends, the last step starts a new run. Infinity is modeled as recursion across runs.

Because every workflow run is pegged to a specific deployment, one game can safely finish on the version it started with while the next game begins on the latest deployment. That creates a clean upgrade boundary where each game remains stable within its version, and every new game picks up the latest improvements.
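
The upgrade boundary can be pictured with a small model. This is a sketch under stated assumptions, not how Vercel tracks deployments: RunScheduler, deploy, and finishAndContinue are hypothetical names, and the deployment IDs are made up.

```typescript
// Illustrative model: each run is pinned to the deployment that started it,
// and the final step of a run starts the next run on the latest deployment.
type Run = { runId: number; deploymentId: string; finished: boolean };

class RunScheduler {
  private runs: Run[] = [];
  private latestDeployment: string;

  constructor(initialDeployment: string) {
    this.latestDeployment = initialDeployment;
  }

  // Shipping a new version changes only where future runs start.
  deploy(deploymentId: string): void {
    this.latestDeployment = deploymentId;
  }

  startRun(): Run {
    const run: Run = {
      runId: this.runs.length + 1,
      deploymentId: this.latestDeployment,
      finished: false,
    };
    this.runs.push(run);
    return run;
  }

  // The "last step" of a game: mark it done, then start a fresh run.
  finishAndContinue(run: Run): Run {
    run.finished = true;
    return this.startRun();
  }
}

const scheduler = new RunScheduler("dpl_v1");
const firstGame = scheduler.startRun();
scheduler.deploy("dpl_v2"); // ships while the first game is still in progress
const secondGame = scheduler.finishAndContinue(firstGame);
```

Here firstGame stays on dpl_v1 for its whole lifetime, while secondGame starts on dpl_v2.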

If the backend code powering the chess match crashes or encounters transient errors, the workflow run automatically retries it without causing the application to fail.
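
Conceptually, automatic retries are a loop around a step function. The sketch below shows that general pattern with hypothetical names (withRetries, maxAttempts, flakyValidateMove). The SDK's actual retry policy, backoff, and attempt limits are not described in this post.

```typescript
// Illustrative only: retry a failing step a bounded number of times.
async function withRetries<T>(
  step: () => Promise<T>,
  maxAttempts: number,
): Promise<{ value: T; attempts: number }> {
  let lastError: unknown;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    try {
      return { value: await step(), attempts: attempt };
    } catch (error) {
      // A durable system would also record the failure in its event log.
      lastError = error;
    }
  }
  // Every attempt failed: surface the last error to the caller.
  throw lastError;
}

// A hypothetical move validator that fails twice before succeeding.
let calls = 0;
async function flakyValidateMove(): Promise<string> {
  calls += 1;
  if (calls < 3) throw new Error("transient failure");
  return "e2e4 is legal";
}
```

Wrapping flakyValidateMove in withRetries with a budget of five attempts succeeds on the third call, so the two transient failures never reach the caller.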

Why Workflows matters for agents

Workflows is purpose-built for the agentic era. It is the only security-first, durable SDK made for building agents and made for agents to build with.

Secure by default

By default, Vercel Workflows encrypts all data, including step inputs, outputs, and stream chunks, before they leave your deployment. Nothing is readable in transit or at rest outside your environment, and decryption only happens inside the deployment running the workflow.

Encryption is built in and free, not a security add-on you configure after the fact. This is possible because Workflows owns both orchestration and execution in the same environment, so encryption can happen automatically without a separate service or any extra infrastructure on your end.

When you need to inspect encrypted data through the dashboard or workflow CLI, explicit decryption is supported with a full audit trail.

For building agents

Agents need more than longer timeouts. They need durable execution, reliable orchestration, resumable streams, and enough headroom to move large payloads through long-running systems.

Durable agents

Workflow SDK and AI SDK share a deep integration that gives agents durable execution, tool calling, state management, and the ability to handle external events or interruptions gracefully. Tools can be implemented as workflow steps for automatic retries or as regular workflow-level logic that uses primitives like sleep and hooks to suspend and resume cleanly. Agents process tool calls iteratively until completion, surviving restarts and failures along the way.

AI SDK v7 takes this further with WorkflowAgent, a fully native implementation.

Durable streams

Durable streams persist agent output. getWritable() gives you a persistent stream that multiple clients can connect to, disconnect from, reconnect to later, and resume from any point. The workflow keeps running even if the user closes the browser. When they come back, the client reconnects and continues exactly where the stream left off, no Redis or custom pub/sub required.

In this example, a flight booking agent streams itinerary updates as it plans a trip and searches for flights:

import { DurableAgent } from "@workflow/ai/agent";
import { getWritable } from "workflow";
import type { UIMessage, UIMessageChunk } from "ai";

export async function bookingAgent(messages: UIMessage[]) {
  "use workflow";
  // Persistent stream that clients can connect to and resume from.
  const writable = getWritable<UIMessageChunk>();
  const agent = new DurableAgent({
    model: "anthropic/claude-haiku-4.5",
    system: "You are a flight booking assistant.",
    tools: { searchFlights },
  });
  await agent.stream({ messages, writable });
}

// Tool implemented as a step. It returns a fixed placeholder result.
async function searchFlights(args: { from: string; to: string }) {
  "use step";
  const result = { flights: [{ from: args.from, to: args.to, price: 199 }] };
  return result;
}

A booking agent that streams durable output to the client. Tools marked with "use step" get automatic retries and persistence.

The API route starts the workflow and returns the durable stream to the client. The run ID in the response header is what enables reconnection:

// app/api/chat/route.ts
import { createUIMessageStreamResponse, type UIMessage } from "ai";
import { start } from "workflow/api";
import { bookingAgent } from "@/workflows/booking-agent";

export async function POST(req: Request) {
  const { messages }: { messages: UIMessage[] } = await req.json();
  // Start a run and hand its durable stream back to the client.
  const run = await start(bookingAgent, [messages]);
  return createUIMessageStreamResponse({
    stream: run.readable,
    // The client uses this header to reconnect to the same run.
    headers: { "x-workflow-run-id": run.runId },
  });
}

The API route starts the workflow and returns the durable stream. The run ID enables reconnection.

If the user closes the browser mid-search, the workflow keeps running. When they reconnect, or share the link with a different user who opens the session, WorkflowChatTransport resumes the stream from the last event the client received.

"use client";
import { useChat } from "@ai-sdk/react";
import { WorkflowChatTransport } from "@workflow/ai";
export function BookingChat() {
  const { messages, sendMessage } = useChat({
    transport: new WorkflowChatTransport(),
  });
  // ...
}

The client reads the stream and resumes from where it left off if the connection is interrupted.
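
One way to picture resumption is an append-only chunk log that each reader consumes from its own position. The following is a simplified in-memory model with hypothetical names (DurableStream, readFrom). It is not how getWritable() or WorkflowChatTransport are implemented.

```typescript
// Illustrative model of a resumable stream: chunks are appended to a log,
// and each reader resumes from the index it last saw.
class DurableStream<T> {
  private chunks: T[] = [];

  write(chunk: T): void {
    this.chunks.push(chunk);
  }

  // Return every chunk at or after startIndex, plus the next index to request.
  readFrom(startIndex: number): { chunks: T[]; nextIndex: number } {
    const chunks = this.chunks.slice(startIndex);
    return { chunks, nextIndex: startIndex + chunks.length };
  }
}

const stream = new DurableStream<string>();
stream.write("Searching flights");
stream.write("Found 3 options");

// The client reads what exists so far, then disconnects.
const firstRead = stream.readFrom(0);

// The workflow keeps writing while nobody is connected.
stream.write("Comparing prices");
stream.write("Itinerary ready");

// On reconnect, the client asks only for what it has not seen yet.
const secondRead = stream.readFrom(firstRead.nextIndex);
```

Because the log outlives any single connection, a second client could also call readFrom(0) and replay the whole session from the start.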

Hooks and sleep

Workflows can suspend without consuming any compute while they wait.

Hooks let a workflow wait for an external trigger to resume. Hooks are useful for building human-in-the-loop approval flows and integrating with third-party services.

Sleep lets a workflow suspend for any specified amount of time, from minutes to days or months. Sleep is useful for email drip campaigns and date-sensitive use cases.
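
As a mental model, a hook behaves like a promise that an external caller resolves later. The sketch below uses hypothetical names (HookRegistry, wait, resume, refundWorkflow) and keeps the pending promise in memory. In the real system the run is suspended and its state lives in the event log, so nothing stays running while it waits.

```typescript
// Illustrative model of a hook: the workflow awaits a promise that an
// external caller resolves later by token.
class HookRegistry<T> {
  private pending = new Map<string, (payload: T) => void>();

  // Called from the workflow: stays pending until someone resumes the token.
  wait(token: string): Promise<T> {
    return new Promise<T>((resolve) => {
      this.pending.set(token, resolve);
    });
  }

  // Called from outside, for example a webhook handler.
  // Returns false when no workflow is waiting on the token.
  resume(token: string, payload: T): boolean {
    const resolve = this.pending.get(token);
    if (!resolve) return false;
    this.pending.delete(token);
    resolve(payload);
    return true;
  }
}

type Approval = { approved: boolean };

// A human-in-the-loop flow: nothing happens until a reviewer responds.
async function refundWorkflow(
  hooks: HookRegistry<Approval>,
  orderId: string,
): Promise<string> {
  const decision = await hooks.wait(`approve:${orderId}`);
  return decision.approved ? "refund issued" : "refund rejected";
}
```

A reviewer's click would translate into a call such as hooks.resume("approve:42", { approved: true }), which lets the waiting workflow continue. Sleep can be thought of the same way, with a timer rather than a person supplying the resume.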

Limits built for multimodal agents

Workflows supports 50 MB per step payload and up to 2 GB across an entire run, with generous event limits. That's plenty of headroom for agents passing images, video, and large context across long execution chains.
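
To get a feel for these limits, a rough budget check might look like the sketch below. The function and constant names are hypothetical. The post does not say whether the limits use decimal or binary units, so binary units are assumed here.

```typescript
// Limits quoted in the text. The 1024-based interpretation is an assumption.
const MB = 1024 * 1024;
const MAX_STEP_BYTES = 50 * MB;
const MAX_RUN_BYTES = 2 * 1024 * MB;

type BudgetCheck = { ok: boolean; reason?: string };

// Check a planned sequence of step payload sizes against both limits.
function checkRunBudget(stepPayloadBytes: number[]): BudgetCheck {
  let total = 0;
  for (let i = 0; i < stepPayloadBytes.length; i++) {
    const size = stepPayloadBytes[i];
    if (size > MAX_STEP_BYTES) {
      return { ok: false, reason: `step ${i} exceeds the per-step limit` };
    }
    total += size;
    if (total > MAX_RUN_BYTES) {
      return { ok: false, reason: `run exceeds the total limit at step ${i}` };
    }
  }
  return { ok: true };
}
```

Under these assumptions, a run that passes a 45 MB image through every step stays within the per-step limit but exhausts the run budget on its 46th step, since 46 × 45 MB = 2,070 MB.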

For agents to build with

Workflows is not just great for building agents. It is also designed for coding agents to use directly.

The programming model is agent-friendly

Workflows code is ordinary TypeScript. A workflow is a function, a step is a function, and because orchestration lives in the application code itself, a coding agent only has to reason about one system. There is no separate orchestration layer to configure and no worker fleet to manage.

Full observability from the CLI

Workflow SDK ships with a CLI that any agent can use for inspecting and debugging your workflow runs. If a human can inspect runs in the dashboard, an agent can inspect the same workflow state from the terminal.

npx workflow inspect runs <wrun_id>

This works locally with no config. For production deployments on Vercel, the CLI reuses your vercel CLI authentication with --backend vercel to make authenticated requests against the Vercel API. Agents can investigate state, inspect runs, and debug behavior without leaving the terminal.

Workflows ships with a skill

Agents can install the Workflows skill directly and use it to scaffold, debug, and manage workflows without hand-written product knowledge.

npx skills add vercel/workflow

Run Workflows anywhere

Workflow SDK is open source and part of the same family as AI SDK and Chat SDK. The workflow npm package goes stable at GA with 200K+ weekly downloads and 75+ releases shipped during beta.

Worlds are the adapter system that makes Workflows portable. Each World provides the three components a workflow needs (an event log, compute, and a queue), backed by different infrastructure.

Managed: Vercel handles everything automatically. Deploy your app and Vercel Workflows runs on Fluid compute with Vercel Queues, zero-config E2E encryption, and built-in observability.

Self-hosted: Run Workflows on your own infrastructure. We maintain a Postgres reference implementation that real customers run in production, and the Local World ships built in for development.

Embedded: The SDK is not locked to one runtime, and the community is already building more Worlds, including adapters for MongoDB, Redis, Turso, Jazz Cloud, and Cloudflare. We'll continue to support anyone who wants to build Worlds.
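
The adapter idea can be sketched as an interface plus interchangeable backends. The World interface below is a hypothetical, synchronous simplification for illustration and does not reproduce the SDK's actual World contract. The in-memory version plays the role a local development World would, with the current process acting as compute.

```typescript
// Hypothetical shape of a World: names and signatures are illustrative only.
type WorldEvent = { runId: string; type: string; data: unknown };
type QueueMessage = { runId: string; stepName: string };

interface World {
  appendEvent(event: WorldEvent): void;
  listEvents(runId: string): WorldEvent[];
  enqueue(message: QueueMessage): void;
  dequeue(): QueueMessage | undefined;
}

// Simplest possible backend: arrays in the current process.
class InMemoryWorld implements World {
  private events: WorldEvent[] = [];
  private queue: QueueMessage[] = [];

  appendEvent(event: WorldEvent): void {
    this.events.push(event);
  }

  listEvents(runId: string): WorldEvent[] {
    return this.events.filter((e) => e.runId === runId);
  }

  enqueue(message: QueueMessage): void {
    this.queue.push(message);
  }

  dequeue(): QueueMessage | undefined {
    return this.queue.shift();
  }
}

// Code written against the interface does not care which World backs it.
function drain(world: World): string[] {
  const executed: string[] = [];
  let message = world.dequeue();
  while (message) {
    world.appendEvent({
      runId: message.runId,
      type: "step_completed",
      data: message.stepName,
    });
    executed.push(message.stepName);
    message = world.dequeue();
  }
  return executed;
}
```

Swapping InMemoryWorld for a class backed by Postgres or another store would leave drain unchanged, which is the portability the adapter system is meant to provide.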

How Vercel customers use Workflows

Mux powers durable video and AI pipelines

Mux built their media intelligence layer on Workflows, handling execution, retries, and orchestration for AI inference across complex video pipelines. They also shipped "use workflow" and "use step" directives inside their own @mux/ai SDK, so any developer can npm install @mux/ai and get a durable, multi-step pipeline as a normal imported function.

The real complexity isn't the models, it's everything around them. Workflow SDK took a big chunk of that off our plate so we could focus on building the product.
Dylan Jhaveri, Director of Self Service @ Mux

Durable runs hundreds of AI agents for 3 million small businesses

Durable's most critical path is website creation: dozens of parallel AI steps orchestrated by Vercel Workflows to deliver a complete site in under 30 seconds. Their small dev team ripped out their self-hosted infrastructure entirely and rewrote on Workflows with 160+ directives across 75 files.

Workflow gave our six engineers the ability to ship production AI infrastructure without ever hiring for infra.
Osama Kahn, CPTO @ Durable

Flora orchestrates creative AI agents across 50+ image models

Flora runs their entire media generation pipeline on Vercel Workflows, orchestrating 50+ image models with no queues, no state machines, and no separate service. They use rollbacks for credit refunds, recursive workflows for user-defined multi-step pipelines, and getWritable for progress streaming. Their customers kick off jobs, close their laptop, and come back to completed results.

We stopped having infrastructure debates and started having product debates.
Alec Jo, Head of Applied AI @ FLORA

What's next: Workflows 5 and Python support

Workflows 5

Workflows 4 focused on getting the developer experience and SDK model right. Workflows 5 keeps the same programming model while pushing harder on performance and runtime efficiency. Here’s what we’re cooking:

Native concurrency controls, including a lock primitive for coordinating work across multiple runs
Globally deployed Workflows infrastructure
A snapshot-based runtime to reduce replay overhead as event histories grow
Better bundling and stronger Next.js integration

Our goal is to make the overhead of opting into Workflows smaller and smaller until it is the obvious default for any project. Install workflow@beta to try Workflows 5 early (and share feedback on GitHub).

Python support

The Python SDK is now in beta, making Workflows available across the broader AI and backend ecosystem. Here's a quick taste of how the opening example in this post looks in Python:

import json

from vercel.workflow import Workflows

wf = Workflows()

# db, call_model, and provision_site stand in for your application's own helpers.

@wf.workflow
async def create_site(*, user_id):
    # Steps declare keyword-only parameters, so arguments are passed by name.
    profile = await fetch_user_profile(user_id=user_id)
    plan = await generate_site_plan(profile=profile)
    return await build_site(plan=plan)

@wf.step
async def fetch_user_profile(*, user_id):
    return await db.user.find_unique(id=user_id)

@wf.step
async def generate_site_plan(*, profile):
    return await call_model(
        prompt=f'Generate a site plan for {json.dumps(profile)}'
    )

@wf.step
async def build_site(*, plan):
    return await provision_site(plan)

Get started

Durability, reliability, observability, security, and streaming should be part of your product from the beginning, not chores you take on months later.

When you use Workflows, we take on the complexity so you can focus on what makes your app unique. Learn more about Vercel Workflows or visit the Workflow SDK docs to get started.

Vercel Workflows is generally available on April 16.
