Skip to content

SERHANT.'s playbook for rapid AI iteration

5 min read

Link to headingImpact at a glance

  • Started with Next.js on Vercel, which made it easier to expand to a React Native iOS app without rebuilding their backend

  • Engineers focus on AI design and iteration instead of platform plumbing

  • Orchestrates OpenAI, Claude, and Gemini by task to optimize cost vs output

  • Scaled from an internal pilot to 800–900+ real estate agents without replatforming

When Jeremy Bunting joined SERHANT. as VP of Engineering in February 2024, S.MPLE was already showing promise. 200 real estate agents were piloting the AI product, which was designed to save time by automating cumbersome and repetitive daily tasks, like market analysis and contact management.

S.MPLE was a Next.js progressive web app deployed on Vercel, and that foundation gave the team leverage. They could keep the API layer steady while expanding the client experience, including expansion to a React Native iOS app, all without a backend rebuild.

But Bunting had a problem that keeps many engineering leaders up at night: the AI landscape changes faster than most teams can implement infrastructure updates.

The team needed to move fast, scale confidently, and stay flexible enough to swap models, add new capabilities, and adapt to the rapidly changing AI landscape. Traditional approaches meant choosing between velocity and flexibility, but Bunting wanted both.

Link to headingAI SDK: Moving fast without vendor lock-in

As S.MPLE shifted from "one model” experiments to a production AI product, Bunting's team started evaluating Vercel's AI SDK, and he initially had concerns. "I asked, how much is this going to tie us in directly to Vercel?" he recalls.

Then one of his engineers pushed back. The AI SDK wasn't infrastructure lock-in, it was infrastructure independence. “It's just an SDK that abstracts away the complexity of working with different model providers”, the engineer pointed out.

Bunting also realized that if the team picked one frontier model and built tightly around it, every future change would come with a rewrite, with no clean path to fallback when reliability or cost shifted. With AI SDK, iteration meant simple configuration changes, not feature overhauls. "We are building agentic tools," said Bunting. "Having that consistent abstraction layer... really reduces the cognitive load."

We are building agentic tools. Having that consistent abstraction layer across the app for our developers really reduces the cognitive load of understanding how to work with each of the models and services.
Jeremy Bunting, VP of Engineering @ SERHANT.

AI Gateway added another layer of leverage: consolidated visibility into usage across apps and prototypes, even when teams bring their own keys. The result is faster debugging, faster optimization, and a clearer feedback loop on cost.

Link to headingUsing multiple models to balance cost, speed, and complexity

Because the SERHANT. S.MPLE team was not spending its time rebuilding infrastructure or maintaining one-off AI integrations, they could shift their attention to testing models against real product tasks and choosing the right tool for each job:

  • Claude Sonnet for complex, accuracy-critical analysis like comparative market analysis, where strong structured-data reasoning matters

  • Claude Haiku for lightweight intent and field-filling tasks where speed matters

  • OpenAI models for conversational voice and general chat behaviors

  • Gemini for image generation, browser automation, and computer-use workflows where reliability and speed are the priority

They are also experimenting with “models as guardrails” to validate or critique outputs, and with caching strategies to rein in token spend as usage grows.

Link to headingWorry-free scale: Adding users and assets

The value of their stack decisions became clear when S.MPLE launched publicly. "We moved from being an internal pilot program to more than 900 users without a lot of worry on infrastructure or scale," Bunting says. The API layer didn't require a single change, and Fluid compute handled the increase in workloads automatically.

That seamless scale matters because SERHANT. operates at a content generation pace that would break most systems. "SERHANT. generates about 35% more content than the top five brokerages combined," Bunting notes. Between property videos, listing descriptions, marketing materials, and now AI-generated assets, the volume is staggering.

Greg Parsons, Technical Director on the S.MPLE team, said that AI Gateway gives him visibility across their platform that wasn’t possible before. "We can gain insight into all of the disparate applications we are building across the business," Parsons explained.

We can gain insight into all of the disparate applications we are building across the business, from early prototypes to fully launched products, by bringing everything into a single view. This gives us unified visibility into token usage, API calls, and overall system output, all within one UI.
Greg Parsons, Technical Director @ SERHANT.

Link to headingWhat’s next: from linear workflows to conversational AI agents

S.MPLE began with linear workflows: real estate agents would trigger a single action, it would run end-to-end then return the result.

But real-world workflows are more complex and users want to execute multiple tasks in a single run, like producing all of the assets needed to market a property listing. Bunting’s team is now building toward conversational experiences where humans can run agents, steer and correct mid-flight, and combine multiple “recipes” into a single request.

It is a shift from one-off automations to a coordinated network of specialized agents, built to evolve as fast as the AI ecosystem itself.

Link to headingFuture-proofing an unpredictable landscape

Greg Chan, SERHANT.’s CTO, sees flexibility as the point. "In AI, things are evolving fast. What it looks like now is different than even three-to-six months ago. And it’ll be different months from now," Chan says.

For Chan, the win is that the team can keep building inside the ecosystem instead of rewriting the stack every time the world changes. "The last thing we want is to rebuild our stack every time a new model drops," he said.

The last thing we want is to rebuild our stack every time a new model drops. Flexibility with Vercel means we can adapt instantly and keep our engineers focused on building real competitive advantage for our agents.
Greg Chan, CTO @ SERHANT.

About SERHANT.

SERHANT. is an AI-native real estate and media company, and is the most-followed real estate brand in the world. 

Founded in 2020 by top real estate broker and entrepreneur Ryan Serhant, SERHANT. brings together brokerage, media, and education, with proprietary technology to revolutionize how properties are marketed, sold, and experienced. 

SERHANT. sells residential, commercial, luxury, and new development properties nationally through its specialized divisions, including SERHANT. Signature for high-net-worth clients and SERHANT. New Development, which delivers end-to-end branding, marketing, and sales for ground-up residential projects.

Powered by S.MPLE, SERHANT.’s proprietary AI platform, agents are empowered with real-time data and workflow automation across listings, transactions, and marketing to deliver faster, smarter, and more impactful results while saving time.  Award-winning SERHANT. Studios produces original content across social and streaming platforms, while SellIt.com is the company’s digital education hub, engaging members globally in more than 130 countries.

Learn more at serhant.com.