GPT-4 Turbo was announced at OpenAI's first DevDay conference on November 6, 2023. It introduced a context window of 128K tokens, enough to hold more than 300 pages of text in a single prompt. OpenAI also pushed the knowledge cutoff to April 2023, a meaningful update for applications dealing with events from the first half of that year.
Two features made GPT-4 Turbo immediately practical for production integrations. JSON mode let developers reliably request structured JSON output, simplifying downstream parsing without brittle prompt engineering. Vision input enabled image analysis directly within the Chat Completions API. Passing a URL or base64-encoded image let the model generate captions, extract data from photographs, and interpret diagrams or documents with figures.
Pricing was also reduced compared to the original GPT-4, with lower rates on both input and output tokens. This made GPT-4-class reasoning accessible to a much wider range of applications and unlocked use cases where the economics of the original GPT-4 had been prohibitive.