GPT-3.5 Turbo Instruct
GPT-3.5 Turbo Instruct is an instruction-tuned completion model designed for the legacy Completions endpoint, offering a direct prompt-in, text-out format suited to few-shot tasks, templated generation, and workflows that predate the chat message structure.
```typescript
import { streamText } from 'ai'

const result = streamText({
  model: 'openai/gpt-3.5-turbo-instruct',
  prompt: 'Why is the sky blue?',
})
```

Frequently Asked Questions
What is the key difference between GPT-3.5 Turbo and GPT-3.5 Turbo Instruct?
GPT-3.5 Turbo uses the Chat Completions endpoint with a messages array. GPT-3.5 Turbo Instruct uses the legacy Completions endpoint with a single prompt string, a fundamental structural difference that affects how you construct requests.
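The structural difference is easiest to see side by side. These are illustrative request bodies only, following the two shapes described above (a roles-based messages array versus a single prompt string):

```typescript
// Chat Completions: the conversation is a roles-based messages array.
const chatRequest = {
  model: 'gpt-3.5-turbo',
  messages: [
    { role: 'system', content: 'You are a concise assistant.' },
    { role: 'user', content: 'Why is the sky blue?' },
  ],
}

// Legacy Completions: the entire context is one prompt string.
const completionRequest = {
  model: 'gpt-3.5-turbo-instruct',
  prompt: 'You are a concise assistant.\n\nQ: Why is the sky blue?\nA:',
}
```

Anything that the chat format expresses as separate messages (system instructions, prior turns, the current question) must be flattened into that single string when targeting the Completions endpoint.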
Does GPT-3.5 Turbo Instruct support function calling or JSON mode?
No. Function calling and JSON mode are features of the Chat Completions API. GPT-3.5 Turbo Instruct targets the Completions endpoint and doesn't support these capabilities.
When would I choose the Completions format over Chat Completions?
Use it when your prompt structure works best as a single string with few-shot examples inline, when you're maintaining an existing integration, or when a direct prompt-response contract is semantically simpler than a roles-based message list.
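A minimal sketch of the inline few-shot pattern; the helper below is hypothetical, not part of any SDK, and simply assembles the single prompt string the Completions format expects:

```typescript
// Hypothetical helper: builds one prompt string with few-shot examples inline.
function buildFewShotPrompt(
  instruction: string,
  examples: Array<{ input: string; output: string }>,
  query: string,
): string {
  const shots = examples
    .map((ex) => `Input: ${ex.input}\nOutput: ${ex.output}`)
    .join('\n\n')
  // Trailing "Output:" cues the model to complete the final example.
  return `${instruction}\n\n${shots}\n\nInput: ${query}\nOutput:`
}

const prompt = buildFewShotPrompt(
  'Classify the sentiment as positive or negative.',
  [
    { input: 'I loved this product.', output: 'positive' },
    { input: 'Terrible experience.', output: 'negative' },
  ],
  'The shipping was fast and the quality is great.',
)
```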
Is GPT-3.5 Turbo Instruct suitable for code completion features?
Yes. Single-turn code completion and fill-in-the-middle tasks map naturally to the Completions format, and the model's instruction tuning makes it responsive to explicit directives within the prompt.
How do I access GPT-3.5 Turbo Instruct through AI Gateway?
Authenticate with an AI Gateway API key or OIDC token and route requests to the AI Gateway endpoint specifying this model's slug. No direct OpenAI credentials are required in your application.
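A sketch of a direct HTTP call along those lines; the base URL and environment variable name here are assumptions, so check your AI Gateway dashboard for the actual values:

```typescript
// Assumed gateway endpoint and env var — verify against your own setup.
const gatewayUrl = 'https://ai-gateway.vercel.sh/v1/completions'

const requestInit = {
  method: 'POST',
  headers: {
    Authorization: `Bearer ${process.env.AI_GATEWAY_API_KEY}`,
    'Content-Type': 'application/json',
  },
  // The gateway routes on the model slug; no OpenAI key appears anywhere.
  body: JSON.stringify({
    model: 'openai/gpt-3.5-turbo-instruct',
    prompt: 'Why is the sky blue?',
  }),
}
// await fetch(gatewayUrl, requestInit) // uncomment to send the request
```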
Can GPT-3.5 Turbo Instruct be used for multi-turn conversations?
Technically you can simulate turns by concatenating prior exchanges into a single prompt string, but the chat-format models handle multi-turn context more naturally and efficiently.
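The concatenation approach above can be sketched with a hypothetical helper that flattens prior turns into one prompt string:

```typescript
// Hypothetical helper: flattens prior exchanges into a single transcript
// prompt. Chat models express the same thing natively as a messages array.
type Turn = { speaker: 'User' | 'Assistant'; text: string }

function toPrompt(turns: Turn[]): string {
  const transcript = turns.map((t) => `${t.speaker}: ${t.text}`).join('\n')
  // Trailing "Assistant:" cues the model to produce the next reply.
  return `${transcript}\nAssistant:`
}

const conversationPrompt = toPrompt([
  { speaker: 'User', text: 'Why is the sky blue?' },
  { speaker: 'Assistant', text: 'Rayleigh scattering of sunlight.' },
  { speaker: 'User', text: 'And why are sunsets red?' },
])
```

Note that every new turn re-sends the entire transcript, so token cost grows with conversation length; this is the efficiency gap relative to the chat-format models.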
What are typical latency characteristics?
This page shows live throughput and time-to-first-token metrics measured across real AI Gateway traffic.