Skip to content

INTELLECT 3

INTELLECT 3 is a 100B+ Mixture-of-Experts language model from Prime Intellect. Through AI Gateway it runs with a context window of 131.1K tokens at $0.20 input and $1.10 output per million tokens. Model type: chat.

ReasoningTool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'prime-intellect/intellect-3',
prompt: 'Why is the sky blue?'
})

Frequently Asked Questions

  • What architecture does INTELLECT 3 use?

    It's a 100B+ Mixture-of-Experts model built on the GLM-4.5-Air base. MoE routes each input to a subset of expert parameters, so inference cost stays lower than a dense model with the same total parameter count.

  • How do I authenticate requests to INTELLECT 3 through AI Gateway?

    Set your API key in your AI Gateway project settings. Use prime-intellect/intellect-3 in API calls. AI Gateway handles provider routing, retries, and failover across parasail. Try requests in the playground: https://ai-sdk.dev/playground/parasail:parasail-primeintellect3.

  • How does INTELLECT 3 compare to larger models on benchmarks?

    Prime Intellect published benchmark results for this size class on math, code, science, and reasoning. MoE activates a subset of experts per forward pass, which changes the cost profile versus dense models.