INTELLECT 3
INTELLECT 3 is a 100B+ Mixture-of-Experts language model from Prime Intellect. Through AI Gateway it runs with a context window of 131.1K tokens at $0.20 input and $1.10 output per million tokens. Model type: chat.
import { streamText } from 'ai'
const result = streamText({ model: 'prime-intellect/intellect-3', prompt: 'Why is the sky blue?'})Frequently Asked Questions
What architecture does INTELLECT 3 use?
It's a 100B+ Mixture-of-Experts model built on the GLM-4.5-Air base. MoE routes each input to a subset of expert parameters, so inference cost stays lower than a dense model with the same total parameter count.
How do I authenticate requests to INTELLECT 3 through AI Gateway?
Set your API key in your AI Gateway project settings. Use
prime-intellect/intellect-3in API calls. AI Gateway handles provider routing, retries, and failover across parasail. Try requests in the playground: https://ai-sdk.dev/playground/parasail:parasail-primeintellect3.How does INTELLECT 3 compare to larger models on benchmarks?
Prime Intellect published benchmark results for this size class on math, code, science, and reasoning. MoE activates a subset of experts per forward pass, which changes the cost profile versus dense models.