Qwen3 Embedding 4B

Qwen3 Embedding 4B is a mid-tier, 4-billion-parameter text embedding model. It produces 2560-dimensional vectors, accepts a context of up to 32.8K tokens, and is designed for multilingual semantic search and code retrieval, balancing quality against operational cost.

index.ts

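import { embed } from 'ai';

// Request a single embedding for one input string.
const result = await embed({
  model: 'alibaba/qwen3-embedding-4b',
  value: 'Sunny day at the beach',
});

// result.embedding is a number[]; per the description above it should have 2560 entries.
console.log(result.embedding.length);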
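For semantic search, embedding vectors like the one returned above are commonly compared with cosine similarity. The following is a minimal sketch of that step, not part of the model's API: `cosineSimilarity`, `rankBySimilarity`, and the `Doc` shape are hypothetical helpers, and the toy 3-dimensional vectors stand in for real 2560-dimensional embeddings.

```typescript
// Hypothetical helper: cosine similarity between two equal-length vectors.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error('Vectors must have the same length');
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // Treat a zero vector as having no similarity to anything.
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Hypothetical document shape: an id plus a precomputed embedding.
interface Doc {
  id: string;
  embedding: number[];
}

// Score every document against the query and sort best match first.
function rankBySimilarity(
  query: number[],
  docs: Doc[],
): { id: string; score: number }[] {
  return docs
    .map((d) => ({ id: d.id, score: cosineSimilarity(query, d.embedding) }))
    .sort((x, y) => y.score - x.score);
}

// Toy vectors for illustration only; real embeddings would come from embed().
const toyQuery = [1, 0, 0];
const toyDocs: Doc[] = [
  { id: 'tax-form', embedding: [0, 1, 0] },
  { id: 'beach', embedding: [0.9, 0.1, 0] },
];
const ranked = rankBySimilarity(toyQuery, toyDocs);
console.log(ranked.map((r) => r.id));
```

In practice the document embeddings would be computed once and stored, with only the query embedded at search time.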


Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider	Context	Input	Output	Release Date
DeepInfra	33K	$0.02/M	—	06/05/2025
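The listed input price of $0.02 per million tokens can be turned into a rough cost estimate. The sketch below assumes billing is strictly proportional to input tokens at that rate; `estimateInputCostUSD` and the token counts are hypothetical, and actual billing may differ.

```typescript
// Listed input price for DeepInfra, in USD per one million tokens.
const PRICE_PER_MILLION_TOKENS_USD = 0.02;

// Hypothetical helper: estimate input cost for a given number of tokens,
// assuming simple proportional pricing with no minimums or rounding.
function estimateInputCostUSD(
  tokens: number,
  pricePerMillion: number = PRICE_PER_MILLION_TOKENS_USD,
): number {
  if (tokens < 0) {
    throw new Error('Token count must be non-negative');
  }
  return (tokens / 1_000_000) * pricePerMillion;
}

// Example: embedding 10,000 documents of roughly 500 tokens each.
const totalTokens = 10_000 * 500;
const estimatedCost = estimateInputCostUSD(totalTokens);
console.log(`~$${estimatedCost.toFixed(2)} for ${totalTokens} tokens`);
```

At this rate, a corpus of five million tokens comes to roughly ten cents.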
