Meta released Llama 3.3 70B Instruct on December 6, 2024 as the final model in its 2024 Llama release cadence. The 3.3 70B is text-only, but it represents a targeted refinement of the 70B tier. Llama 3.3 70B Instruct delivers similar performance to the 3.1 405B at a fraction of the serving cost.
The core improvements center on instruction following and multilingual capability. Instruction following (the model's ability to accurately execute detailed or constrained directions) is one of the most important capabilities in production deployments where system prompts encode complex behavioral rules. The multilingual improvements matter for enterprise applications serving global audiences: better handling of non-English instructions reduces the engineering overhead of maintaining separate language-specific prompts.
Llama Stack, which Meta standardized throughout 2024 as a set of interfaces for RAG and agentic applications, is fully compatible with the 3.3 70B. Teams already using Llama Stack distributions for toolchain orchestration can upgrade to the 3.3 generation without rearchitecting their integration layer.