Models/Llama 4 Maverick
MetaMeta / Llama 4 Maverick
Released: 4/5/2025
texttextimagetext
Input: $0.22 / Output: $0.88

Llama 4 Maverick is a multimodal LLM with 17 billion active parameters, built on a mixture-of-experts (MoE) architecture with 128 experts and a total of 400 billion parameters. It is designed for balanced, high-quality performance across text, image understanding, reasoning, coding, and multilingual tasks.

It excels in delivering strong results on chat, code generation, complex reasoning problems, long-context retention (up to one million tokens), and advanced image analysis—outperforming or matching leading models like GPT-4o and Gemini 2.0 Flash in many benchmarks. Its MoE design enables efficient inference by activating only the most relevant subset of experts per token.

Some other noteworthy features of Llama 4 Maverick include native support for both text and images as input (enabling document intelligence use cases), robust multilingual capabilities across at least a dozen languages, and suitability for enterprise-scale applications such as customer support bots that process screenshots or creative assistants that generate content from multimodal documents.

MetricValue
Parameter Count400 billion total / 17 billion active
Mixture of ExpertsYes
Active Parameter Count17 billion
Context Length1 million tokens
MultilingualYes
Quantized*No

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Meta models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInputOutputInputOutput
texttext$3.00$3.00
texttext$0.90$0.90
texttext$0.05$0.08
texttext$0.90$0.90
texttext$0.59$0.59
texttext$0.59$0.79
text, imagetext$0.22$0.88
texttext$0.20$0.20
texttext$0.08$0.30
See all models available on Oxen.ai