Models/Wan-AI/Wan2.2-T2V-A14B-Diffusers
AlibabaAlibaba / Wan-AI/Wan2.2-T2V-A14B-Diffusers
Released: 7/28/2025
textvideo
$NaN

Wan-AI/Wan2.2-T2V-A14B-Diffusers is a large vision model (LVM) designed for text-to-video generation using a Mixture-of-Experts (MoE) diffusion architecture.

It excels in generating high-quality, professional-grade 480p and 720p videos from text prompts, leveraging 14 billion active parameters per inference step for detailed and coherent video synthesis while maintaining efficient GPU memory usage.

Some other noteworthy features of Wan-AI/Wan2.2-T2V-A14B-Diffusers include support for image-to-video generation and the ability to handle advanced creative storytelling, marketing content, and filmmaking pre-visualization tasks.

MetricValue
Parameter Count27 billion (14B active)
Mixture of ExpertsYes
Active Parameter Count14 billion
Context LengthUnknown
MultilingualUnknown
Quantized*No

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Alibaba models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInputOutputInputOutput
textvideoN/AN/A
textvideoN/AN/A
textvideoN/AN/A
textvideoN/AN/A
See all models available on Oxen.ai