Alibaba / Wan-AI/Wan2.1-T2V-14B-Diffusers
Released: 2/25/2025
Modality: text → video
Price: $0.24

Wan-AI/Wan2.1-T2V-14B-Diffusers is a 14B-parameter Large Vision Model (LVM) designed for high-fidelity text-to-video and image-to-video generation. It can also render readable English and Chinese text within generated videos.

It excels at generating temporally consistent videos from detailed prompts, offers customizable aspect ratios, and remains stable at both 480p and 720p resolutions on consumer-grade hardware.

Other noteworthy features include prompt enhancement for improved video quality and precision, an inspiration mode for artistic visual enrichment, sound effects generation, and efficient video encoding via a variational autoencoder (VAE).
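As a rough illustration of the workflow described above, here is a minimal sketch of generating a clip at 480p or 720p with the Hugging Face diffusers library. It assumes a recent diffusers release that ships WanPipeline and AutoencoderKLWan, plus a CUDA GPU with enough memory. The helper names (resolution_for, generate_video), the preset table, and the frame count, guidance scale, and fps values are illustrative assumptions, not settings documented on this page.

```python
# Assumed (width, height) presets for the two resolutions mentioned above.
RESOLUTIONS = {"480p": (832, 480), "720p": (1280, 720)}


def resolution_for(preset: str) -> tuple[int, int]:
    """Return (width, height) for a named preset; raise ValueError if unknown."""
    try:
        return RESOLUTIONS[preset]
    except KeyError:
        raise ValueError(
            f"unknown preset {preset!r}; expected one of {sorted(RESOLUTIONS)}"
        ) from None


def generate_video(prompt: str, preset: str = "480p", out_path: str = "output.mp4") -> str:
    """Hypothetical helper: render one clip from a text prompt and save it."""
    # Heavy imports are deferred so the module can be imported without a GPU stack.
    import torch
    from diffusers import AutoencoderKLWan, WanPipeline
    from diffusers.utils import export_to_video

    model_id = "Wan-AI/Wan2.1-T2V-14B-Diffusers"
    width, height = resolution_for(preset)

    # The VAE is kept in float32 while the rest of the pipeline runs in bfloat16.
    vae = AutoencoderKLWan.from_pretrained(
        model_id, subfolder="vae", torch_dtype=torch.float32
    )
    pipe = WanPipeline.from_pretrained(model_id, vae=vae, torch_dtype=torch.bfloat16)
    pipe.to("cuda")

    # num_frames and guidance_scale are assumed starting values, not tuned ones.
    frames = pipe(
        prompt=prompt,
        height=height,
        width=width,
        num_frames=81,
        guidance_scale=5.0,
    ).frames[0]

    export_to_video(frames, out_path, fps=16)
    return out_path
```

Calling generate_video("A cat walks on the grass, realistic", preset="720p") would download the model weights on first use, so expect a long initial run.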

Metric              Value
Parameter Count     14 billion
Mixture of Experts  No
Context Length      Unknown
Multilingual        Yes
Quantized*          No

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
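To give a sense of why the quantization level matters, the following back-of-the-envelope sketch estimates the memory needed just to hold 14 billion parameters at a few common precisions. The precision names and byte sizes are general assumptions, not provider-specific figures. The estimate covers weights only and ignores activations and any additional pipeline components.

```python
# Parameter count taken from the table above.
PARAM_COUNT = 14_000_000_000

# Assumed storage cost per parameter for common precisions.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(precision: str, params: int = PARAM_COUNT) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return params * BYTES_PER_PARAM[precision] / 1e9


for precision in BYTES_PER_PARAM:
    print(f"{precision}: ~{weight_memory_gb(precision):.0f} GB")
```

Under these assumptions, unquantized bf16 weights need roughly 28 GB, while a 4-bit variant from another provider would need about 7 GB.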
