Models/Wan-AI/Wan2.1-T2V-1.3B-Diffusers
AlibabaAlibaba / Wan-AI/Wan2.1-T2V-1.3B-Diffusers
textvideo
$NaN

Wan-AI/Wan2.1-T2V-1.3B-Diffusers is a text-to-video diffusion model. It excels in generating 480P videos from text prompts efficiently on consumer-grade GPUs, requiring only 8.19GB of VRAM, while maintaining competitive video quality.

Some other noteworthy features of Wan-AI/Wan2.1-T2V-1.3B-Diffusers include multilingual support (English and Chinese), image-to-video conversion, aspect ratio control, visual text rendering inside videos, prompt enhancement, and the ability to add sound effects or background music to generated videos.

MetricValue
Parameter Count1.3 billion
Mixture of ExpertsNo
Context LengthUnknown
MultilingualYes
Quantized*No

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Alibaba models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInputOutputInputOutput
textvideoN/AN/A
textvideoN/AN/A
textvideoN/AN/A
textvideoN/AN/A
See all models available on Oxen.ai