QwenQwen / qwen-image
Released: 8/27/2025
imageimage
$0.03

qwen-image is a Large Vision Model (LVM) that excels in high-fidelity text rendering across images—particularly for both English and Chinese scripts—as well as general image generation, supporting a wide range of artistic styles and precise image editing tasks. It is especially effective for scenarios requiring seamless text integration into images and advanced image manipulation beyond basic adjustments.

Some other noteworthy features of qwen-image include support for image understanding tasks such as object detection, semantic segmentation, depth estimation, novel view synthesis, and super-resolution, making it a comprehensive tool for intelligent visual creation and manipulation.

MetricValue
Parameter Count20 billion
Mixture of ExpertsNo
Context LengthUnknown
MultilingualYes
Quantized*No

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Qwen models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInputOutputInputOutput
texttext$0.90$0.90
texttext$0.22$0.88
texttext$0.90$0.90
texttext$2.00$2.00
imageimageN/AN/A
imageimageN/AN/A
texttextN/AN/A
texttext$0.90$0.90
imagetext$0.90$0.90
texttext$0.45$1.80
texttext$0.90$0.90
See all models available on Oxen.ai