Qwen / qwen-image
Released: 8/27/2025qwen-image is a Large Vision Model (LVM) that excels in high-fidelity text rendering across images—particularly for both English and Chinese scripts—as well as general image generation, supporting a wide range of artistic styles and precise image editing tasks. It is especially effective for scenarios requiring seamless text integration into images and advanced image manipulation beyond basic adjustments.
Some other noteworthy features of qwen-image include support for image understanding tasks such as object detection, semantic segmentation, depth estimation, novel view synthesis, and super-resolution, making it a comprehensive tool for intelligent visual creation and manipulation.
Metric | Value |
---|---|
Parameter Count | 20 billion |
Mixture of Experts | No |
Context Length | Unknown |
Multilingual | Yes |
Quantized* | No |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
Qwen models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Input | Output | Input | Output | |
text | text | $0.90 | $0.90 | ||
text | text | $0.22 | $0.88 | ||
text | text | $0.90 | $0.90 | ||
text | text | $2.00 | $2.00 | ||
image | image | N/A | N/A | ||
image | image | N/A | N/A | ||
text | text | N/A | N/A | ||
text | text | $0.90 | $0.90 | ||
image | text | $0.90 | $0.90 | ||
text | text | $0.45 | $1.80 | ||
text | text | $0.90 | $0.90 |