Qwen-Image API, Deployment, Pricing

Qwen/Qwen-Image

Qwen-Image is an image generation foundation model released by the Alibaba Qwen team, featuring 20 billion parameters. The model has achieved significant advances in complex text rendering and precise image editing, excelling particularly at generating images with high-fidelity Chinese and English text. Qwen-Image can handle multi-line layouts and paragraph-level text while maintaining layout coherence and contextual harmony in the generated images. Beyond its superior text-rendering capabilities, the model supports a wide range of artistic styles, from photorealistic scenes to anime aesthetics, adapting fluidly to various creative prompts. It also possesses powerful image editing and understanding abilities, supporting advanced operations such as style transfer, object insertion or removal, detail enhancement, text editing, and even human pose manipulation, aiming to be a comprehensive foundation model for intelligent visual creation and manipulation where language, layout, and imagery converge

API Usage

curl --request POST \
  --url https://api.siliconflow.com/v1/images/generations \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "batch_size": 1,
  "num_inference_steps": 20,
  "guidance_scale": 7.5,
  "model": "Qwen/Qwen-Image",
  "prompt": "an island near sea, with seagulls, moon shining over the sea, light house, boats int he background, fish flying over the sea"
}'

Details

Model Provider

Qwen

Type

image

Sub Type

text-to-image

Publish Time

Sep 15, 2025

Price

$

undefined

/ Image

Tags

MoE,235B,128K

Compare with Other Models

See how this model stacks up against others.

FLUX 1.1 [pro]

FLUX 1.1 [pro]

FLUX1.1 Pro is an enhanced text-to-image model built on the FLUX.1 architecture, offering improved composition, detail, and rendering speed. With better visual consistency and artistic fidelity, it's suitable for illustration, creative content generation, and e-commerce visual assets—delivering diverse styles with strong prompt alignment.

FLUX 1.1 [pro]

FLUX 1.1 [pro] Ultra

FLUX 1.1 [pro] Ultra

FLUX1.1 Pro Ultra is the high-resolution version of FLUX1.1 Pro, capable of generating images up to 4 megapixels (2K resolution). It improves photo realism and prompt controllability for advanced use cases. The Ultra mode is optimized for composition and precision, while Raw mode prioritizes natural textures and realism—ideal for commercial visual production, art direction, and realistic concept rendering.

FLUX 1.1 [pro] Ultra

FLUX.1 Kontext [max]

FLUX.1 Kontext [max]

FLUX.1 Kontext Max is the most powerful and feature-rich model in the Kontext series, designed for high-resolution, high-precision visual editing and generation. It offers superior prompt adherence, detailed rendering, and advanced typographic control. Ideal for enterprise design systems, marketing visuals, and automated creative pipelines that require robust scene transformations and layout control.

FLUX.1 Kontext [max]

FLUX.1 Kontext [pro]

FLUX.1 Kontext [pro]

FLUX.1 Kontext Pro is an advanced image generation and editing model that supports both natural language prompts and reference images. It delivers high semantic understanding, precise local control, and consistent outputs, making it ideal for brand design, product visualization, and narrative illustration. It enables fine-grained edits and context-aware transformations with high fidelity.

FLUX.1 Kontext [pro]

Model FAQs: Usage, Deployment

Learn how to use, fine-tune, and deploy this model with ease.

Ready to accelerate your AI development?

Ready to accelerate your AI development?

© 2025 SiliconFlow Technology PTE. LTD.

© 2025 SiliconFlow Technology PTE. LTD.

© 2025 SiliconFlow Technology PTE. LTD.