Dettagli
Raccomandazioni
Qwen_Img_Edit_fp8_e4m3fn
Qwen-Image-Edit Full BF16
Qwen-Image-Lighting-4step
qwen_2.5_vl_7b_fp8_scaled
Qwen-Image-Edit VAE
Qwen-Image-Edit

Qwen-Image-Edit

9.5K
83
35
#Base Model
#Qwen

We are excited to introduce Qwen-Image-Edit, the image editing version of Qwen-Image. Built upon our 20B Qwen-Image model, Qwen-Image-Edit successfully extends Qwen-Image’s unique text rendering capabilities to image editing tasks, enabling precise text editing. Furthermore, Qwen-Image-Edit simultaneously feeds the input image into Qwen2.5-VL (for visual semantic control) and the VAE Encoder (for visual appearance control), achieving capabilities in both semantic and appearance editing.

Key Features:

  • Semantic and Appearance Editing: Qwen-Image-Edit supports both low-level visual appearance editing (such as adding, removing, or modifying elements, requiring all other regions of the image to remain completely unchanged) and high-level visual semantic editing (such as IP creation, object rotation, and style transfer, allowing overall pixel changes while maintaining semantic consistency).

  • Precise Text Editing: Qwen-Image-Edit supports bilingual (Chinese and English) text editing, allowing direct addition, deletion, and modification of text in images while preserving the original font, size, and style.

  • Strong Benchmark Performance: Evaluations on multiple public benchmarks demonstrate that Qwen-Image-Edit achieves state-of-the-art (SOTA) performance in image editing tasks, establishing it as a powerful foundation model for image editing.

Showcase

One of the highlights of Qwen-Image-Edit lies in its powerful capabilities for semantic and appearance editing. Semantic editing refers to modifying image content while preserving the original visual semantics. To intuitively demonstrate this capability, let's take Qwen's mascot—Capybara—as an example:

As can be seen, although most pixels in the edited image differ from those in the input image (the leftmost image), the character consistency of Capybara is perfectly preserved. Qwen-Image-Edit's powerful semantic editing capability enables effortless and diverse creation of original IP content. Furthermore, on Qwen Chat, we designed a series of editing prompts centered around the 16 MBTI personality types. Leveraging these prompts, we successfully created a set of MBTI-themed emoji packs based on our mascot Capybara, effortlessly expanding the IP's reach and expression.

Moreover, novel view synthesis is another key application scenario in semantic editing. As shown in the two example images below, Qwen-Image-Edit can not only rotate objects by 90 degrees, but also perform a full 180-degree rotation, allowing us to directly see the back side of the object:

Another typical application of semantic editing is style transfer. For instance, given an input portrait, Qwen-Image-Edit can easily transform it into various artistic styles such as Studio Ghibli. This capability holds significant value in applications like virtual avatar creation:

In addition to semantic editing, appearance editing is another common image editing requirement. Appearance editing emphasizes keeping certain regions of the image completely unchanged while adding, removing, or modifying specific elements. The image below illustrates a case where a signboard is added to the scene. As shown, Qwen-Image-Edit not only successfully inserts the signboard but also generates a corresponding reflection, demonstrating exceptional attention to detail.

Below is another interesting example, demonstrating how to remove fine hair strands and other small objects from an image.

Additionally, the color of a specific letter "n" in the image can be modified to blue, enabling precise editing of particular elements.

Appearance editing also has wide-ranging applications in scenarios such as adjusting a person's background or changing clothing. The three images below demonstrate these practical use cases respectively.

Another standout feature of Qwen-Image-Edit is its accurate text editing capability, which stems from Qwen-Image's deep expertise in text rendering. As shown below, the following two cases vividly demonstrate Qwen-Image-Edit's powerful performance in editing English text:

Qwen-Image-Edit can also directly edit Chinese posters, enabling not only modifications to large headline text but also precise adjustments to even small and intricate text elements.

Finally, let's walk through a concrete image editing example to demonstrate how to use a chained editing approach to progressively correct errors in a calligraphy artwork generated by Qwen-Image:

In this artwork, several Chinese characters contain generation errors. We can leverage Qwen-Image-Edit to correct them step by step. For instance, we can draw bounding boxes on the original image to mark the regions that need correction, instructing Qwen-Image-Edit to fix these specific areas. Here, we want the character "稽" to be correctly written within the red box, and the character "亭" to be accurately rendered in the blue region.

However, in practice, the character "稽" is relatively obscure, and the model fails to correct it correctly in one step. The lower-right component of "稽" should be "旨" rather than "日". At this point, we can further highlight the "日" portion with a red box, instructing Qwen-Image-Edit to fine-tune this detail and replace it with "旨".

Isn't it amazing? With this chained, step-by-step editing approach, we can continuously correct character errors until the desired final result is achieved.

Finally, we have successfully obtained a completely correct calligraphy version of Lantingji Xu (Orchid Pavilion Preface)! In summary, we hope that Qwen-Image-Edit can further advance the field of image generation, truly lower the technical barriers to visual content creation, and inspire even more innovative applications.

License Agreement

Qwen-Image-Edit is licensed under Apache 2.0.

Original Text and Models: https://huggingface.co/Qwen/Qwen-Image-Edit

Visualizza la traduzione

Valutazioni e recensioni

5.0 /5
0 valutazioni

Non ancora ricevute valutazioni o commenti sufficienti

no-data
Nessun dato disponibile
avatar
theally
23
620
Conversazione con il modello
Annuncio
2025-08-20
Pubblicare modello
2025-08-21
Aggiorna le informazioni del modello
Dettagli modello
Tipo
Checkpoint
Data di pubblicazione
2025-08-20
Modello Base
Qwen-Edit
Introduzione alla versione

qwen_image_edit_fp8_e4m3fn.safetensors Comfy Org Repack

Ambito di Licenza
Model Source: civitai

1. I diritti dei modelli ripubblicati appartengono ai creatori originali.

2. I creatori originali che desiderano reclamare il proprio modello devono contattare il personale di SeaArt AI tramite canali ufficiali. Clicca per rivendicare

Ambito di Licenza di Creazione
Immagini Online
Effettua una Fusione
Consenti Download
Licenza Commerciale
Le immagini generate possono essere vendute o utilizzate per scopi commerciali
Consenti la rivendita del modello o la vendita dopo la fusione
QR Code
Scarica l'app SeaArt
Continua il tuo viaggio di creazione AI su mobile