Details
Related
META4 - Helm
v0.4 BETA - Final
META4
v0.3 BETA
v0.2 BETA
v0.1 BETA
Qwen-Image ????

Qwen-Image ????

585
39
62
#concept
#Qwen-Image
#qwen-image lora

Still, FP8 + Lightning 8-steps Lora is recommended. If you don't like the Fluxy look, use the DPM++ series with Karras (more steps and a higher CFG are required).

End-of-Life

I guess I've learned enough about Qwen-Image, so further testing feels redundant. This repo will not receive any further uploads or attention. I captioned all images and released them as a sign of thanks for testing the beta releases. I hope it helps you, as it was a great resource for me. Good luck!

  • 6,537 captioned images sourced from CivitAI.

  • Prompts used for generating images (useless for Qwen-Image).

  • Captions in vulgar and profane style are in the Captions folder.

Caption Example:

This is a digital illustration showing a ??????? intense and ???????? scene in a movie theater. A muscular dude with short brown hair is sitting behind a blue-haired chick, who's wearing a blue dress and white sandals, and he's ??????? her ???????????. His ???? is huge and it's ??????????? a ??????? lot, with ??? dripping down her ????? and onto the seat. He's got one hand over her mouth, and she's looking surprised with wide eyes. The background shows other dudes sitting in red theater seats, looking bored or distracted. The camera angle is straight-on, focusing on the ??????? action. The dude's hand is gripping her tight.

Prompt Example:

score_9, score_8_up, score_7_up, absurd_res, hi_res, anime_source, (big man sitting on chair in movie_theater, cute girl on lap:1.1), open pants, hug, ????, ???????????, stealth_???, ???,sundress, smug, surprised, exhausted, rolling_eyes, covering_mouth, detailed face, intricate details, hyperdetailed, very aesthetic, motion_lines, <lora:NAI Smooth Boys Style SDXL_LoRA_20r_20e_8i_nr32_a16_Pony Diffusion V6 XL:0.5> <lora:Concept Art Twilight Style SDXL_LoRA_Pony Diffusion V6 XL:0.7>

META4 - Helm [Plz read]

  • Helm needs to be used with META-4 (Strength 0.6) + Helm (Strength 1).

  • Qwen-Image doesn't respond well to Booru tags.

This is in line with other BETA releases to figure out how to deal with anime (low detail) and realism (high detail). Qwen-Image is not specialized as illustrious for anime, so much of the anime actions need to be done via LoRA. Although, if you're crazy enough, you can do it via prompt only.

META-4 will improve ???? details to some degree (Still in BETA, not perfect, but better than what Qwen presents).

  • Helm needs to be used with META-4 (Strength 0.6) + Helm (Strength 1). Merge it with META-4 using your own settings, following the provided code in the article (link in META-4 description).

  • Trained on 139 randomly picked images (very limited) with no moderation for testing purposes. therefore, it doesn't satisfy an anime enthusiast right away.

  • 7 epochs, 1000 steps, LR 0.0003 (to see if META-4 can act as a refiner).

  • A dataset with 2 caption variations (Tags, Vulgar) is provided in case you're interested.

  • If you make one, please ping me.

Datset source: https://civitai.com/models/1215490/helm-nikke-sdxl-lora-illustrious-or-3-outfits

Last BETA Releases:

META-4

Please read the article related to META-4

https://civitai.com/articles/18798/qwen-image-????-lora-notes

This version is a linear merge with tuned weights from four releases, each focused on a specific aspect of the training. While it is still far from perfect, it can be useful in some cases.

DO NOT MERGE version 0.4 with the other releases. Overfitting issues. Overfitting occurs when a model learns the training data too well.

v0.4 BETA

  • I experimented with the learning rate to determine exactly where overfitting will occur.

  • There is a better skin tone, but signs of overfitting, as well as bad or deformed ?????????, will occur more often than in v0.3.

v0.3 BETA

Experimented with a more friendly and maintainable prompting style. Use one or a combination of them:

  • Descriptive Style: "A photo-realistic shoot from above featuring a woman in a provocative pose on a bed..."

  • SDXL Tag-Based Style: "1girl, long hair, ???????, looking at viewer, open mouth."

  • Segmentation Style:

  • ??? Acts: Penetration, ??????? intercourse.

  • ?????? Positions: ???????????????????.

  • Male ?????????: Large, ?????, dark-skinned, circumcised, with visible veins.

This BETA is all about prompting and testing the results. I've removed anime images from the dataset to save time and resources and to speed up the process.

Next, I'll focus on the details and finding a way to eliminate the current issues with ???????.

v0.2 BETA

Experimented to find a sweet spot for more detailed ?????????

  • Used Qwen-Image captioning style (This means you need a detailed description of what you want).

  • The focus was on experimentation rather than quality, hence the BETA.

  • New auto-generated realistic images were used. Extreme sizes were spotted, but I didn't filter them out.

Pro: Better output compared to BETA 1.

Con: I did a few tests, and writing a wall of prompts is not maintainable. However, Qwen-Image is detail-hungry, otherwise, it takes over, and in the case of ???? content, we don't want the model's influence.

Next: I'll try mixing Danbooru tags with descriptive captioning, focusing on vulgar slang, and using a better dataset.

v0.1 BETA

This LoRA is primarily trained on Civitai images for experimenting with Qwen-Image LoRA training. 80% of the dataset consists of anime-based images, while the remaining images are semi-realistic, which will likely dominate the output. (mostly vertical sizes)

Using FP8 with 8 steps Lightning LoRA generates acceptable results. All images in the showcase are the best from two batches

Based on the tests I've conducted, the results are promising. This indicates that we don't have the same level of censorship as Flux.

Prompt Guide: I used Joy Caption, Stable Diffusion style of captioning. Example:
[Update: Upon further testing, it turned out that using the SD style for captioning was a bad idea. I will try a different approach in the next beta.]
"""
????, digital painting, close-up, girl with green eyes, black hair in two buns, red halter top, ?????????????, hand grabbing her right ??????, ?????? exposed, gold necklace, light skin, subtle blush, camera angle from below, looking up, soft lighting, realistic style, detailed shading, hand on ??????, suggestive, hand touching ??????, ?????? grab, hand on ??????, upper body, focused on face and ???????, red halter top, bouncy hair, soft texture, high detail, hand on ??????, realistic shading, realistic style, soft lighting, subtle blush, looking up, gold necklace, realistic eyes, halter top, realistic ???????, realistic skin, realistic lighting, hand on ??????, detailed shading, high detail, soft texture
"""

View Translation

Rating & Review

-- /5
0 Ratings

Not enough ratings or reviews received yet

no-data
No data available
S
Chat with the model
Notice
2025-08-25
Publish Model
2025-08-31
Update Model Info
Model Details
Type
LORA
Publish Time
2025-08-31
Base Model
Qwen-Image
Version Introduction

meta4 as a semi-refiner

License Scope
Model Source: civitai

1. The rights to reposted models belong to original creators.

2. Original creators should contact SeaArt.AI staff through official channels to claim their models. We are committed to protecting every creator's rights. Click to Claim

Creative License Scope
Online Image Generation
Merge
Allow Downloads
Commercial License Scope
Sale or Commercial Use of Generated Images
Resale of Models or Their Sale After Merging
QR Code
Download SeaArt App
Continue your AI creation journey on mobile devices