Qwen 2vl Flux

#Base Model

#FLUX

The original project can be found here: https://huggingface.co/Djrango/Qwen2vl-Flux

Qwen2vl-Flux is a state-of-the-art multimodal image generation model that enhances FLUX with Qwen2VL's vision-language understanding capabilities. This model excels at generating high-quality images based on both text prompts and visual references, offering superior multimodal understanding and control.

ComfyUI does not currently support this model, and there are no nodes available to load the CLIP+LLM portion into it.
This release is only for reviewing and testing the fine-tuned part of the Flux model.
Set CFG to 1 on the KSampler.
Rendering one image took 150 s on an 8 GB GPU at 512px and 10 steps with the bf16 model.
This model will be available in 3 formats, each named after the folder it should be placed in:
- diffusion_models - This one is in diffusers format; it is just the merged safetensors file from the HuggingFace page.
- checkpoints - This one has been converted to the Flux Transformers format and prefixed for stable_diffusion compatibility; it does not include CLIP or the VAE.
- unet - I will provide the q4_0 and q8 variants; leave a comment if you'd like to see any other quants.
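
The folder naming above can be sketched as a small lookup. This is only an illustration: the three folder names come from the list, while the variant keys, the `target_path` helper, the file name, and the assumption that the folders sit under `models/` in the ComfyUI install directory are placeholders to adjust for your own setup.

```python
from pathlib import Path

# Folder names are taken from the list above.
# The variant keys on the left are hypothetical labels used only in this sketch.
VARIANT_FOLDERS = {
    "diffusers": "diffusion_models",
    "checkpoint": "checkpoints",
    "gguf": "unet",
}


def target_path(comfy_root, variant, filename):
    """Return where a downloaded file of the given variant would be placed.

    Assumes the model folders live under <comfy_root>/models/.
    """
    try:
        folder = VARIANT_FOLDERS[variant]
    except KeyError:
        raise ValueError(f"unknown variant: {variant!r}") from None
    return Path(comfy_root) / "models" / folder / filename


if __name__ == "__main__":
    # "ComfyUI" and the file name are placeholders, not real paths.
    print(target_path("ComfyUI", "gguf", "example-q8_0.gguf"))
```

Running the script prints the destination for a GGUF file; swapping the variant key shows where the other two formats would go.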

azimuthalobserver

Announcements

2024-11-26

Model published

2024-11-26

Model information updated

Model details

Type

Checkpoint

Published

2024-11-26

Base model

Flux.1 D

Version notes

This file goes in the unet folder.
Load it with UNET Loader (GGUF).
Quantized from bf16 to q8_0.
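
For readers unfamiliar with q8_0, here is a rough sketch of the idea behind it: weights are grouped into blocks of 32, and each block is stored as one scale plus 32 signed 8-bit codes. This is not the tool that produced this file. The function names are hypothetical, the scale is kept as a plain Python float rather than the 16-bit value used in real GGUF files, and Python's `round` differs slightly from the C rounding at exact half values.

```python
BLOCK_SIZE = 32  # q8_0 groups weights into blocks of 32 values


def quantize_block_q8_0(values):
    """Quantize one block of floats into (scale, list of int8-range codes).

    Illustrative only: real GGUF files store the scale as a 16-bit float.
    """
    if len(values) != BLOCK_SIZE:
        raise ValueError(f"expected {BLOCK_SIZE} values, got {len(values)}")
    amax = max(abs(v) for v in values)
    scale = amax / 127.0
    if scale == 0.0:
        # An all-zero block needs no scaling.
        return 0.0, [0] * BLOCK_SIZE
    codes = [round(v / scale) for v in values]
    return scale, codes


def dequantize_block_q8_0(scale, codes):
    """Recover approximate floats from a quantized block."""
    return [scale * c for c in codes]


if __name__ == "__main__":
    block = [(i - 16) / 10.0 for i in range(BLOCK_SIZE)]
    scale, codes = quantize_block_q8_0(block)
    restored = dequantize_block_q8_0(scale, codes)
    worst = max(abs(a - b) for a, b in zip(block, restored))
    print(f"scale={scale:.6f} worst_error={worst:.6f}")
```

The per-value error is bounded by half the block scale, which is why q8_0 stays close to the bf16 original while roughly halving the file size.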

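License scope

Source: civitai

1. Reposted models are for learning and sharing only; copyright and the right of final interpretation belong to the original author.

2. If the original author wishes to claim the model, please contact SeaArt AI staff through official channels for verification. We are committed to protecting the rights of every creator.

Creative license scope

Online image generation

Merging

Downloading allowed

Commercial license scope

Generated images may be sold or used for commercial purposes

The model may be resold, or sold after merging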
