详情

Qwen 2vl Flux

880

175

#基础模型

#FLUX

Original Project found here: https://huggingface.co/Djrango/Qwen2vl-Flux

Qwen2vl-Flux is a state-of-the-art multimodal image generation model that enhances FLUX with Qwen2VL's vision-language understanding capabilities. This model excels at generating high-quality images based on both text prompts and visual references, offering superior multimodal understanding and control.

ComfyUI currently doesn't support and there is no available nodes to load the CLIP+LLM portion into it
This is just for reviewing/testing the finetuned trained part of the Flux model
CFG set to 1 on KSampler
Rendered an image in 150s using 8GB GPU @ 512px / 10 steps using the bf16 model
This model comes will be available in 3 formats named after the folder it should be in
- diffusion_models - This one is in diffusers format, it is just the merged safetensors file from HuggingFace page
- checkpoints - This one has been converted to Flux Transformers format and prefix for stable_diffusion compatibility, does not include CLIP and VAE
- unet - I will provide the q4_0 and q8 variants, make a comment if you'd like to see any other quants

查看译文

评分与评论

-- /5

0个评分

尚未收到足够的评分或评论

暂无数据

azimuthalobserver

与模型对话

公告

2024-11-26

发布模型

2024-11-26

更新模型信息

模型详情

类型

Checkpoint

发布时间

2024-11-26

基础模型

Flux.1 D

版本介绍

This file goes in the unet folder
Loaded with UNET Loader (GGUF)
Quantsized from bf16 to q8_0

许可范围

来源: civitai

1. 转载模型仅供学习与交流分享，其版权及最终解释权归原作者。

2. 模型原作者如需认领模型，请通过官方渠道联系海艺AI工作人员进行认证。我们致力于保护每一位创作者的权益。点击去认领

创作许可范围

在线生图

进行融合

允许下载

商业许可范围

生成图片可出售或用于商业目的

允许模型转售或融合后出售

下载SeaArt App

在移动端继续你的AI创作之旅