상세 정보

Qwen 2vl Flux

870

175

#기본 모델

#FLUX

Original Project found here: https://huggingface.co/Djrango/Qwen2vl-Flux

Qwen2vl-Flux is a state-of-the-art multimodal image generation model that enhances FLUX with Qwen2VL's vision-language understanding capabilities. This model excels at generating high-quality images based on both text prompts and visual references, offering superior multimodal understanding and control.

ComfyUI currently doesn't support and there is no available nodes to load the CLIP+LLM portion into it
This is just for reviewing/testing the finetuned trained part of the Flux model
CFG set to 1 on KSampler
Rendered an image in 150s using 8GB GPU @ 512px / 10 steps using the bf16 model
This model comes will be available in 3 formats named after the folder it should be in
- diffusion_models - This one is in diffusers format, it is just the merged safetensors file from HuggingFace page
- checkpoints - This one has been converted to Flux Transformers format and prefix for stable_diffusion compatibility, does not include CLIP and VAE
- unet - I will provide the q4_0 and q8 variants, make a comment if you'd like to see any other quants

번역문 보기

평점 및 리뷰

-- /5

0 개의 평점

충분한 평가나 댓글을 받지 못했습니다.

데이터 없음

azimuthalobserver

모델과 대화하기

공고

2024-11-26

모델 게시

2024-11-26

모델 정보 업데이트

모델 상세정보

유형

Checkpoint

게시 날짜

2024-11-26

기본 모델

Flux.1 D

버전 소개

This file goes in the unet folder
Loaded with UNET Loader (GGUF)
Quantsized from bf16 to q8_0

허가 범위

모델 출처: civitai

1. 재게시된 모델의 권리는 원 제작자에게 있습니다.

2. 모델 원작자가 모델을 인증받으려면 공식 채널을 통해 SeaArt.AI 직원에게 문의하세요. 저희는 모든 창작자의 권리를 보호하기 위해 노력합니다. 인증하러 이동

창작 허가 범위

온라인 생방송

혼합 진행

다운로드 허용

상업적 허가 범위

생성된 이미지를 판매하거나 상업적 목적으로 사용 가능

모델의 재판매 또는 융합 후 판매 허용

SeaArt 앱 다운로드

모바일에서 AI 창작 여정을 계속하세요