Details
Related
base_fxied_vae_V2_fp16
refiner_fixed_vae_V2_fp16
Base_fixedvae_V1_fp16
SDXL_fixedvae_fp16( Watermark)

SDXL_fixedvae_fp16( Watermark)

10.5K
538
3.2K
#basemodel
#SDXL
#basemodel
#SDXL

This is merge model for:

1. 100% stable-diffusion-xl-base-1.0 and 100% stable-diffusion-xl-refine-1.0

https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0

https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0

2. sdxl-vae-fp16-fix

https://huggingface.co/madebyollin/sdxl-vae-fp16-fix

you can use this directly or finetune.

same license on stable-diffusion-xl-base-1.0

same vae license on sdxl-vae-fp16-fix

SDXL-VAE-FP16-Fix

SDXL-VAE-FP16-Fix is the SDXL VAE, but modified to run in fp16 precision without generating NaNs.

VAEDecoding in float32 / bfloat16 precisionDecoding in float16 precisionSDXL-VAE✅⚠️SDXL-VAE-FP16-Fix✅

Details

SDXL-VAE generates NaNs in fp16 because the internal activation values are too big:

SDXL-VAE-FP16-Fix was created by finetuning the SDXL-VAE to:

1. keep the final output the same, but

2. make the internal activation values smaller, by

3. scaling down weights and biases within the network

There are slight discrepancies between the output of SDXL-VAE-FP16-Fix and SDXL-VAE, but the decoded images should be close enough for most purposes.

Benchmark from here:by Kubuxu

https://huggingface.co/madebyollin/sdxl-vae-fp16-fix/discussions/7

Evaluation on COCO val-2017, 256x256, RandomCrop with padding
Metrics:
LPIPS:
https://github.com/richzhang/PerceptualSimilarity/ (lower better) and structural similarity index measure via skimage.metrics (higher better)
Metrics given as: mean [79% credibility interval]

View Translation

Rating & Review

5.0 /5
0 Ratings

Not enough ratings or reviews received yet

no-data
No data available
B
bdsqlsz
41
2.1K
Chat with the model
Notice
2023-07-27
Publish Model
2023-08-11
Update Model Info
Model Details
Type
Checkpoint
Publish Time
2023-07-30
Base Model
SDXL 1.0
Version Introduction

Improved decoder weights

* Further-reduced risk of NaNs
* Further-reduced discrepancies with original SDXL-VAE (0.9) decoder

Encoder weights are unchanged.

License Scope
Model Source: civitai

1. The rights to reposted models belong to original creators.

2. Original creators should contact SeaArt.AI staff through official channels to claim their models. We are committed to protecting every creator's rights. Click to Claim

Creative License Scope
Online Image Generation
Merge
Allow Downloads
Commercial License Scope
Sale or Commercial Use of Generated Images
Resale of Models or Their Sale After Merging
QR Code
Download SeaArt App
Continue your AI creation journey on mobile devices