Buckets:

Duck63677
/

gpt-oss-120b-bucket

0 Bytes

31 files

Updated 2 days ago

Ctrl+K

Name	Size	Uploaded	Xet hash
metal		3 days ago	1 items
original		3 days ago	10 items
.gitattributes	1.52 kB xet	2 days ago	818ba6de
LICENSE	11.4 kB xet	3 days ago	b3141f8a
README.md	3.17 kB xet	2 days ago	3f22e45b
USAGE_POLICY	201 Bytes xet	3 days ago	d2f4077d
chat_template.jinja	16.7 kB xet	3 days ago	e81346c5
config.json	607 Bytes xet	2 days ago	52a81c75
diffusion_pytorch_model.bin	335 MB xet	2 days ago	c86c48f1
diffusion_pytorch_model.safetensors	335 MB xet	2 days ago	2ca0423e
generation_config.json	177 Bytes xet	3 days ago	601bdcb5
model-00000-of-00014.safetensors	4.63 GB xet	3 days ago	a0757e75
model-00001-of-00014.safetensors	4.12 GB xet	3 days ago	12f467f9
model-00002-of-00014.safetensors	4.63 GB xet	3 days ago	35fc9d08
model-00003-of-00014.safetensors	4.12 GB xet	3 days ago	dc46f56b
model-00004-of-00014.safetensors	4.63 GB xet	3 days ago	f7025b37
model-00005-of-00014.safetensors	4.12 GB xet	3 days ago	1355f3a2
model-00006-of-00014.safetensors	4.63 GB xet	3 days ago	a2881ce0
model-00007-of-00014.safetensors	4.06 GB xet	3 days ago	f1e7786f
model-00008-of-00014.safetensors	4.63 GB xet	3 days ago	8cfef1e4
model-00009-of-00014.safetensors	4.17 GB xet	3 days ago	7ed1e3ed
model-00010-of-00014.safetensors	4.63 GB xet	3 days ago	7cf4c11b
model-00011-of-00014.safetensors	4.12 GB xet	3 days ago	65f779c3
model-00012-of-00014.safetensors	4.06 GB xet	3 days ago	e0e7237c
model-00013-of-00014.safetensors	4.63 GB xet	3 days ago	8ce2f500
model-00014-of-00014.safetensors	4.12 GB xet	3 days ago	e0c98062
model.safetensors.index.json	54.5 kB xet	3 days ago	409bd42c
sdxl_vae.safetensors	335 MB xet	2 days ago	905e1d46
special_tokens_map.json	98 Bytes xet	3 days ago	135e0230
tokenizer.json	27.9 MB xet	3 days ago	c5584668
tokenizer_config.json	4.2 kB xet	3 days ago	a337618b

README.md

SDXL - VAE

How to use with 🧨 diffusers

You can integrate this fine-tuned VAE decoder to your existing diffusers workflows, by including a vae argument to the StableDiffusionPipeline

from diffusers.models import AutoencoderKL
from diffusers import StableDiffusionPipeline

model = "stabilityai/your-stable-diffusion-model"
vae = AutoencoderKL.from_pretrained("stabilityai/sdxl-vae")
pipe = StableDiffusionPipeline.from_pretrained(model, vae=vae)

Model

SDXL is a latent diffusion model, where the diffusion operates in a pretrained, learned (and fixed) latent space of an autoencoder. While the bulk of the semantic composition is done by the latent diffusion model, we can improve local, high-frequency details in generated images by improving the quality of the autoencoder. To this end, we train the same autoencoder architecture used for the original Stable Diffusion at a larger batch-size (256 vs 9) and additionally track the weights with an exponential moving average (EMA). The resulting autoencoder outperforms the original model in all evaluated reconstruction metrics, see the table below.

Evaluation

SDXL-VAE vs original kl-f8 VAE vs f8-ft-MSE

COCO 2017 (256x256, val, 5000 images)

Model	rFID	PSNR	SSIM	PSIM	Link	Comments

SDXL-VAE	4.42	24.7 +/- 3.9	0.73 +/- 0.13	0.88 +/- 0.27	https://huggingface.co/stabilityai/sdxl-vae/blob/main/sdxl_vae.safetensors	as used in SDXL
original	4.99	23.4 +/- 3.8	0.69 +/- 0.14	1.01 +/- 0.28	https://ommer-lab.com/files/latent-diffusion/kl-f8.zip	as used in SD
ft-MSE	4.70	24.5 +/- 3.7	0.71 +/- 0.13	0.92 +/- 0.27	https://huggingface.co/stabilityai/sd-vae-ft-mse-original/resolve/main/vae-ft-mse-840000-ema-pruned.ckpt	resumed with EMA from ft-EMA, emphasis on MSE (rec. loss = MSE + 0.1 * LPIPS), smoother outputs

Total size: 0 Bytes

Files: 31

Last updated: Jun 20

Pre-warmed CDN: US EU US EU

SDXL - VAE

How to use with 🧨 diffusers

Model

Evaluation

COCO 2017 (256x256, val, 5000 images)

Contributors