This repository contains a model that generates highly aesthetic images of resolution 1024x1024. You can use the model with Hugging Face 🧨 Diffusers.

Playground v2 is a diffusion-based text-to-image generative model. The model was trained from scratch by the research team at Playground.
Images generated by Playground v2 are favored 2.5 times more than those produced by Stable Diffusion XL, according to Playground’s user study.
We are thrilled to release [intermediate checkpoints](#intermediate-base-models) at different training stages, including evaluation metrics, to the community. We hope this will encourage further research into foundational models for image generation.
Lastly, we introduce a new benchmark, MJHQ-30K, for automatic evaluation of a model’s aesthetic quality.
Please see our blog for more details.
- Developed by: Playground
- Model type: Diffusion-based text-to-image generative model
- License: Playground v2 Community License
Install diffusers >= 0.24.0 and some dependencies:
pip install transformers accelerate safetensorsTo use the model, run the following snippet.
Note: It is recommend to use `guidance_scale=3.0`.
from diffusers import DiffusionPipeline
import torch
pipe = DiffusionPipeline.from_pretrained(
"playgroundai/playground-v2-1024px-aesthetic",
torch_dtype=torch.float16,
use_safetensors=True,
add_watermarker=False,
variant="fp16"
)
pipe.to("cuda")
prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt=prompt, guidance_scale=3.0).images[0]In order to use the model with software such as Automatic1111 or ComfyUI you can use playground-v2.fp16.safetensors file.
1. 轉载模型僅供學習與交流分享,其版權及最終解释權归原作者。
2. 模型原作者如需認領模型,請通過官方渠道联系海藝AI工作人員進行認證。我們致力於保護每一位創作者的權益。 點擊去認領
