This repository contains a model that generates highly aesthetic images of resolution 1024x1024. You can use the model with Hugging Face 🧨 Diffusers.

Playground v2 is a diffusion-based text-to-image generative model. The model was trained from scratch by the research team at Playground.
Images generated by Playground v2 are favored 2.5 times more than those produced by Stable Diffusion XL, according to Playground’s user study.
We are thrilled to release [intermediate checkpoints](#intermediate-base-models) at different training stages, including evaluation metrics, to the community. We hope this will encourage further research into foundational models for image generation.
Lastly, we introduce a new benchmark, MJHQ-30K, for automatic evaluation of a model’s aesthetic quality.
Please see our blog for more details.
- Developed by: Playground
- Model type: Diffusion-based text-to-image generative model
- License: Playground v2 Community License

Install diffusers >= 0.24.0 and some dependencies:

pip install "diffusers>=0.24.0" transformers accelerate safetensors

To use the model, run the following snippet.
Note: It is recommended to use `guidance_scale=3.0`.
from diffusers import DiffusionPipeline
import torch

pipe = DiffusionPipeline.from_pretrained(
    "playgroundai/playground-v2-1024px-aesthetic",
    torch_dtype=torch.float16,
    use_safetensors=True,
    add_watermarker=False,
    variant="fp16",
)
pipe.to("cuda")

prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt=prompt, guidance_scale=3.0).images[0]

To use the model with software such as Automatic1111 or ComfyUI, use the `playground-v2.fp16.safetensors` file.
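As background on the `guidance_scale` argument used in the snippet: in Diffusers text-to-image pipelines this value is the classifier-free guidance weight, which blends an unconditional noise prediction with a prompt-conditioned one. The sketch below illustrates that blending on toy numbers only. `apply_guidance` and the sample values are hypothetical and are not part of the Diffusers API; the real pipeline applies the same kind of formula to tensors internally at each denoising step.

```python
from typing import List


def apply_guidance(uncond: List[float], cond: List[float], guidance_scale: float) -> List[float]:
    """Blend unconditional and prompt-conditioned predictions (hypothetical helper)."""
    if len(uncond) != len(cond):
        raise ValueError("predictions must have the same length")
    # Start from the unconditional prediction and move toward (or past)
    # the conditioned one by a factor of guidance_scale.
    return [u + guidance_scale * (c - u) for u, c in zip(uncond, cond)]


# Toy numbers standing in for the model's two noise predictions.
uncond = [0.0, 1.0, -2.0]
cond = [1.0, 1.0, 0.0]

print(apply_guidance(uncond, cond, 1.0))  # scale 1.0 returns the conditioned prediction
print(apply_guidance(uncond, cond, 3.0))  # scale 3.0 extrapolates past it
```

A scale of 1.0 reproduces the conditioned prediction, and larger values push the result further in the direction the prompt suggests. Higher scales generally follow the prompt more closely at some cost to variety, which is why a moderate value such as the recommended 3.0 is a reasonable starting point.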
