



I gathered 15 images of Selene from Underworld. I am only using Birme to crop the HD photos instead of using Faceswap to align the faces. Some of the images were full body as I wanted to retain her face even when zoomed out. I used Blip captioning to generate the filewords and edited each individually to reduce potential hallucinations.
I used 0.005:100,0.0025:250,0.001:500,0.0005 for my learning rate. I am going for 5K training steps total. I am using a batch size of 1 with Gradient Accumulation Steps set to 3. I am running on a RTX 4090 on the cloud. I am using 12.5 out of 24 GB. The estimated time of completion is 1.5 hours. For the embedding I am using 5 vectors per token. I switched to SD 1.5 EMA Only model for training.
1. Quyền đối với các mô hình được đăng lại thuộc về người sáng tạo ban đầu.
2. Người sáng tạo gốc muốn xác nhận mô hình vui lòng liên hệ nhân viên SeaArt AI qua kênh chính thức. Nhấp để xác nhận
