



I gathered 15 images of Selene from Underworld. I only used Birme to crop the HD photos, rather than using Faceswap to align the faces. Some of the images are full-body shots because I wanted the embedding to retain her face even when zoomed out. I used BLIP captioning to generate the filewords and edited each caption individually to reduce potential hallucinations.
I used a stepped learning rate schedule of 0.005:100,0.0025:250,0.001:500,0.0005. I am going for 5K training steps total. I am using a batch size of 1 with Gradient Accumulation Steps set to 3. I am running on an RTX 4090 in the cloud and using 12.5 of its 24 GB of VRAM. The estimated time to completion is 1.5 hours. For the embedding I am using 5 vectors per token. I switched to the SD 1.5 EMA-only model for training.
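For anyone unsure how to read that learning rate string, here is a minimal Python sketch of how I understand it: each `rate:step` pair means "use this rate until that step", and the last rate with no step runs to the end of training. The helper names `parse_schedule` and `lr_at` are my own for illustration and are not the trainer's actual code. Treating each boundary step as still belonging to the earlier rate is an assumption.

```python
def parse_schedule(spec, total_steps):
    """Turn 'rate:step,rate:step,rate' into a list of (rate, last_step) pairs."""
    schedule = []
    for part in spec.split(","):
        part = part.strip()
        if ":" in part:
            rate, end = part.split(":")
            schedule.append((float(rate), int(end)))
        else:
            # A rate with no step applies until the end of training.
            schedule.append((float(part), total_steps))
    return schedule


def lr_at(schedule, step):
    """Return the learning rate for a 1-based training step.

    Assumption: the boundary step itself still uses the earlier rate.
    """
    for rate, end in schedule:
        if step <= end:
            return rate
    return schedule[-1][0]


SPEC = "0.005:100,0.0025:250,0.001:500,0.0005"
TOTAL_STEPS = 5000

schedule = parse_schedule(SPEC, TOTAL_STEPS)
for step in (50, 200, 400, 3000):
    print(step, lr_at(schedule, step))

# Batch size 1 with 3 gradient accumulation steps behaves like a batch of 3
# per weight update (assuming gradients are accumulated before each update).
BATCH_SIZE = 1
GRAD_ACCUM_STEPS = 3
effective_batch = BATCH_SIZE * GRAD_ACCUM_STEPS
print("effective batch size:", effective_batch)
```

So the higher rates only cover the first 500 steps, and the remaining 4,500 steps run at 0.0005.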
