



I gathered 15 images of Selene from Underworld. I cropped the HD photos with Birme only, rather than using Faceswap to align the faces. Some of the images are full-body shots, because I wanted to retain her face even when zoomed out. I used BLIP captioning to generate the filewords, then edited each caption individually to reduce potential hallucinations.
For the learning rate I used the schedule 0.005:100,0.0025:250,0.001:500,0.0005, and I am going for 5K training steps in total. The batch size is 1 with Gradient Accumulation Steps set to 3. I am running on an RTX 4090 in the cloud and using 12.5 of its 24 GB of VRAM. The estimated time to completion is 1.5 hours. For the embedding I am using 5 vectors per token. I switched to the SD 1.5 EMA Only model for training.
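To make the learning rate schedule easier to read, here is a minimal Python sketch of how I understand it. It assumes the string follows the rate:step syntax of the AUTOMATIC1111 web UI, where each rate applies up to and including the listed step and the final rate runs until training ends. The boundary handling and the helper names (parse_schedule, lr_at) are my own illustration, not the trainer's actual code.

```python
# Hypothetical helpers for illustration; not the web UI's implementation.

def parse_schedule(spec, total_steps):
    """Turn 'rate:step,...,rate' into a list of (rate, last_step) pairs."""
    schedule = []
    for part in spec.split(","):
        part = part.strip()
        if ":" in part:
            rate, last_step = part.split(":")
            schedule.append((float(rate), int(last_step)))
        else:
            # A rate with no step is assumed to last until training ends.
            schedule.append((float(part), total_steps))
    return schedule


def lr_at(schedule, step):
    """Return the learning rate assumed to be active at a 1-based step."""
    for rate, last_step in schedule:
        if step <= last_step:
            return rate
    return schedule[-1][0]


SPEC = "0.005:100,0.0025:250,0.001:500,0.0005"
TOTAL_STEPS = 5000
schedule = parse_schedule(SPEC, TOTAL_STEPS)

# Batch size 1 with 3 accumulation steps means gradients from
# 3 images are combined before each weight update.
BATCH_SIZE = 1
GRAD_ACCUM_STEPS = 3
effective_batch = BATCH_SIZE * GRAD_ACCUM_STEPS

for step in (1, 100, 101, 250, 251, 500, 501, TOTAL_STEPS):
    print(step, lr_at(schedule, step))
```

Under this reading, the rate starts high for the first 100 steps, is lowered at steps 100, 250 and 500, and the remaining 4,500 steps all run at 0.0005.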
