Détails
Recommandé
Dall-e 3 v1
Dall-e 3 v0.3
Dall-e 3 v0.2
Dall-e 3 v0.1
Dall-e 3 Lora

Dall-e 3 Lora

6.4K
33
926
#Anime
#3D
#Style
#characters
# Styles
#dalle3

Visit my Ko-Fi page if you want to support me:

https://ko-fi.com/bionagato

Alternatively, you can find me on Buy Me a Coffee at:

https://www.buymeacoffee.com/bionagato

I trained this DALL-E 3 LORA using the Civitai. Somehow, it works very well, even on the base model. I just tested it at 1024 x 1024 resolution, so I'm not sure if it works well with other resolutions.

UPDATE Oct 29, 2023: The new lora was trained with a mixture of prompts created using this system prompt, Clip interrogator 2 (experimental), Booru tags and llava 1.5.

UPDATE Oct 29, 2023: The system prompt has been updated to use Chain of Thought, which should result in better prompts.

UPDATE Oct 29, 2023: Added version v1. Better tagged dataset. I recommend Opendalle v1.1 for the base model.

OUPDATED (No valid for v1):

OPTIONAL: For prompting, you can use it as you would with other models, but this model was specifically trained for use with a system prompt in ChatGPT or Llama (the example prompts are from a Microsoft paper related to DALL-E 3). This only valid for 0.1, 0.2 and 0.3. 1.0 should work with any prompt. If you want try to replicate an image SFW, try using Google Gemini.

SYSTEM PROMPT:

You are a simple prompt creator bot. First you will think about the user prompt step by step, and then you will create a good prompt for your user without adding any explanation inside the prompt, be clean in your prompt. The prompt can be about anything, photos, illustrations, anime, manga, etc. These prompts can be used to generate images in any image generator model. You will use Markdown for your prompt.

User: I want a computer in a room, at night, cute.

Bot: Thinking about the prompt:

The user asked for "a room, at night, cute". Since the user didn't specify an image type, I'll assume he wants a photo. Photo at night, so there is little illumination, so better use a lower ISO, I will use ISO 100. A wider aperture allows more light to enter the lens, so I will use the aperture to its widest setting, f/1.8.

I will use a flash at this setting to compensate for the lower illumination. I will also use the RAW format for more flexibility in post-processing, especially for adjusting exposure and reducing noise. The user didn't specify the lighting, so I'll add a standard incandescent bulb, which produces a warm, orange light, and I'll use Tungsten to correct the color temperature.

I will also add these keywords to make the photo more professional: award-winning, professional, highly detailed.

Bot: Your prompt is ready:

```Breathtaking photo of a black old computer in a corner of a cozy room at night, its bright monitor showing a DOS terminal, low light, incandescent lighting, sharp focus, ISO 100, f/1.8, RAW, tungsten, award-winning, professional, highly detailed```

SYSTEM PROMPT ENDS HERE

After adding the system prompt to your chat model, simply chat with it, give it the prompt of the image you want to generate, and the model will convert it into a 'better prompt' that works better with this LORA. This is because I tagged the images for the LORA using llava 1.5 13b with that prompt as a system prompt for llava.


Remember that the System Prompt is optional, I generated some good images using short prompts.

I will upload two versions: 8 epochs and 10 epochs.

Voir la traduction

Notes & Commentaires

4.8 /5
0 Notes

Pas encore reçu suffisamment d'évaluations ou de commentaires

B
Chatter avec le modèle
Annonce
2024-01-06
Publier un modèle
2024-01-06
Mettre à jour les informations du modèle
Détails du modèle
Type
LORA
Temps de Publication
2024-01-06
Modèle Basique
SDXL 1.0
Introduction de version

This was trained on a new dataset tagged with Google's Gemini API:

"View the images and provide all the details about it, including information about the subjects (mention the names of each characters if there are recognizable characters or logos), style (such as anime, manga, realistic, 3D render, pixel art, photo, drawing, etc.), details about a person or character (clothing, hair color, eyes, etc.), character expression, scenery, camera position and character view (front view, profile, etc.), concept, the scenario depicted in the image, and colors of various elements (hair, eyes, clothing, car, house, etc.). Describe the image visually in extreme details. Name all characters in the image."

I think this is the most stable version of this LORA. It was trained on 1000 images because that is the limit of CivitAI, I have a dataset of 8000 images tagged with Gemini.

I'm sharing a mini script created with GPT-4 for usign the Gemini API for tagging images.

https://mega.nz/file/Jx8U2JJJ#OV35ygry0GiJL4R7cuWxeNcmTk2IiFyrqi4kl1p6USo

How to use:

  1. In the same place that the bard_tagger.py create a folder and name it "Dataset".

  2. Inside of "Dataset" create another folder with the name "????". All your rejected images will be moved here.

  3. Move the images you want to images to the tag to "Dataset"

  4. Open bard_tagger.py and replace "YOU_API_KEY_HERE" with your API.

Gemini is really good, it recognizes images very well and also recognizes many characters by name, for example, it recognizes Kurumi from Date a Live and Shinobu Oshino from Monogatari.

Périmètre de la licence
Source: civitai

1. Modèle partagé uniquement à l'apprentissage et au partage. Droits d'auteur et interprétation finale réservés à l'auteur original.

2. Auteur souhaitant revendiquer le modèle : Contactez officiellement SeaArt AI pour l'authentification. Nous protégeons les droits de chaque auteur. Cliquer pour revendiquer

Périmètre de la licence de création
Génération d'images en ligne
Effectuer une fusion
Autoriser le téléchargement
Périmètre de la licence de commerce
Les images générées peuvent être vendues ou utilisées à des fins commerciales
La revente ou la vente après fusion du modèle est autorisée.
QR Code
Télécharger l'App SeaArt
Poursuivez votre voyage de création AI sur mobile