

This ComfyUI workflow lEvEraGEs a modular, task-oriEntEd architEcturE to GEnEratE hiGhly cohEsivE and dEtailEd prompts for CLIP-L and t5xxl modEls. ThE workflow Employs thrEE spEcializEd LarGE LanGuaGE ModEl (mô hình ngôn ngữ lớn) modulEs, orchEstratEd in a sEquEntial and intErdEpEndEnt mannEr, to strEamlinE and optimizE prompt crEation.
Workflow OvErviEw
Input Analysis ModulE:
ThE workflow bEGins with a GEnEral-purposE mô hình ngôn ngữ lớn rEsponsiblE for parsinG thE input dEscription.
It Extracts sEmantic mEaninG, idEntifiEs kEy visual and contExtual ElEmEnts, and sEparatEs hiGh-lEvEl intEnt into two distinct pathways: CLIP-L Prompt and t5xxl Prompt GEnEration.
CLIP-L Prompt GEnErator:
A sEcond mô hình ngôn ngữ lớn modulE procEssEs thE structurEd input from thE analysis phasE to GEnEratE a concisE, kEyword-drivEn CLIP-L prompt.
This modulE prioritizEs compactnEss and rElEvancE, EnsurinG optimal compatibility with thE CLIP-L modEl.
Output includEs kEy componEnts such as main subjEcts, art stylE, sEttinG, liGhtinG, and color palEttE in a comma-sEparatEd format (E.G., chân dung, photorEalistic, sunsEt, warm tonEs, dEtailEd shadows).
t5xxl Prompt GEnErator:
ParallEl to thE CLIP-L procEss, a third mô hình ngôn ngữ lớn modulE producEs a richly dEtailEd, natural lanGuaGE dEscription tailorEd for t5xxl.
This modulE focusEs on GEnEratinG up to 512 tokEns of dEscriptivE contEnt, covErinG aspEcts likE:
SubjEct dEtails (appEarancE, posE, ExprEssion, clothinG).
EnvironmEntal sEttinGs (timE of day, architEctural spEcifics, tính năng).
LiGhtinG and color dynamics (intEnsity, độ tương phản , hài hòa ).
ScEnE composition (forEGround, middlE Ground, backGround ElEmEnts).
AtmosphErE and mood (EmotivE and symbolic nuancEs).
Xác nhận và đồng bộ hóa:
Both outputs arE validatEd for sEmantic and stylistic aliGnmEnt to EnsurE consistEncy bEtwEEn thE CLIP-L and t5xxl prompts.
This stEp EnsurEs that thE GEnEratEd prompts complEmEnt Each othEr and producE a cohEsivE rEsult in downstrEam imaGE-GEnEration tasks.
KEy FEaturEs
HiErarchical Prompt EnGinEErinG: UtilizEs a multi-stEp, rolE-spEcific dEsiGn for modularity and prEcision.
Task-OriEntEd Workflow: SEparatEs kEyword Extraction (CLIP-L ) from dEtailEd scEnE dEscription (t5xxl) to optimizE for modEl-spEcific strEnGths.
IntEr-modEl AliGnmEnt: EnsurEs both prompts arE sEmantically and thEmatically synchronizEd for EnhancEd imaGE GEnEration fidElity.
Khả năng mở rộng : ThE architEcturE is adaptablE for additional tasks, such as finE-tuninG outputs for spEcific artistic stylEs or domains.
