Karya asli

Wan2.1 Fantasytalking-Audio Driver-KJ

Pembaruan Terakhir:2025-11-19

wan_fantasytalking: an audio‑driven video generation model for lip‑synced digital humans. Given a single portrait image plus an audio clip, it produces a high‑fidelity talking video with strict lip synchronization and natural head motion and facial expressions, emphasizing identity consistency and temporal coherence.


Input/Output: single portrait + audio → talking video; focuses on three aspects: lip‑sync accuracy, identity preservation, and natural motion/expressions.


Lip‑sync and temporal modeling: uses audio features (e.g., speech, phonemes, visemes) to drive the mouth and facial regions, jointly coupling head motion and expressions to avoid the “lips‑only” uncanny effect.

Terjemahan Sekali Klik
Pratinjau Node 23 nodes
Layar penuh
Klik untuk Memuat Pratinjau Node
Jalankan (116)
Favorit (6)
Unduh (1)
Bagikan
Detail Alur Kerja
Jenis
Alur Kerja
Penilaian
5
Waktu publikasi
2025-10-14
Status
Dapat Dijalankan
Info Node (23)
Komentar
0/400
Total 0 komentar