Original

WAN2.1 Fantasytalking-Audio Driver-KJ

Última actualización:2025-11-19

wan_fantasytalking: an audio‑driven video generation model for lip‑synced digital humans. Given a single portrait image plus an audio clip, it produces a high‑fidelity talking video with strict lip synchronization and natural head motion and facial expressions, emphasizing identity consistency and temporal coherence.


Input/Output: single portrait + audio → talking video; focuses on three aspects: lip‑sync accuracy, identity preservation, and natural motion/expressions.


Lip‑sync and temporal modeling: uses audio features (e.g., speech, phonemes, visemes) to drive the mouth and facial regions, jointly coupling head motion and expressions to avoid the “lips‑only” uncanny effect.

Traducción con un clic
Vista previa de nodos 23 nodes
Pantalla completa
Haz clic para cargar la vista previa del nodo
Ejecutar (116)
Favorito (6)
Descarga (1)
Compartir
Detalles del flujo de trabajo
Tipo
Flujo de trabajo
Calificación
5
Fecha de lanzamiento
2025-10-14
Estado
Ejecutable
Información del nodo (23)
Comentario
0/400
0 comentario(s)