原創

萬無限對話- 音頻驅動程式-KJ

最后更新:2025-11-19

- Core concept: An audio-driven lip-sync model using a sparse-frame/keyframe strategy for video dubbing; it preserves identity consistency over long durations and naturally couples head motion, facial expressions, and body pose to the audio. It supports an “image + audio → talking video” mode (starting from a single image) with no upper limit on video length.


- Input/Output: Inputs speech audio (optionally with text/phoneme alignments) and a reference portrait (video or a single image); outputs a talking-face video that closely matches the audio while preserving natural head/expression dynamics and the subject’s identity beyond just the lips.

一键翻译
節點预览 24 nodes
全屏
點擊加载節點预览
運行 (639)
收藏 (17)
下载 (12)
分享
工作流详情
類型
工作流
评分
4.9
发布時間
2025-10-13
狀态
可運行
節點信息 (24)
评論
0/400
共 0 条评論