Insert image and video,The image will generate a video based on the video action,Text input:For example, the image is of James,The video is a video of a woman dancing,The text will be written as basketball star James dancing on the court(Image reference video action comes to life)
1 is depth map control mode
2 is Openpose pose control mode
The higher the resolution, the higher the clarity of the characters,, but it will affect the consistency of the reference image,choose your own,After all, it would be nice to have a 14B model like 1.3B model,Trial version
