>>19941084
It's several 5 second generations where the final frame + original image are fed into the next training model which has a different prompt and different loras applied. This is done 5 times for a total of 600 frames. Output is a choice of 25s 24fps, 20s 30fps, 16.6s 36fps. Middle option usually looks most natural
random previous video