I think the unnecessary yapping in wan2.2 is caused by latent noise going from high to low. The mouth never settles correctly in each frame.

Is there a way to smoothen latent noise in a single region via a mask?