Does the alignment file simply tell the model where to find the face, and then the model analyzes the pixels for the swap, or are the alignments playing a larger role in the swap process? I.e., does the model look to the alignments for the face generation, or only to the underlying video after the alignments tell it where to find faces?
I ask because there are clearly nuances in facial expressions that are not captured in the alignments, which leads me to believe that the alignments only serve to indicate where to find a face. Or is it a combination?