High-Resolution Neural Face Swapping for Visual Effects
J. Naruniec, L. Helminger, C. Schroers, R. M. Weber
Proceedings of Eurographics Symposium on Rendering (EGSR) (London, UK, June 29 -- July 3, 2020), Computer Graphics Forum, vol. 39, no. 4, 2020, pp. 173-184
Abstract
In this paper, we propose an algorithm for fully automatic neural face swapping in images and videos. To the best of our knowledge, this is the first method capable of rendering photo-realistic and temporally coherent results at megapixel resolution. To this end, we introduce a progressively trained multi-way comb network and a light- and contrast-preserving blending method. We also show that while progressive training enables generation of high-resolution images, extending the architecture and training data beyond two people allows us to achieve higher fidelity in generated expressions. When compositing the generated expression onto the target face, we show how to adapt the blending strategy to preserve contrast and low-frequency lighting. Finally, we incorporate a refinement strategy into the face landmark stabilization algorithm to achieve temporal stability, which is crucial for working with high-resolution videos. We conduct an extensive ablation study to show the influence of our design choices on the quality of the swap and compare our work with popular state-of-the-art methods