Self-Supervised Effective Resolution Estimation With Adversarial Augmentations
M. Kansy, J. Balletshofer, J. Naruniec, C. Schroers, G. Mignone, M. Gross, R. M. Weber
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops (Waikoloa, USA, January 3-7, 2023), pp. 573-582
Abstract
High-resolution, high-quality images of human faces are desired as training data and output for many modern applications, such as avatar generation, face super-resolution, and face swapping. The terms high-resolution and high-quality are often used interchangeably; however, the two concepts are not equivalent, and high-resolution does not always imply high-quality. To address this, we motivate and precisely define the concept of effective resolution in this paper. We thereby draw connections to signal and information theory and show why baselines based on frequency analysis or compression fail. Instead, we propose a novel self-supervised learning scheme to train a neural network for effective resolution estimation without human-labeled data. It leverages adversarial augmentations to bridge the domain gap between synthetic and real, authentic degradations -- thus allowing us to train on domains, such as human faces, for which no or only few human labels exist. Finally, we demonstrate that our method outperforms state-of-the-art image quality assessment methods in estimating the sharpness of real and generated human faces, despite using only unlabeled data during training.