Computer Graphics Laboratory ETH Zurich

ETH

Self-Supervised Effective Resolution Estimation With Adversarial Augmentations

M. Kansy, J. Balletshofer, J. Naruniec, C. Schroers, G. Mignone, M. Gross, R. M. Weber

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops (Waikoloa, USA, January 3-7, 2023), pp. 573-582

Abstract

High-resolution, high-quality images of human faces are desired as training data and output for many modern applications, such as avatar generation, face super-resolution, and face swapping. The terms high-resolution and high-quality are often used interchangeably; however, the two concepts are not equivalent, and high-resolution does not always imply high-quality. To address this, we motivate and precisely define the concept of effective resolution in this paper. We thereby draw connections to signal and information theory and show why baselines based on frequency analysis or compression fail. Instead, we propose a novel self-supervised learning scheme to train a neural network for effective resolution estimation without human-labeled data. It leverages adversarial augmentations to bridge the domain gap between synthetic and real, authentic degradations -- thus allowing us to train on domains, such as human faces, for which no or only few human labels exist. Finally, we demonstrate that our method outperforms state-of-the-art image quality assessment methods in estimating the sharpness of real and generated human faces, despite using only unlabeled data during training.




Downloads

Download Paper
[PDF]
Download Paper
[PDF suppl.]
Download Paper
[BibTeX]