Introduction
The goal of our research is to study and develop algorithms for image and video processing, editing, analysis and synthesis. Our focus lies on developing highly efficient algorithms which can be applied to real-world high resolution image and video data.
Topics
Video Segmentation
Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video. A pixel-accurate, spatio-temporal bipartition of the video is instrumental to several applications including, among others, action recognition, object tracking, video summarization, and rotoscoping for video editing. Despite remarkable progress in recent years, video object segmentation still remains a challenging problem and most existing approaches still exhibit too severe limitations in terms of quality and efficiency to be applicable in practical applications, e.g. for processing large datasets, or video post-production and editing in the visual effects industry.
Phase-based Methods for Video
Phase-based methods rely on the assumption that small motions can be encoded in the phase shift of an individual pixel. Due to per-pixel phase modifications this concept allows the design of highly efficient algorithms. Currently, however, the spatial displacement which can be encoded in the phase information with these methods is highly limited. Our research focuses on overcoming these limitations and designing algorithms for various applications such as frame interpolation and view synthesis.
Video-based Rendering
Video-based rendering aims to generate virtual views of a real world scene that was recorded by one or more video cameras. The goal is to achieve as realistic as possible images based on only the camera input, e.g. from standard TV cameras. We developed novel representation and rendering methods that result in images visually not distinguishable from original camera images. Together with our collaborator, LiberoVision AG, we show a successful application for this: virtual replays of sports events.
Light Field Processing
Since its introduction to the computer graphics community the light field has been widely used as an alternative way to represent visual aspects of 3D objects and scenes. Although it has already proven its potential in many areas such as visual effects in movie industries, still many of such tasks are done manually using conventional tools that are not aware of light fields. We pursue developing algorithms and computational tools to automate its processing and enable new paradigms of image and video manipulation. We also aim to extend its use to a broader range of graphics and vision problems. Our research consists of light field acquisition, spatio-temporal manipulation, geometry reconstruction, and 3D rendering, spanning the entire life cycle of light fields.
Stereoscopy
Stereoscopic 3D has gained significant importance in the entertainment industry. However, production of high quality stereoscopic content is still a challenging art that requires mastering the complex interplay of human perception, 3D display properties, and artistic intent. Our research ranges from development of the stereoscopic camera system to computational methods to deal with the difficulty that arises in the course of stereoscopic content creation.
Publications
2024
T. N. Schnabel, Y. Lill, B. K. Benitez, P. Nalabothu, P. Metzler, A. A. Mueller, M. Gross, B. Gözcü, B. Solenthaler
Large-Scale 3D Infant Face Model
Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 (Marrakesh, Morocco, October 06-10, 2024), pp. 217-227
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
2023
Z. Chen, L. Relic, R. Azevedo, Y. Zhang, M. Gross, D. Xu, L. Zhou, C. Schroers
Neural Video Compression with Spatio-Temporal Cross-Covariance Transformers
MM '23: Proceedings of the 31st ACM International Conference on Multimedia (Ottawa, Canada, October 29-November 3, 2023), pp. 8543-8551
Available files:
[
PDF][
PDF suppl.] [
BibTeX]
[
Abstract]
M. Kansy, A. Raël, G. Mignone, J. Naruniec, C. Schroers, M. Gross, R. M. Weber
Controllable Inversion of Black-Box Face Recognition Models via Diffusion
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops (Paris, France, October 02-06, 2023), pp. 3159-3169
Available files:
[
PDF][
PDF suppl.] [
BibTeX]
[
Abstract]
[
YouTube]
K. M. Briedis, A. Djelouah, R. Ortiz, M. Meyer, M. Gross, C. Schroers
Kernel-based Frame Interpolation for Spatio-temporally Adaptive Rendering
SIGGRAPH '23: ACM SIGGRAPH 2023 Conference Proceedings (Los Angeles,CA,USA, August 6-10, 2023), pp. 59:1-59:11
Available files:
[
PDF][
PDF suppl.] [
BibTeX]
[
Abstract]
[
YouTube]
M. Bernasconi, A. Djelouah, F. Salehi, M. Gross, C. Schroers
Kernel Aware Resampler
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Vancouver, Canada, June 18-22, 2023), pp. 22347-22355
Available files:
[
PDF][
PDF suppl.] [
BibTeX]
[
Abstract]
[
YouTube]
M. Kansy, J. Balletshofer, J. Naruniec, C. Schroers, G. Mignone, M. Gross, R. M. Weber
Self-Supervised Effective Resolution Estimation With Adversarial Augmentations
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops (Waikoloa, USA, January 3-7, 2023), pp. 573-582
Available files:
[
PDF][
PDF suppl.] [
BibTeX]
[
Abstract]
[
YouTube]
2021
K. M. Briedis, A. Djelouah, M. Meyer, I. McGonigal, M. Gross, C. Schroers
Neural Frame Interpolation for Rendered Content
Proceedings of ACM SIGGRAPH Asia (Tokyo, Japan, Dec. 14-17, 2021), ACM Transactions on Graphics, vol. 40, no. 6, pp. 239:1-239:13
Available files:
[
PDF][
PDF suppl.] [
BibTeX]
[
Abstract]
[
YouTube]
2020
J. Naruniec, L. Helminger, C. Schroers, R. M. Weber
High-Resolution Neural Face Swapping for Visual Effects
Proceedings of Eurographics Symposium on Rendering (EGSR) (London, UK, June 29 -- July 3, 2020), Computer Graphics Forum, vol. 39, no. 4, 2020, pp. 173-184
Available files:
[
PDF] [
BibTeX]
[
Abstract]
[
YouTube]
R. Roveri, A. C. Öztireli, I. Pandele, M. Gross
PointProNets: Consolidation of Point Clouds with Convolutional Neural Networks
Proceedings of Eurographics (Delft, The Netherlands, April 16-20, 2018), Computer Graphics Forum, vol. 37, no. 2, pp. 87-99
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
B. Kim, O. Wang, A. C. Öztireli, M. Gross
Semantic Segmentation for Line Drawing Vectorization Using Neural Networks
Proceedings of Eurographics (Delft, Netherlands, April 16-20, 2018), Computer Graphics Forum, vol. 37, no. 2, pp. 329-338
Available files:
[
PDF] [
BibTeX]
[
Abstract]
R. Danecek*, E. Dibra*, A. C. Öztireli, R. Ziegler, M. Gross
DeepGarment : 3D Garment Shape Estimation from a Single Image
Proceedings of Eurographics (Lyon, France, April 24-28, 2017), Computer Graphics Forum, vol. 36, no. 2, pp. 269-280
Available files:
[
PDF][
PDF suppl.] [
BibTeX]
[
Abstract]
E. Dibra, H. Jain, C. Öztireli, R. Ziegler, M. Gross
HS-Nets: Estimating Human Body Shape from Silhouettes with Convolutional Neural Networks
Proceedings of the Fourth International Conference on 3D Vision, 3DV (Stanford, CA, USA, October 25-28, 2016), pp. 108-117
Available files:
[
PDF][
PDF suppl.] [
BibTeX]
[
Abstract]
J.C. Bazin, C. Plüss (Kuster), G. Yu, T. Martin, A. Jacobson, M. Gross
Physically Based Video Editing
Proceedings of the Pacific Conference on Computer Graphics and Applications (Okinawa, Japan, October 11-14, 2016), Computer Graphics Forum, vol. 35, no. 7, 2016, pp. 421-429
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
A. Chapiro, T. Aydin, N. Stefanoski, S. Croci, A. Smolic, M. Gross
Art-Directable Continuous Dynamic Range Video
Computers and Graphics, Elsevier, vol. 53, no. , 2015, pp. 54-62
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
A. Chapiro, C. O'Sullivan, W. Jarosz, M. Gross, A. Smolic
Stereo from Shading
Proceedings of Eurographics Symposium on Rendering (EGSR) (Darmstadt,Germany, Jun 24-26, 2015), pp. --
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
F. Perazzi, O. Sorkine-Hornung, A. Sorkine-Hornung
Efficient Salient Foreground Detection for Images and Video using Fiedler Vectors
Proceedings of Eurographics (Zurich,Switzerland,, May 4-8, 2015), Computer Graphics Forum, vol. 34, no. 2, pp. 21-29
Available files:
[
PDF] [
BibTeX]
[
Abstract]
F. Perazzi, A. Sorkine-Hornung, H. Zimmer, P. Kaufmann, O. Wang, S. Watson, M. Gross
Panoramic Video from Unstructured Camera Arrays
Proceedings of Eurographics (Zurich,Switzerland,, May 4-8, 2015), Computer Graphics Forum, vol. 34, no. 2, pp. 57-68
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
2014
T. Aydin, N. Stefanoski, S. Croci, M. Gross, A. Smolic
Temporally Coherent Local Tone Mapping of HDR Video
Proceedings of ACM SIGGRAPH Asia (Shenzhen, December 3 - December 6, 2014), ACM Transactions on Graphics, vol. 33, no. 6, pp. 196:1--196:13
Available files:
[
PDF][
PDF suppl.] [
Video] [
BibTeX]
[
Abstract]
F. Angehrn, O. Wang, Y. Aksoy, M. Gross, A. Smolic
MasterCam FVV: Robust registration of multiview sports video to a static high-resolution master camera for free viewpoint video
Image Processing (ICIP), 2014 IEEE International Conference on, , vol. , no. , 2014, pp. 3474-3478
Available files:
[
PDF] [
BibTeX]
[
Abstract]
A. Chapiro, S. Heinzle, T. Aydin, S. Poulakos, M. Zwicker, A. Smolic, M. Gross
Optimizing Stereo-to-Multiview Conversion for Autostereoscopic Displays
Proceedings of Eurographics (Strasbourg,France, Apr 7-11, 2014), Computer Graphics Forum, vol. 33, no. 2, pp. 63-72
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
D. Saner, O. Wang, S. Heinzle, Y. Pritch, A. Smolic, A. Sorkine-Hornung, M. Gross
High-Speed Object Tracking Using an Asynchronous Temporal Contrast Sensor
Vision, Modeling and Visualization (Darmstadt, Germany, October 8-10, 2014), pp. 87-94
Available files:
[
PDF] [
BibTeX]
[
Abstract]
J. Rueegg, O. Wang, A. Smolic, M. Gross
DuctTake: Spatiotemporal Video Compositing
Proceedings of Eurographics (Girona, Spain, May 6-10, 2013), Computer Graphics Forum, vol. 32, no. , pp. 51-61
Available files:
[
PDF] [
BibTeX]
[
Abstract]
C. Kim, H. Zimmer, Y. Pritch, A. Sorkine-Hornung, M. Gross
Scene Reconstruction from High Spatio-Angular Resolution Light Fields
Proceedings of ACM SIGGRAPH (Anaheim, USA, July 21-25, 2013), ACM Transactions on Graphics, vol. 32, no. 4, pp. 73:1-73:12
Available files:
[
PDF][
PDF suppl.] [
BibTeX]
[
Abstract]
2012
M. Lang, O. Wang, T. Aydin, A. Smolic, M. Gross
Practical Temporal Consistency for Image-based Graphics Applications
Proceedings of ACM SIGGRAPH (Los Angeles,USA, August, 2012), ACM Transactions on Graphics, vol. 31, no. 4, pp. 34:1--34:8
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
T. Oskam, A. Hornung, R. Sumner, M. Gross
Fast and Stable Color Balancing for Images and Augmented Reality
3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT) (Zurich, Switzerland, October 13-15, 2012), pp. 49 - 56
Available files:
[
PDF] [
BibTeX]
[
Abstract]
H. Bowles, K. Mitchell, R. Sumner, J. Moore, M. Gross
Iterative Image Warping
Proceedings of Eurographics (Cagliari, Italy, May 13-18, 2012), Computer Graphics Forum, vol. 31, no. 2, pp. 237-246
Available files:
[
PDF] [
Video] [
Abstract]
M. Germann, T. Popa, R. Keiser, R. Ziegler, M. Gross
Novel-View Synthesis of Outdoor Sport Events Using an Adaptive View-Dependent Geometry
Proceedings of Eurographics (Cagliari, Italy, May 13-18, 2012), Computer Graphics Forum, vol. 31, no. 2, pp. 325-333
Available files:
[
PDF] [
BibTeX]
[
Abstract]
2011
A. Smolic, S. Poulakos, S. Heinzle, P. Greisen, M. Lang, A. Hornung, M. Farre, N. Stefanoski, O. Wang, L. Schnyder, R. Monroy, M. Gross
Disparity-Aware Stereo 3D Production Tools
Proceedings of Conference for Visual Media Production (CVMP) (London, UK, Nov 16-17, 2011), pp. 165-173
Available files:
[
PDF] [
BibTeX]
[
Abstract]
C. Kim, A. Hornung, S. Heinzle, W. Matusik, M. Gross
Multi-Perspective Stereoscopy from Light Fields
Proceedings of ACM SIGGRAPH Asia (Hong Kong, China, December 12-15, 2011), ACM Transactions on Graphics, vol. 30, no. 6, pp. 190:1-190:10
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
T. Oskam, A. Hornung, H. Bowles, K. Mitchell, M. Gross
OSCAM - Optimized Stereoscopic Camera Control for Interactive 3D
Proceedings of ACM SIGGRAPH Asia (Hong Kong, China, December 12-15, 2011), ACM Transactions on Graphics, vol. 30, no. 6, pp. 189:1-189:8
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
S. Heinzle, P. Greisen, D. Gallup, C. Chen, D. Saner, A. Smolic, A. Burg, W. Matusik, M. Gross
Computational Stereo Camera System with Programmable Control Loop
Proceedings of ACM SIGGRAPH (Vancouver, Canada, August 7-11, 2011), ACM Transactions on Graphics, vol. 30, no. 4, pp. 94:1-94:10
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
M. Lang, A. Hornung, O. Wang, S. Poulakos, A. Smolic, M. Gross
Nonlinear Disparity Mapping for Stereoscopic 3D
Proceedings of ACM SIGGRAPH (Los Angeles, USA, July 25-29, 2010), ACM Transactions on Graphics, vol. 29, no. 3, pp. 75:1-75:10
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]
M. Germann, A. Hornung, R. Keiser, R. Ziegler, S. Würmlin, M. Gross
Articulated Billboards for Video-based Rendering
Proceedings of Eurographics (Norrköping, Sweden, May 3-7, 2010), Computer Graphics Forum, vol. 29, no. 2, pp. 585-594
Available files:
[
PDF] [
Video] [
BibTeX]
[
Abstract]