
Deep Models for Handheld Light Field Acquisition

Subject Area: Image and Language Processing, Computer Graphics and Visualisation, Human Computer Interaction, Ubiquitous and Wearable Computing
Term: 2020 to 2024
Project identifier: Deutsche Forschungsgemeinschaft (DFG) - Project number 437172262
Final Report Year: 2024

Final Report Abstract

Light fields capture 3D scenes from multiple viewpoints and have applications in precise free-viewpoint rendering of a scene and in the estimation of object geometries and materials. The aim of this project was to develop generalizable deep models for light field representation and algorithms for light field reconstruction, in contrast to existing approaches that assume fixed acquisition setups. In the course of this research project, important scientific results were achieved on novel generative models for light fields, implicit representations for light fields, text-guided generalizable image reconstruction and manipulation, and the robustness of image reconstruction methods, with the overarching theme of flexible, robust models for light field and image recovery. Illustrative code sketches of several of these components follow this abstract.

We developed the first generative model for light fields, a generative autoencoder conditioned on the central view. We used this model as a prior for light field recovery across diverse tasks, including light field view synthesis, spatial-angular super-resolution, and recovery from coded projections, with advantages in flexibility and robustness over end-to-end trained networks.

We proposed a neural implicit representation for 4D light fields that is conditioned on a sparse set of input views and produces light field values for a continuous range of query spatio-angular coordinates. This scheme can super-resolve a sparse set of input views to any desired spatial and angular resolution and can additionally handle corrupt input views with missing pixels.

Further, we demonstrated the use of text-conditioned image diffusion models for image restoration and manipulation. We devised a fast, zero-shot method for text-guided image manipulation that maintains content consistency without further optimization or fine-tuning. We also proposed text-guided flexible image super-resolution to generate semantically accurate reconstructions that maintain data consistency with the low-resolution inputs. Our approach produced diverse solutions that are semantically aligned with the input text, while maintaining consistency with the degraded images for flexible upsampling factors.

Finally, we investigated the robustness of deep networks for image recovery in the settings of image deblurring and computed tomography. We analyzed robustness to different adversarial attacks, studied the transferability of attacks across methods, and examined the effect of architectural components on adversarial robustness. We further showed that localized attacks can be used in a beneficial manner to explore solutions to ill-posed reconstruction problems.
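As a rough illustration of the generative model described above, the following PyTorch sketch shows an autoencoder for 4D light fields conditioned on the central view. It is a minimal, deterministic stand-in for the project's generative variant; the tensor layout (all views stacked along the channel axis), the layer sizes, and the latent dimension are illustrative assumptions, not the actual architecture.

```python
# Minimal sketch (assumed architecture): a light field autoencoder whose
# decoder is conditioned on features of the central view.
import torch
import torch.nn as nn

class CentralViewConditionedAE(nn.Module):
    def __init__(self, num_views=49, latent_dim=256):
        super().__init__()
        # Encoder: full light field (views stacked on channels) -> latent code.
        self.encoder = nn.Sequential(
            nn.Conv2d(num_views * 3, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(128, latent_dim),
        )
        # Conditioning branch: features of the central view only.
        self.cond = nn.Sequential(
            nn.Conv2d(3, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.ReLU(),
        )
        # Decoder: latent code + central-view features -> all views.
        self.decoder = nn.Sequential(
            nn.Conv2d(128 + latent_dim, 256, 3, padding=1), nn.ReLU(),
            nn.Upsample(scale_factor=2, mode='bilinear'),
            nn.Conv2d(256, 128, 3, padding=1), nn.ReLU(),
            nn.Upsample(scale_factor=2, mode='bilinear'),
            nn.Conv2d(128, num_views * 3, 3, padding=1),
        )

    def forward(self, light_field, central_view):
        z = self.encoder(light_field)                      # (B, latent_dim)
        f = self.cond(central_view)                        # (B, 128, H/4, W/4)
        z_map = z[:, :, None, None].expand(-1, -1, f.shape[2], f.shape[3])
        return self.decoder(torch.cat([f, z_map], dim=1))  # (B, num_views*3, H, W)
```

Used as a prior, such a model restricts reconstructions to its learned output manifold: a recovery task can be solved by optimizing the latent code so that the decoded light field matches the available observations.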
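The implicit representation can be pictured as an MLP that maps a continuous spatio-angular query (x, y, u, v), together with a conditioning feature derived from the sparse input views, to an RGB value. The positional encoding depth, layer widths, and conditioning dimension below are assumptions for illustration.

```python
# Minimal sketch of a conditional implicit light field representation.
import torch
import torch.nn as nn

def positional_encoding(coords, num_freqs=6):
    # Lift raw coordinates to sin/cos features so the MLP can
    # represent high-frequency detail.
    feats = [coords]
    for k in range(num_freqs):
        feats += [torch.sin((2.0 ** k) * coords), torch.cos((2.0 ** k) * coords)]
    return torch.cat(feats, dim=-1)

class ImplicitLightField(nn.Module):
    def __init__(self, cond_dim=128, num_freqs=6, hidden=256):
        super().__init__()
        in_dim = 4 * (1 + 2 * num_freqs) + cond_dim  # encoded (x,y,u,v) + condition
        self.mlp = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 3),  # RGB value at the queried coordinate
        )

    def forward(self, coords, cond):
        # coords: (N, 4) continuous spatio-angular queries in [-1, 1]
        # cond:   (N, cond_dim) features derived from the sparse input views
        return self.mlp(torch.cat([positional_encoding(coords), cond], dim=-1))
```

Because the network answers arbitrary continuous queries, sampling a denser (x, y) grid yields spatial super-resolution and a denser (u, v) grid yields angular super-resolution, and input views with missing pixels can simply be excluded from the fitting loss.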
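The data-consistency requirement in the text-guided super-resolution work can be illustrated with a standard back-projection step: a candidate high-resolution image is corrected so that its downsampled version agrees with the observed low-resolution input. The bicubic degradation operator below is an assumed stand-in, not necessarily the degradation model used in the project.

```python
# Sketch of a back-projection data-consistency step (assumed bicubic
# degradation): nudge a high-resolution candidate toward agreement with
# the observed low-resolution image.
import torch
import torch.nn.functional as F

def enforce_data_consistency(hr_candidate, lr_input, scale):
    # Apply the assumed degradation to the candidate.
    down = F.interpolate(hr_candidate, scale_factor=1.0 / scale,
                         mode='bicubic', align_corners=False)
    # Low-resolution residual: what the candidate currently gets wrong.
    residual = lr_input - down
    # Push the residual back to high resolution and correct the candidate.
    return hr_candidate + F.interpolate(residual, scale_factor=scale,
                                        mode='bicubic', align_corners=False)
```

Interleaving such a step with the sampling iterations of a text-conditioned diffusion model is one common way to keep the diverse, text-aligned outputs approximately consistent with the degraded input for any upsampling factor the degradation operator supports.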
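The robustness study can likewise be illustrated with a projected-gradient-descent (PGD) style attack on a reconstruction network: a small L-infinity-bounded perturbation of the measurement is optimized to maximally change the reconstruction. The network handle `net`, the step size, and the perturbation budget are illustrative assumptions.

```python
# Sketch of a PGD-style attack on an image reconstruction network (assumed
# interface: net maps a measurement tensor to a reconstructed image).
import torch
import torch.nn.functional as F

def pgd_attack(net, measurement, eps=2 / 255, alpha=0.5 / 255, steps=20):
    for p in net.parameters():
        p.requires_grad_(False)          # attack the input, not the weights
    target = net(measurement).detach()   # reconstruction from the clean input
    delta = torch.zeros_like(measurement, requires_grad=True)
    for _ in range(steps):
        loss = F.mse_loss(net(measurement + delta), target)
        loss.backward()
        with torch.no_grad():
            delta += alpha * delta.grad.sign()  # ascend the reconstruction error
            delta.clamp_(-eps, eps)             # project back into the budget
        delta.grad.zero_()
    return (measurement + delta).detach()
```

Restricting the perturbation to a small image region turns the same machinery into the localized attacks mentioned above, which probe how strongly individual measurement regions can sway the solution of an ill-posed reconstruction problem.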

