Strong response consistency for more robust, disentangled and generalizable machine vision

Applicant Professor Dr. Wieland Brendel

Subject Area Image and Language Processing, Computer Graphics and Visualisation, Human Computer Interaction, Ubiquitous and Wearable Computing

Term since 2021

Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 448601360

Project Description

Deep learning is the workhorse of today's machine intelligence that has produced numerous success stories all over science, industry, and society. However, this rapid and wide-spread success has masked four severe and fundamental issues that limit deep learning to comparatively few, narrowly defined application areas. For one, deep neural networks (DNNs) are not robust and even small input perturbations like JPEG compression or imperceptible but carefully engineered noise can derail their behavior. Second, the representations of DNNs are not disentangled and cannot be easily related to concepts understandable by humans. Third, the representations of DNNs do not generalize well to other tasks or domains without additional supervised training. Finally, we have little theoretical understanding of the representations learned by DNNs, making it difficult to solve the aforementioned issues.To make headway in these areas, I here propose a set of four consistency conditions that the visual representations of DNNs should fulfill. These conditions provide a strong theoretical grounding for unsupervised representation learning that opens up new avenues to study and enhance robustness, disentanglement, and generalization. By targeting these fundamental issues of deep learning, this work will reduce the gap between human and machine vision, increase reliability and explainability of DNNs, accelerate method development, and open up new application areas.

DFG Programme Independent Junior Research Groups

Major Instrumentation Computing Server with 8 GPUs

Instrumentation Group 7030 Dedizierte, dezentrale Rechenanlagen, Prozeßrechner

Servicenavigation

Hauptnavigation

Strong response consistency for more robust, disentangled and generalizable machine vision

Additional Information

Servicenavigation

Hauptnavigation

Strong response consistency for more robust, disentangled and generalizable machine vision

Additional Information

Textvergrößerung und Kontrastanpassung