Project Details
Projekt Print View

Strong response consistency for more robust, disentangled and generalizable machine vision

Subject Area Image and Language Processing, Computer Graphics and Visualisation, Human Computer Interaction, Ubiquitous and Wearable Computing
Term since 2021
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 448601360
 
Deep learning is the workhorse of today's machine intelligence that has produced numerous success stories all over science, industry, and society. However, this rapid and wide-spread success has masked four severe and fundamental issues that limit deep learning to comparatively few, narrowly defined application areas. For one, deep neural networks (DNNs) are not robust and even small input perturbations like JPEG compression or imperceptible but carefully engineered noise can derail their behavior. Second, the representations of DNNs are not disentangled and cannot be easily related to concepts understandable by humans. Third, the representations of DNNs do not generalize well to other tasks or domains without additional supervised training. Finally, we have little theoretical understanding of the representations learned by DNNs, making it difficult to solve the aforementioned issues.To make headway in these areas, I here propose a set of four consistency conditions that the visual representations of DNNs should fulfill. These conditions provide a strong theoretical grounding for unsupervised representation learning that opens up new avenues to study and enhance robustness, disentanglement, and generalization. By targeting these fundamental issues of deep learning, this work will reduce the gap between human and machine vision, increase reliability and explainability of DNNs, accelerate method development, and open up new application areas.
DFG Programme Independent Junior Research Groups
Major Instrumentation Computing Server with 8 GPUs
Instrumentation Group 7030 Dedizierte, dezentrale Rechenanlagen, Prozeßrechner
 
 

Additional Information

Textvergrößerung und Kontrastanpassung