Academic Journal

An Information-Theoretic Method to Automatic Shortcut Avoidance and Domain Generalization for Dense Prediction Tasks

التفاصيل البيبلوغرافية
العنوان: An Information-Theoretic Method to Automatic Shortcut Avoidance and Domain Generalization for Dense Prediction Tasks
المؤلفون: Wei Qin Chuah, Ruwan Tennakoon, Reza Hoseinnezhad, David Suter, Alireza Bab-Hadiashar
سنة النشر: 2023
مصطلحات موضوعية: Machine learning not elsewhere classified, Dense Prediction Tasks, Domain Generalization, Estimation, Optical Flow, Optical imaging, Robustness, Semantic Segmentation, Shortcut Learning, Stereo Matching, Synthetic data, Task analysis, Training
الوصف: Deep convolutional neural networks for dense prediction tasks are commonly optimized using synthetic data, as generating pixel-wise annotations for real-world data is laborious. However, the synthetically trained models do not generalize well to real-world environments. This poor “synthetic to real” (S2R) generalization we address through the lens of shortcut learning. We demonstrate that the learning of feature representations in deep convolutional networks is heavily influenced by synthetic data artifacts (shortcut attributes). To mitigate this issue, we propose an Information-Theoretic Shortcut Avoidance (ITSA) approach to automatically restrict shortcut-related information from being encoded into the feature representations. Specifically, our proposed method minimizes the sensitivity of latent features to input variations: to regularize the learning of robust and shortcut-invariant features in synthetically trained models. To avoid the prohibitive computational cost of direct input sensitivity optimization, we propose a practical yet feasible algorithm to achieve robustness. Our results show that the proposed method can effectively improve S2R generalization in multiple distinct dense prediction tasks, such as stereo matching, optical flow, and semantic segmentation. Importantly, the proposed method enhances the robustness of the synthetically trained networks and outperforms their fine-tuned counterparts (on real data) for challenging out-of-domain applications.
نوع الوثيقة: article in journal/newspaper
اللغة: unknown
Relation: 10779/rmit.27568650.v1
الاتاحة: https://figshare.com/articles/journal_contribution/An_Information-Theoretic_Method_to_Automatic_Shortcut_Avoidance_and_Domain_Generalization_for_Dense_Prediction_Tasks/27568650
Rights: All rights reserved
رقم الانضمام: edsbas.6BA5BC47
قاعدة البيانات: BASE