An information-Theoretic Approach to Semi-supervised Transfer Learning
Jakubovitz, Daniel, Uliel, David, Rodrigues, Miguel, Giryes, Raja
–arXiv.org Artificial Intelligence
Abstract--Transfer learning is a valuable tool in deep learning as it allows propagating information from one "source dataset" to another "target dataset", especially in the case of a small number of training examples in the latter. Yet, discrepancies between the underlying distributions of the source and target data are commonplace and are known to have a substantial impact on algorithm performance. In this work we suggest novel information-theoretic approaches for the analysis of the performance of deep neural networks in the context of transfer learning. We focus on the task of semi-supervised transfer learning, in which unlabeled samples from the target dataset are available during network training on the source dataset. Our theory suggests that one may improve the transferability of a deep neural network by incorporating regularization terms on the target data based on information-theoretic quantities, namely the Mutual Information and the Lautum Information. We demonstrate the effectiveness of the proposed approaches in various semi-supervised transfer learning experiments.
arXiv.org Artificial Intelligence
Jun-11-2023
- Country:
- North America
- United States
- Wisconsin > Dane County
- Madison (0.04)
- Washington > King County
- Seattle (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Wisconsin > Dane County
- Canada > Quebec
- Montreal (0.04)
- United States
- Europe
- United Kingdom > England
- Greater London > London (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- United Kingdom > England
- Asia
- China (0.04)
- Middle East > Israel
- Tel Aviv District > Tel Aviv (0.04)
- Jerusalem District > Jerusalem (0.04)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- North America
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Education (0.92)
- Technology: