Enhancing Transfer Learning with Flexible Nonparametric Posterior Sampling
Hyungi Lee, Giung Nam, Edwin Fong, Juho Lee
arXiv.org Artificial Intelligence
Transfer learning has recently shown strong performance across various tasks involving deep neural networks. In these transfer learning scenarios, the prior distribution over downstream data becomes crucial in Bayesian model averaging (BMA). While previous works proposed priors over the neural network parameters centered around the pre-trained solution, such strategies have limitations when dealing with distribution shifts between the upstream and downstream data. This paper introduces nonparametric transfer learning (NPTL), a flexible posterior sampling method that addresses the distribution shift issue within the context of nonparametric learning. Nonparametric learning (NPL) is a recent approach that employs a nonparametric prior for posterior sampling and efficiently accounts for model misspecification, making it well suited to transfer learning scenarios in which the upstream and downstream tasks may exhibit distribution shift. Through extensive empirical validation, we demonstrate that our approach surpasses other baselines in BMA performance.

In Bayesian deep learning, we regard the parameters of a deep neural network as random variables. Instead of optimizing for a single point estimate of these parameters, this approach infers the posterior distribution of the parameters given the training data and a predefined parameter prior. Given the posterior, predictions are made through Bayesian model averaging (BMA): computing predictions from multiple parameter values and weighting them by their respective posterior densities. The success of Bayesian deep learning often depends on the choice of the prior distribution.
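To make the two ingredients above concrete, the following is a minimal sketch of (i) BMA as a Monte Carlo average over posterior parameter samples and (ii) an NPL-style posterior draw via Dirichlet reweighting of the training data (the weighted-likelihood-bootstrap idea that underlies NPL). Here `predict_fn` and `fit_weighted` are hypothetical placeholders for a model's forward pass and a weighted-loss training routine; the paper's specific NPL prior construction (e.g., how the prior is centered around the pre-trained solution) is not reproduced.

```python
import numpy as np

def bma_predict(posterior_params, predict_fn, x):
    # Monte Carlo Bayesian model averaging: with parameter samples drawn
    # (approximately) from the posterior, the BMA integral reduces to an
    # equal-weight average of the per-sample predictive distributions.
    probs = np.stack([predict_fn(theta, x) for theta in posterior_params])
    return probs.mean(axis=0)  # averaged predictive distribution over x

def npl_posterior_sample(x_train, y_train, fit_weighted, alpha=1.0, rng=None):
    # One NPL-style posterior draw: sample Dirichlet weights over the n
    # training points, then minimize the weighted empirical loss.
    # `fit_weighted` is a hypothetical routine returning
    # argmin_theta sum_i w_i * loss(theta; x_i, y_i).
    rng = np.random.default_rng() if rng is None else rng
    n = len(x_train)
    w = rng.dirichlet(np.full(n, alpha))  # random reweighting of the data
    return fit_weighted(x_train, y_train, w)
```

In this sketch, repeated calls to `npl_posterior_sample` would produce the set of parameter samples fed to `bma_predict`; the Dirichlet concentration `alpha` controls how far each reweighting departs from the empirical data distribution.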
Mar-11-2024