Transfer Learning with Kernel Methods
Radhakrishnan, Adityanarayanan, Luyten, Max Ruiz, Prasad, Neha, Uhler, Caroline
–arXiv.org Artificial Intelligence
Transfer learning refers to the machine learning problem of utilizing knowledge from a source task to improve performance on a target task. Recent approaches to transfer learning have achieved tremendous empirical success in many applications including in computer vision [17, 45], natural language processing [16, 40, 43], and the biomedical field [15, 19]. Since transfer learning approaches generally rely on complex deep neural networks, it can be difficult to characterize when and why they work [44]. Kernel methods [46] are conceptually and computationally simple machine learning models that have been found to be competitive with neural networks on a variety of tasks including image classification [3, 29, 42] and drug screening [42]. Their simplicity stems from the fact that training a kernel method involves performing linear regression after transforming the data. There has been renewed interest in kernels due to a recently established equivalence between wide neural networks and kernel methods [2, 25], which has led to the development of modern, neural tangent kernels (NTKs) that are competitive with neural networks. Given their simplicity and effectiveness, kernel methods could provide a powerful approach for transfer learning and also help characterize when transfer learning between a source and target task would be beneficial. However, developing an algorithm for transfer learning with kernel methods for general source and target tasks has been an open problem. In particular, while there is a standard transfer learning approach for neural networks that involves replacing and re-training the last layer of a pre-trained network, there is no known corresponding operation for kernels.
arXiv.org Artificial Intelligence
Oct-31-2022
- Country:
- North America
- Canada > Ontario
- Toronto (0.14)
- United States > Massachusetts (0.04)
- Canada > Ontario
- North America
- Genre:
- Research Report (0.82)
- Industry:
- Technology:
- Information Technology > Artificial Intelligence > Machine Learning
- Kernel Methods (1.00)
- Neural Networks > Deep Learning (0.66)
- Transfer Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning