Post-Transfer Learning Statistical Inference in High-Dimensional Regression
Tam, Nguyen Vu Khai, My, Cao Huyen, Duy, Vo Nguyen Le
Transfer learning (TL) for high-dimensional regression (HDR) is an important problem in machine learning, particularly when dealing with limited sample size in the target task. However, there currently lacks a method to quantify the statistical significance of the relationship between features and the response in TL-HDR settings. In this paper, we introduce a novel statistical inference framework for assessing the reliability of feature selection in TL-HDR, called PTL-SI (Post-TL Statistical Inference). The core contribution of PTL-SI is its ability to provide valid $p$-values to features selected in TL-HDR, thereby rigorously controlling the false positive rate (FPR) at desired significance level $\alpha$ (e.g., 0.05). Furthermore, we enhance statistical power by incorporating a strategic divide-and-conquer approach into our framework. We demonstrate the validity and effectiveness of the proposed PTL-SI through extensive experiments on both synthetic and real-world high-dimensional datasets, confirming its theoretical properties and utility in testing the reliability of feature selection in TL scenarios.
Apr-25-2025
- Country:
- Asia
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Vietnam > Hồ Chí Minh City
- Hồ Chí Minh City (0.04)
- Japan > Honshū
- North America > United States
- California (0.04)
- Florida (0.04)
- Massachusetts (0.04)
- New Jersey (0.04)
- Ohio (0.04)
- Pennsylvania (0.04)
- Texas (0.04)
- Asia
- Genre:
- Research Report > Experimental Study (0.51)
- Industry:
- Health & Medicine (0.68)
- Technology: