Provable Target Sample Complexity Improvements as Pre-Trained Models Scale
Kazuto Fukuchi, Ryuichiro Hataya, Kota Matsui
Pre-trained models have become indispensable for efficiently building models across a broad spectrum of downstream tasks. Their advantages have been highlighted by empirical studies of scaling laws, which demonstrate that larger pre-trained models can significantly reduce the sample complexity of downstream learning. Existing theoretical investigations of pre-trained models, however, cannot explain this phenomenon. In this paper, we provide a theoretical investigation by introducing a novel framework, caulking, inspired by parameter-efficient fine-tuning (PEFT) methods such as adapter-based fine-tuning, low-rank adaptation, and partial fine-tuning. Our analysis establishes that improved pre-trained models provably decrease the sample complexity of downstream tasks, thereby offering theoretical justification for the empirically observed scaling laws relating pre-trained model size to downstream performance, a relationship not covered by existing results.
February 5, 2026
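The abstract refers to parameter-efficient fine-tuning (PEFT) methods such as low-rank adaptation (LoRA), in which a frozen pre-trained weight matrix is adjusted only through a trainable low-rank update. As a rough illustration of that idea only (the paper's caulking framework is not specified here), below is a minimal PyTorch sketch; the class name `LoRALinear` and all hyperparameters are illustrative assumptions, not the authors' code.

```python
import torch
import torch.nn as nn


class LoRALinear(nn.Module):
    """Minimal low-rank adaptation of a frozen linear layer (illustrative sketch).

    The pre-trained weight is kept frozen; only the low-rank update
    B @ A (rank r << min(d_in, d_out)) is trained, so the number of
    trainable parameters is r * (d_in + d_out) instead of d_in * d_out.
    """

    def __init__(self, base: nn.Linear, rank: int = 4, alpha: float = 1.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():  # freeze the pre-trained weights
            p.requires_grad = False
        d_in, d_out = base.in_features, base.out_features
        # A starts small and random, B starts at zero, so the adapted
        # model initially coincides with the pre-trained model.
        self.lora_A = nn.Parameter(torch.randn(rank, d_in) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(d_out, rank))
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Frozen pre-trained path plus the trainable low-rank correction.
        return self.base(x) + self.scale * (x @ self.lora_A.T @ self.lora_B.T)


if __name__ == "__main__":
    layer = LoRALinear(nn.Linear(128, 64), rank=4)
    out = layer(torch.randn(2, 128))
    trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
    print(out.shape, trainable)  # torch.Size([2, 64]) 768
```

Initializing `lora_B` at zero is the standard LoRA convention: fine-tuning starts exactly from the pre-trained model and only the small number of adapter parameters is learned from downstream data, which is the regime the paper's sample-complexity analysis targets.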
- Country:
  - Asia > Japan
    - Honshū
      - Kansai > Kyoto Prefecture
        - Kyoto (0.04)
      - Kantō
        - Ibaraki Prefecture > Tsukuba (0.04)
        - Tokyo Metropolis Prefecture > Tokyo (0.14)
- Genre:
  - Research Report > New Finding (0.67)