Provable and Efficient Dataset Distillation for Kernel Ridge Regression

May-27-2025, 11:03:22 GMT–Neural Information Processing Systems

Deep learning models are now trained on increasingly large datasets, making it crucial to reduce computational costs and improve data quality. Dataset distillation aims to distill a large dataset into a small synthesized dataset so that models trained on it achieve performance similar to that of models trained on the original dataset. While there have been many empirical efforts to improve dataset distillation algorithms, a thorough theoretical analysis and provable, efficient algorithms are still lacking. In this paper, by focusing on dataset distillation for kernel ridge regression (KRR), we show that one data point per class is already necessary and sufficient to recover the original model's performance in many settings. For linear ridge regression and KRR with surjective feature mappings, we provide necessary and sufficient conditions for the distilled dataset to recover the original model's parameters.
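To make the linear ridge regression claim concrete, the following is a minimal sketch of one way a dataset with one point per class can reproduce the original parameters exactly. It is an illustrative construction, not the paper's algorithm: it assumes the same regularization strength `lam` is used on both datasets, uses the closed-form solution W = (XᵀX + λI)⁻¹XᵀY, and sets the distilled inputs to `X_s = W.T` and the distilled targets to `Y_s = W.T @ W + lam * I`. The helper names `ridge_fit` and `distill_linear_ridge` and the synthetic data are hypothetical.

```python
import numpy as np

def ridge_fit(X, Y, lam):
    """Closed-form ridge regression: W = (X^T X + lam I)^{-1} X^T Y."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ Y)

def distill_linear_ridge(X, Y, lam):
    """Illustrative construction (an assumption, not the paper's method).

    With X_s = W^T and Y_s = W^T W + lam I, the push-through identity
    (X_s^T X_s + lam I)^{-1} X_s^T = X_s^T (X_s X_s^T + lam I)^{-1}
    gives ridge_fit(X_s, Y_s, lam) = X_s^T = W.
    """
    W = ridge_fit(X, Y, lam)                      # shape (d, k)
    k = W.shape[1]
    X_s = W.T                                     # k synthetic inputs in R^d
    Y_s = W.T @ W + lam * np.eye(k)               # k synthetic (soft) targets
    return X_s, Y_s

# Synthetic stand-in for a large labeled dataset with k classes.
rng = np.random.default_rng(0)
n, d, k = 500, 20, 3
X = rng.normal(size=(n, d))
labels = rng.integers(0, k, size=n)
Y = np.eye(k)[labels]                             # one-hot targets
lam = 0.1

W = ridge_fit(X, Y, lam)                          # trained on n = 500 points
X_s, Y_s = distill_linear_ridge(X, Y, lam)
W_s = ridge_fit(X_s, Y_s, lam)                    # trained on k = 3 points

print(X_s.shape, np.allclose(W, W_s))
```

Note that the distilled targets here are real-valued rather than one-hot, so this sketch says nothing about distilled sets constrained to realistic inputs or labels, nor about the KRR case with nonlinear feature maps.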

dataset distillation algorithm, kernel ridge regression, provable and efficient dataset distillation
