Goto

Collaborating Authors

 Education


MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models

Neural Information Processing Systems

Experiments of pretraining 410M and 1B models on the C4 dataset demonstrate that MA TES significantly outperforms random data selection on extensive downstream tasks. It doubles the gains achieved by the state-of-the-art data selection approach that leverages larger reference models and reduces the total FLOPs required to reach certain performances by half. Further analyses validate the effectiveness of the locally probed oracle data influence and the approximation with data influence models. Our code is open-sourced at https://github.com/cxcscmu/MA


Transformation-Invariant Learning and Theoretical Guarantees for OOD Generalization

Neural Information Processing Systems

Much remains to be understood, however, in statistical learning under distribution shifts. This paper focuses on a distribution shift setting where train and test distributions can be related by classes of (data) transformation maps.



Memorize WhatMatters: EmergentSceneDecompositionfromMultitraverse

Neural Information Processing Systems

Morespecifically,3DGM formulates multitraverse environmental mapping as a robust 3D representation learning problem, treating pixels of the environment and objects as inliers and outliers, respectively.