Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking
Urteaga, Iñigo, Draïdia, Moulay-Zaïdane, Lancewicki, Tomer, Khadivi, Shahram
–arXiv.org Artificial Intelligence
We design and evaluate a Bayesian optimization framework for resource efficient pre-training of Transformer-based language models (TLMs). TLM pre-training requires high computational resources and introduces many unresolved design choices, such as selecting its pre-training hyperparameters. We propose a multi-armed bandit framework for the sequential selection of TLM pre-training hyperparameters, aimed at optimizing language model performance, in a resource efficient manner. We design a Thompson sampling algorithm, with a surrogate Gaussian process reward model of the Masked Language Model (MLM) pre-training objective, for its sequential minimization. Instead of MLM pre-training with fixed masking probabilities, the proposed Gaussian process-based Thompson sampling (GP-TS) accelerates pre-training by sequentially selecting masking hyperparameters that improve performance. We empirically demonstrate how GP-TS pre-trains language models efficiently, i.e., it achieves lower MLM loss in fewer epochs, across a variety of settings. In addition, GP-TS pre-trained TLMs attain competitive downstream performance, while avoiding expensive hyperparameter grid search. GP-TS provides an interactive framework for efficient and optimized TLM pre-training that, by circumventing costly hyperparameter selection, enables substantial computational savings.
arXiv.org Artificial Intelligence
May-30-2023
- Country:
- North America > United States
- New York > New York County
- New York City (0.04)
- Florida > Broward County
- Fort Lauderdale (0.04)
- California > Santa Clara County
- San Jose (0.04)
- New York > New York County
- Europe
- Italy > Sardinia (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Spain
- Canary Islands (0.04)
- Basque Country > Biscay Province
- Bilbao (0.04)
- Germany > North Rhine-Westphalia
- Cologne Region > Aachen (0.04)
- France > Hauts-de-France
- Asia > Middle East
- Israel (0.04)
- North America > United States
- Genre:
- Research Report (1.00)
- Instructional Material > Course Syllabus & Notes (0.46)
- Industry:
- Technology: