Reinforced In-Context Black-Box Optimization
Song, Lei, Gao, Chenxiao, Xue, Ke, Wu, Chenyang, Li, Dong, Hao, Jianye, Zhang, Zongzhang, Qian, Chao
–arXiv.org Artificial Intelligence
Black-Box Optimization (BBO) has found successful applications in many fields of science and engineering. Recently, there has been a growing interest in meta-learning particular components of BBO algorithms to speed up optimization and get rid of tedious hand-crafted heuristics. As an extension, learning the entire algorithm from data requires the least labor from experts and can provide the most flexibility. In this paper, we propose RIBBO, a method to reinforce-learn a BBO algorithm from offline data in an end-to-end fashion. RIBBO employs expressive sequence models to learn the optimization histories produced by multiple behavior algorithms and tasks, leveraging the in-context learning ability of large models to extract task information and make decisions accordingly. Central to our method is to augment the optimization histories with \textit{regret-to-go} tokens, which are designed to represent the performance of an algorithm based on cumulative regret over the future part of the histories. The integration of regret-to-go tokens enables RIBBO to automatically generate sequences of query points that satisfy the user-desired regret, which is verified by its universally good empirical performance on diverse problems, including BBO benchmark functions, hyper-parameter optimization and robot control problems.
arXiv.org Artificial Intelligence
Jul-4-2024
- Country:
- Africa
- Ethiopia > Addis Ababa
- Addis Ababa (0.04)
- Rwanda > Kigali
- Kigali (0.04)
- Ethiopia > Addis Ababa
- Asia > China
- Jiangsu Province > Nanjing (0.04)
- Tianjin Province > Tianjin (0.04)
- Europe
- Germany > Bavaria
- Lower Franconia > Würzburg (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Spain > Andalusia
- Granada Province > Granada (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Germany > Bavaria
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- United States
- Arizona > Maricopa County
- Phoenix (0.04)
- District of Columbia > Washington (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland > Baltimore (0.04)
- Texas > Travis County
- Austin (0.04)
- Arizona > Maricopa County
- Canada
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Africa
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Transportation > Air (0.61)
- Technology: