Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL

Oct-11-2024, 13:06:08 GMT–Neural Information Processing Systems

Despite a series of recent successes in reinforcement learning (RL), many RL algorithms remain sensitive to hyperparameters. As such, there has recently been interest in the field of AutoRL, which seeks to automate design decisions to create more general algorithms. Recent work suggests that population-based approaches may be effective AutoRL algorithms, by learning hyperparameter schedules on the fly. In particular, the PB2 algorithm is able to achieve strong performance in RL tasks by formulating online hyperparameter optimization as a time-varying GP-bandit problem, while also providing theoretical guarantees. However, PB2 is only designed to work for continuous hyperparameters, which severely limits its utility in practice.

autorl, efficient population, input hyperparameter, (3 more...)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.44)
  - Artificial Intelligence > Machine Learning
    - Reinforcement Learning (0.64)