Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning

Oct-11-2024, 05:26:01 GMT–Neural Information Processing Systems

Reinforcement learning (RL) is a general framework for modeling sequential decision making problems, at the core of which lies the dilemma of exploitation and exploration. An agent failing to explore systematically will inevitably fail to learn efficiently. Optimism in the face of uncertainty (OFU) is a conventionally successful strategy for efficient exploration. An agent following the OFU principle explores actively and efficiently. However, when applied to model-based RL, it involves specifying a confidence set of the underlying model and solving a series of nonlinear constrained optimization, which can be computationally intractable.

bayesian optimistic optimization, model-based reinforcement learning, optimistic exploration, (3 more...)

Neural Information Processing Systems

Oct-11-2024, 05:26:01 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)