Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network
Vincent Hsiao, Mark Roberts, Laura M. Hiatt, George Konidaris, Dana Nau
arXiv.org Artificial Intelligence
A major challenge for reinforcement learning is automatically generating curricula that reduce training time or improve performance on a target task. We introduce Skill-Environment Bayesian Networks (SEBNs), which model a probabilistic relationship between a set of skills, a set of goals that relate to the reward structure, and a set of environment features in order to predict policy performance on (possibly unseen) tasks. We develop an algorithm that uses the estimates of agent success inferred from an SEBN to weight the possible next tasks by expected improvement. We evaluate the benefit of the resulting curriculum on three environments: a discrete gridworld, continuous control, and simulated robotics. The results show that curricula constructed using SEBNs frequently outperform the baselines.
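The task-weighting idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not say how expected improvement is computed, so the function `expected_improvement_proxy` below is a hypothetical stand-in (it simply favors tasks of intermediate predicted difficulty), and the dictionary `predicted_success` stands in for success estimates that an SEBN would infer. All names here are assumptions made for illustration.

```python
import random


def expected_improvement_proxy(p_success):
    # ASSUMPTION: hypothetical proxy, not taken from the paper.
    # It peaks at p_success = 0.5, i.e. tasks that are neither
    # already mastered nor currently hopeless.
    return p_success * (1.0 - p_success)


def next_task_weights(predicted_success):
    # predicted_success maps task name -> estimated success probability
    # (a stand-in for the estimates an SEBN would provide).
    scores = {task: expected_improvement_proxy(p)
              for task, p in predicted_success.items()}
    total = sum(scores.values())
    if total == 0.0:
        # Every task scored zero: fall back to a uniform distribution.
        return {task: 1.0 / len(scores) for task in scores}
    return {task: score / total for task, score in scores.items()}


def sample_next_task(predicted_success, rng=random):
    # Draw the next training task in proportion to its weight.
    weights = next_task_weights(predicted_success)
    tasks = list(weights)
    return rng.choices(tasks, weights=[weights[t] for t in tasks], k=1)[0]


# Hypothetical success estimates for three candidate tasks.
predicted_success = {"easy": 0.95, "medium": 0.5, "hard": 0.05}
weights = next_task_weights(predicted_success)
```

Under this proxy the "medium" task receives the largest weight, while the nearly solved and nearly impossible tasks are sampled less often. Any other improvement estimate could be swapped in for `expected_improvement_proxy` without changing the sampling step.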
Feb-21-2025
- Country:
  - Europe > United Kingdom
    - England > Cambridgeshire > Cambridge (0.04)
  - North America > United States
    - District of Columbia > Washington (0.04)
    - Maryland > Prince George's County
      - College Park (0.14)
    - Michigan > Wayne County
      - Detroit (0.04)
    - New York > New York County
      - New York City (0.04)
    - Rhode Island > Providence County
      - Providence (0.04)
- Genre:
  - Research Report > New Finding (0.66)
- Industry:
  - Education (1.00)
  - Leisure & Entertainment > Sports (0.46)