Program Synthesis Guided Reinforcement Learning for Partially Observed Environments

Jan-19-2025, 14:29:53 GMT–Neural Information Processing Systems

A key challenge for reinforcement learning is solving long-horizon planning problems. Recent work has leveraged programs to guide reinforcement learning in these settings. However, these approaches impose a high manual burden on the user since they must provide a guiding program for every new task. Partially observed environments further complicate the programming task because the program must implement a strategy that correctly, and ideally optimally, handles every possible configuration of the hidden regions of the environment. We propose a new approach, model predictive program synthesis (MPPS), that uses program synthesis to automatically generate the guiding programs.

new task, observed environment, program synthesis guided reinforcement learning, (1 more...)

Neural Information Processing Systems

Jan-19-2025, 14:29:53 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report > New Finding (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (0.93)
  - Representation & Reasoning > Logic & Formal Reasoning (0.88)