Learning to Synthesize Programs as Interpretable and Generalizable Policies

Dec-24-2025, 23:13:31 GMT–Neural Information Processing Systems

Recently, deep reinforcement learning (DRL) methods have achieved impressive performance on tasks in a variety of domains. However, neural network policies produced with DRL methods are not human-interpretable and often have difficulty generalizing to novel scenarios. To address these issues, prior works explore learning programmatic policies that are more interpretable and structured for generalization. Yet, these works either employ limited policy representations (e.g.

interpretable and generalizable policy, name change, synthesize program, (2 more...)

Neural Information Processing Systems

Dec-24-2025, 23:13:31 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.59)