Learning to Synthesize Programs as Interpretable and Generalizable Policies
–Neural Information Processing Systems
Recently, deep reinforcement learning (DRL) methods have achieved impressive performance on tasks in a variety of domains. However, neural network policies produced with DRL methods are not human-interpretable and often have difficulty generalizing to novel scenarios. To address these issues, prior works explore learning programmatic policies that are more interpretable and structured for generalization. Yet, these works either employ limited policy representations (e.g.
Neural Information Processing Systems
Dec-24-2025, 23:13:31 GMT
- Technology: