Imitation-Projected Programmatic Reinforcement Learning
Abhinav Verma, Hoang Le, Yisong Yue, Swarat Chaudhuri
–Neural Information Processing Systems
We study the problem of programmatic reinforcement learning, in which policies are represented as short programs in a symbolic language. Programmatic policies can be more interpretable, generalizable, and amenable to formal verification than neural policies; however, designing rigorous learning approaches for such policies remains a challenge.
Neural Information Processing Systems
Jan-23-2025, 23:48:14 GMT
- Country:
- Europe (1.00)
- North America > United States (1.00)
- Genre:
- Research Report (0.46)
- Industry:
- Education (0.46)
- Leisure & Entertainment > Sports (0.46)
- Technology: