Verifiable Reinforcement Learning via Policy Extraction
Osbert Bastani, Yewen Pu, Armando Solar-Lezama
–Neural Information Processing Systems
Neural Information Processing Systems
May-26-2025, 10:43:24 GMT
Osbert Bastani, Yewen Pu, Armando Solar-Lezama
–Neural Information Processing Systems
Neural Information Processing Systems
May-26-2025, 10:43:24 GMT