Verifiable Reinforcement Learning via Policy Extraction
Osbert Bastani, Yewen Pu, Armando Solar-Lezama
–Neural Information Processing Systems
Neural Information Processing Systems
Nov-18-2025, 23:28:01 GMT
- Country:
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- California (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Canada > Quebec
- North America
- Technology: