Counterexample Guided RL Policy Refinement Using Bayesian Optimization
Neural Information Processing Systems
Constructing Reinforcement Learning (RL) policies that adhere to safety requirements is an emerging field of study.
Aug-17-2025, 04:44:31 GMT
- Country:
- North America > United States
- Montana (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Asia > India
- West Bengal > Kharagpur (0.04)
- Industry:
- Transportation (0.46)
- Technology: