Provably Optimal Reinforcement Learning under Safety Filtering
