Goto

Collaborating Authors

 log 2


Black-Box Differential Privacy for Interactive ML

Neural Information Processing Systems

We show that any (possibly non-private) learning rule can be effectively transformed to a private learning rule with only a polynomial overhead in the mistake bound.










Provably Safe Reinforcement Learning with Step-wise Violation Constraints

Neural Information Processing Systems

We name this problem Safe-RL-SW . Our step-wise violation constraint differs from prior expected violation constraint (Wachi & Sui, 2020; Efroni et al., 2020b; Kalagarla et al., 2021) in two aspects: (i) Minimizing the step-wise violation enables the agent to learn an optimal policy that avoids unsafe regions deterministically,