Review for NeurIPS paper: Dynamic Regret of Policy Optimization in Non-Stationary Environments

Open in new window