SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP

Open in new window