Active Offline Policy Selection

Neural Information Processing Systems 

Several off-policy evaluation (OPE) techniques have been proposed to assess the value of policies using only logged data.
