$\Delta\text{-}{\rm OPE}$: Off-Policy Estimation with Pairs of Policies
