Practical Improvements of A/B Testing with Off-Policy Estimation