Fast Offline Policy Optimization for Large Scale Recommendation