Log-Sum-Exponential Estimator for Off-Policy Evaluation and Learning