Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits