Off-Policy Evaluation of Ranking Policies under Diverse User Behavior