Breaking Determinism: Stochastic Modeling for Reliable Off-Policy Evaluation in Ad Auctions