$K$-Nearest-Neighbor Resampling for Off-Policy Evaluation in Stochastic Control