Rethinking Evaluation Metric for Probability Estimation Models Using Esports Data