Loss-Calibrated Monte Carlo Action Selection