MedCalc-Bench: Evaluating Large Language Models for Medical Calculations