Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL