Evaluating Large Language Models on Rare Disease Diagnosis: A Case Study using House M.D