Superhuman performance of a large language model on the reasoning tasks of a physician