Large Language Models in Medical Term Classification and Unexpected Misalignment Between Response and Reasoning