Hypothesis Testing for Quantifying LLM-Human Misalignment in Multiple Choice Settings