LLMs and their Limited Theory of Mind: Evaluating Mental State Annotations in Situated Dialogue