SemanticScanpath: Combining Gaze and Speech for Situated Human-Robot Interaction Using LLMs