Large Language Models for Multi-Modal Human-Robot Interaction