Situated and Interactive Multimodal Conversations