Multimodal Fusion with LLMs for Engagement Prediction in Natural Conversation

Open in new window