Multimodal Fusion and Coherence Modeling for Video Topic Segmentation

Open in new window