OmniSegmentor: AFlexible Multi-Modal Learning Framework for Semantic Segmentation

Neural Information Processing Systems 

Recent research on representation learning has proved the merits of multi-modal clues for robust semantic segmentation. Nevertheless, a flexible pretrain-andfinetune pipeline for multiple visual modalities remains unexplored. In this paper, we propose a novel multi-modal learning framework, termed OmniSegmentor.