CoupAlign: Coupling Word-PixelwithSentence-Mask AlignmentsforReferringImageSegmentation

Neural Information Processing Systems 

In our experiment, we use four WPA modules, two of which are in the early encoding stage and theother twoareinthelateencoding stage.