TFANet: Three-Stage Image-Text Feature Alignment Network for Robust Referring Image Segmentation
