Few Shot Semantic Segmentation: a review of methodologies and open challenges

Catalano, Nico, Matteucci, Matteo

arXiv.org Artificial Intelligence 

Many surveys and reviews like [22, 23, 39] describe semantic segmentation as the Computer Vision (CV) task of predicting a category label at the pixel level. It builds upon simpler vision tasks such as image classification and object detection, and also shares some similarities with more advanced challenges like parts segmentation, instance segmentation, and panoptic segmentation. A visual comparison between the related Computer Vision (CV) tasks is reported in Figure 1. Image classification aims at understanding the overall scene in an image by giving it one or more labels, while object detection (Figure 1b) focuses on predicting the location of one or more objects in an image usually providing bounding boxes. Pixel-level prediction tasks like parts segmentation (Figure 1d) is a closer problem to semantic segmentation (Figure 1c), as it aims at predicting pixel-level segmentation masks covering the parts that compose the intended subject, such as face parts like the chin, nose and eyes. Instance segmentation (Figure 1e) aims to distinguish individual objects in an image, even if they are of the same kind, but does not necessarily assign them a category. Finally, panoptic segmentation (Figure 1f) combines semantic segmentation with instance segmentation, predicting the pixel-level category and distinguishing each object in the scene. Overall, we can place semantic segmentation as a midpoint on a spectrum of image understanding tasks ranging from coarse to fine.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found