K-Net: Towards Unified Image Segmentation

Oct-10-2024, 11:21:31 GMT–Neural Information Processing Systems

Semantic, instance, and panoptic segmentations have been addressed using different and specialized frameworks despite their underlying connections. This paper presents a unified, simple, and effective framework for these essentially similar tasks. The framework, named K-Net, segments both instances and semantic categories consistently with a group of learnable kernels, where each kernel is responsible for generating a mask for either a potential instance or a stuff class. To remedy the difficulty of distinguishing various instances, we propose a kernel update strategy that makes each kernel dynamic and conditional on its meaningful group in the input image. K-Net can be trained in an end-to-end manner with bipartite matching, and its training and inference are naturally NMS-free and box-free.
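The two mechanisms named in the abstract, one mask per kernel and one-to-one matching of masks to targets, can be illustrated with a small pure-Python sketch. This is not the authors' implementation. It assumes each kernel acts as a 1x1 filter over a feature map followed by a sigmoid, uses a Dice-style cost chosen here for illustration, and finds the assignment by brute force over permutations, which is only practical for toy sizes. The function names `predict_masks`, `dice_cost`, and `match` are hypothetical.

```python
import math
from itertools import permutations


def predict_masks(kernels, features):
    """Return one H x W soft mask per kernel.

    kernels:  N lists of C weights (assumed to act as 1x1 filters).
    features: C x H x W nested lists.
    """
    channels = len(features)
    height, width = len(features[0]), len(features[0][0])
    masks = []
    for kernel in kernels:
        mask = [
            [
                # Dot product of the kernel with the pixel's feature vector,
                # squashed to (0, 1) by a sigmoid (an assumption of this sketch).
                1.0 / (1.0 + math.exp(
                    -sum(kernel[c] * features[c][y][x] for c in range(channels))
                ))
                for x in range(width)
            ]
            for y in range(height)
        ]
        masks.append(mask)
    return masks


def dice_cost(pred, target, eps=1e-6):
    """Illustrative matching cost: 1 - Dice overlap between two masks."""
    inter = sum(p * t for pr, tr in zip(pred, target) for p, t in zip(pr, tr))
    total = sum(map(sum, pred)) + sum(map(sum, target))
    return 1.0 - 2.0 * inter / (total + eps)


def match(masks, targets):
    """Assign each target to a distinct kernel mask with minimum total cost.

    Brute force over permutations; assumes len(masks) >= len(targets).
    Returns sorted (kernel_index, target_index) pairs.
    """
    best_cost, best_pairs = float("inf"), []
    for perm in permutations(range(len(masks)), len(targets)):
        cost = sum(dice_cost(masks[k], targets[g]) for g, k in enumerate(perm))
        if cost < best_cost:
            best_cost = cost
            best_pairs = sorted((k, g) for g, k in enumerate(perm))
    return best_pairs


# Toy input: 2 channels on a 2x2 grid. Channel 0 fires on the left column,
# channel 1 on the right column.
features = [
    [[1.0, 0.0], [1.0, 0.0]],
    [[0.0, 1.0], [0.0, 1.0]],
]
kernels = [[4.0, -4.0], [-4.0, 4.0]]  # kernel 0 prefers left, kernel 1 prefers right
targets = [
    [[0.0, 1.0], [0.0, 1.0]],  # target 0: right column
    [[1.0, 0.0], [1.0, 0.0]],  # target 1: left column
]

masks = predict_masks(kernels, features)
pairs = match(masks, targets)
```

Because every target is matched to exactly one kernel, no two kernels are trained toward the same mask, which is why this style of training needs neither box proposals nor non-maximum suppression to remove duplicates.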

k-net, panoptic segmentation, unified image segmentation

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (0.74)
  - Artificial Intelligence (0.67)