Unlocking the Power of Open Set : A New Perspective for Open-set Noisy Label Learning

Wan, Wenhai, Wang, Xinrui, Xie, Mingkun, Huang, Shengjun, Chen, Songcan, Li, Shaoyuan

May-7-2023–arXiv.org Artificial Intelligence

Learning from noisy data has attracted much attention, where most methods focus on closed-set label noise. However, a more common scenario in the real world is the presence of both open-set and closed-set noise. Existing methods typically identify and handle these two types of label noise separately by designing a specific strategy for each type. However, in many real-world scenarios, it would be challenging to identify open-set examples, especially when the dataset has been severely corrupted. Unlike the previous works, we explore how models behave when faced open-set examples, and find that a part of open-set examples gradually get integrated into certain known classes, which is beneficial for the seperation among known classes. Motivated by the phenomenon, in this paper, we propose a novel two-step contrastive learning method called CECL, which aims to deal with both types of label noise by exploiting the useful information of open-set examples. Specifically, we incorporate some open-set examples into closed-set classes to enhance performance while treating others as delimiters to improve representative ability. Extensive experiments on synthetic and real-world datasets with diverse label noise demonstrate that CECL can outperform state-of-the-art methods.

machine learning, natural language, open-set example, (17 more...)

arXiv.org Artificial Intelligence

May-7-2023

arXiv.org PDF

Add feedback

Genre:
- Research Report > Promising Solution (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Machine Translation (0.68)
  - Machine Learning
    - Inductive Learning (0.70)
    - Neural Networks (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found