Dens-PU: PU Learning with Density-Based Positive Labeled Augmentation

Sevetlidis, Vasileios, Pavlidis, George, Mouroutsos, Spyridon, Gasteratos, Antonios

Mar-21-2023–arXiv.org Artificial Intelligence

Labeled data are often scarce and expensive to obtain in many real-world applications, making training machine-learning models a challenging task [1]. In traditional supervised learning, the goal is to train a model to predict the correct class label for every sample in a training dataset [2]. The training data consist of labeled examples associated with a known class label. Typically, the class distribution of labeled data is assumed to be representative of the class distribution of unlabeled ones. Prior knowledge of the labels makes it easy to train a model to accurately predict the class labels for unseen samples. As Figure 1 shows, in the case of PU learning, the class label is known only for data belonging to a single class; thus, for negative samples, the label is unknown [3]. The lack of this knowledge makes it impossible to effectively train a typical binary classification model to distinguish between positive and negative classes.

artificial intelligence, machine learning, unlabeled data, (16 more...)

arXiv.org Artificial Intelligence

Mar-21-2023

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America > United States
  - New York (0.04)
- Europe
  - Greece (0.04)
  - Finland (0.04)
  - North Macedonia > Skopje Statistical Region
    - Skopje Municipality > Skopje (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks (1.00)
  - Performance Analysis > Accuracy (0.46)