Learning Classifiers on Positive and Unlabeled Data with Policy Gradient
Li, Tianyu, Wang, Chien-Chih, Ma, Yukun, Ortal, Patricia, Zhao, Qifang, Stenger, Bjorn, Hirate, Yu
--Existing algorithms aiming to learn a binary classifier from positive (P) and unlabeled (U) data require estimating the class prior or label noise ahead of building a classification model. However, the estimation and classifier learning are normally conducted in a pipeline instead of being jointly optimized. In this paper, we propose to alternatively train the two steps using reinforcement learning. Our proposal adopts a policy network to adaptively make assumptions on the labels of unlabeled data, while a classifier is built upon the output of the policy network and provides rewards to learn a better policy. The dynamic and interactive training between the policy maker and the classifier can exploit the unlabeled data in a more effective manner and yield a significant improvement in terms of classification performance. Furthermore, we present two different approaches to represent the actions taken by the policy. The first approach considers continuous actions as soft labels, while the other uses discrete actions as hard assignment of labels for unlabeled examples. We validate the effectiveness of the proposed method on two public benchmark datasets as well as one e-commerce dataset. The results show that the proposed method is able to consistently outperform state-of-the-art methods in various settings. PU learning refers to the problem of learning from a dataset where only a subset of examples are positively labeled and the rest are not annotated at all. It is a critical task due to its prevalence in various real-world applications [1], [2], [3]. In many common situations only positive data are available, for instance, an e-commerce website may only record users who have clicked on advertisements or purchased items. Meanwhile, it is not possible to simply assume that unlabeled instances are negative.
Oct-15-2019
- Country:
- Europe > Spain
- Catalonia (0.28)
- North America > United States
- California (0.28)
- Europe > Spain
- Genre:
- Research Report > New Finding (0.48)
- Industry:
- Education (1.00)
- Information Technology > Services
- e-Commerce Services (1.00)
- Technology: