Deep multi-class learning from label proportions

Dulac-Arnold, Gabriel, Zeghidour, Neil, Cuturi, Marco, Beyer, Lucas, Vert, Jean-Philippe

May-30-2019–arXiv.org Machine Learning

The standard setting of supervised classification in machine learning assumes that we have access to a training set of samples and to their labels; our goal is then to estimate a classifier able to predict the label of new samples. In many real-world situations, however, collecting training sets of labeled examples is not possible, and alternative learning scenarios must be considered. We focus in this paper on a particular setting where one has access to bags of examples, and where for each bag only the proportions of the labels in the bag are available; the task is still to learn a classifier to predict the label of individual samples. This setting, which following Yu et al. [2013] we refer to as learning from label proportions (LLP), is relevant in many situations where labeling of individual samples is time-consuming, difficult, or just not possible, while side-channel information can be used to reconstruct the proportions of label within a given bag. For example, Musicant et al. [2007] explain how LLP is a natural setting to analyze single particle mass spectrometry data, while Quadrianto et al. [2009] discuss applications in e-commerce, politics or spam filtering.

artificial intelligence, label proportion, machine learning, (15 more...)

arXiv.org Machine Learning

May-30-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - District of Columbia > Washington (0.04)
  - New York > New York County
    - New York City (0.05)
  - Nebraska > Douglas County
    - Omaha (0.04)
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
- Europe
  - United Kingdom
    - Scotland > City of Edinburgh
      - Edinburgh (0.04)
    - England > Cambridgeshire
      - Cambridge (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)

Genre:
- Research Report (0.50)

Industry:
- Health & Medicine (0.68)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (1.00)
  - Neural Networks (1.00)
  - Learning Graphical Models > Directed Networks
    - Bayesian Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found