Pool-Based Sequential Active Learning for Regression

May-12-2018–arXiv.org Machine Learning

Active learning (AL) [33], a subfield of machine learning, considers the following problem: if the learning algorithm can choose the training data, then which training samples should it choose to maximize the learning performance, under a fixed budget, e.g., the maximum number of labeled training samples? As an example, consider emotion estimation in affective computing [28]. Emotions can be represented as continuous numbers in the 2D space of arousal and valence [30], or in the 3D space of arousal, valence, and dominance [26]. However, emotions are very subjective, subtle, and uncertain. So, usually multiple human assessors are needed to obtain the groundtruth emotion values for each affective sample (video, audio, image, physiological signal, etc). For example, 14-16 assessors were used to evaluate each video clip in the DEAP dataset [21], six to 17 assessors for each utterance in the VAM (Vera am Mittag in German, Vera at Noon in English) spontaneous speech corpus [16], and at least 110 assessors for each sound in the IADS-2 (International Affective Digitized Sounds 2nd Edition) dataset [4]. This is very time-consuming and labor-intensive. How should we optimally select the affective samples to label so that an accurate regression model can be built with the minimum cost (i.e., the minimum number of labeled samples)?

alr approach, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

May-12-2018

arXiv.org PDF

Add feedback

Country:
- Europe (1.00)
- Asia (0.68)
- North America > United States
  - Wisconsin (0.28)
  - Florida (0.28)

Genre:
- Research Report > New Finding (0.67)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.91)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found