Active Learning for Regression by Inverse Distance Weighting

Bemporad, Alberto

arXiv.org Artificial Intelligence 

Active learning (AL) strategies are used in supervised learning to let the training algorithm "ask questions" [34], i.e., choose the feature vectors to query for the corresponding target value during the training phase, usually based on the model learned so far. The main aim of AL is to possibly reduce the number of training samples required to train the model, or in other words, to get a model of the same prediction quality with a smaller dataset. This is particularly useful when knowing the target value associated with a given combination of features is an expensive operation, for example, it may involve asking a human to "label" samples manually, running a costly and time-consuming laboratory experiment, or performing a complex computer simulation. AL methods are usually categorized in query synthesis (or population-based) methods, in which the feature vector to query can be chosen arbitrarily, pool-based sampling methods, in which the vector can only be chosen within a given finite set (or "pool") of unlabeled values, and selective-sampling methods, in which vectors are proposed in a streaming flow and the AL algorithm can only decide online whether to ask for the corresponding target or not [34]. Several approaches to AL are available in the literature, see, e.g., the survey papers [1, 16,22,34,39]. Most of the literature focuses on classification problems [1,33], although AL has been investigated also for regression [9-13,25,27,38,41,42].

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found