Absolute convergence and error thresholds in non-active adaptive sampling
Ferro, Manuel Vilares, Bilbao, Victor M. Darriba, Ferro, Jesús Vilares
–arXiv.org Artificial Intelligence
In this sense, the operating principle for adaptive sampling is simple and involves beginning with an initial number of examples and then iteratively learning the model, evaluating it and acquiring additional observations if necessary. Accordingly, there are two questions to be considered: it is necessary to determine the training data to be acquired at each cycle, and also to define a halting condition to terminate the loop once a certain degree of performance has been achieved by the learner. Both tasks confer the character of research issues to the formalization of scheduling and stopping criteria (John and Langley, 1996), respectively. The former has been researched for decades in terms of fixed (John and Langley, 1996; Provost et al., 1999) or adaptive (Provost et al., 1999) sequencing, and it is not our objective. As regards the halting criteria, they are independent of the scheduling and mostly start from the hypothesis that learning curves are wellbehaved, including an initial steeply sloping portion, a more gently sloping middle one and a final balanced zone (Meek et al., 2002). Accordingly, the purpose is to identify the moment in which such a curve reaches a plateau, namely when adding more data instances does not improve the accuracy, although this often does not strictly verify. Instead, extra learning efforts almost always result in modest increases. This justifies the interest in having a proximity condition, understood as a measure of the degree of convergence attained from a given iteration, rather than a stopping one. In short, this will make it possible to select the level of reliability in predicting a learner's performance, both in terms of accuracy and computational costs.
arXiv.org Artificial Intelligence
Feb-4-2024
- Country:
- Asia
- India > NCT
- New Delhi (0.04)
- South Korea (0.04)
- India > NCT
- Europe
- Denmark > Capital Region
- Copenhagen (0.04)
- Germany > Baden-Württemberg
- Freiburg (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Spain > Galicia
- A Coruña Province > A Coruña (0.04)
- Ourense Province > Ourense (0.04)
- Denmark > Capital Region
- North America > United States
- California > San Diego County
- San Diego (0.04)
- Florida > Broward County
- Fort Lauderdale (0.04)
- California > San Diego County
- Asia
- Genre:
- Research Report (0.50)
- Technology: