ν-Arc: Ensemble Learning in the Presence of Outliers
Rätsch, Gunnar, Schölkopf, Bernhard, Smola, Alex J., Müller, Klaus-Robert, Onoda, Takashi, Mika, Sebastian
Neural Information Processing Systems
The idea of a large minimum margin [17] explains the good generalization performance of AdaBoost in the low noise regime. However, AdaBoost performs worse on noisy tasks [10, 11], such as the iris and the breast cancer benchmark data sets [1]. On the latter tasks, a large margin on all training points cannot be achieved without adverse effects on the generalization error. This experimental observation was supported by the study of [13], where the generalization error of ensemble methods was bounded by the fraction of training points with margin smaller than some value ρ, plus a complexity term depending on the base hypotheses and ρ. While this bound can only capture part of what is going on in practice, it nevertheless conveys the message that in some cases it pays to allow some points to have a small margin, or to be misclassified, if this leads to a larger overall margin on the remaining points. To cope with this problem, it was necessary to construct regularized variants of AdaBoost, which traded off the number of margin errors against the size of the margin.
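In symbols, the bound of [13] is roughly of the form P[y f(x) ≤ 0] ≤ P̂[y f(x) ≤ ρ] + Õ(sqrt(d / (m ρ²))), where m is the number of training points and d measures the complexity of the base hypothesis class. As a concrete illustration of the empirical term in this bound, the following minimal Python sketch fits plain AdaBoost with decision stumps (not the paper's ν-Arc algorithm) and measures the fraction of training points whose margin is at most ρ; the noisy synthetic data and the thresholds are illustrative assumptions, and scikit-learn ≥ 1.2 is assumed for the `estimator` keyword:

    # Margin distribution of an AdaBoost ensemble: a minimal sketch, not the
    # paper's nu-Arc algorithm. Assumes the discrete (SAMME) variant, the
    # default in recent scikit-learn, so estimator_weights_ are the alpha_t.
    import numpy as np
    from sklearn.datasets import make_classification
    from sklearn.ensemble import AdaBoostClassifier
    from sklearn.tree import DecisionTreeClassifier

    # Illustrative noisy data: flip_y injects 10% label noise.
    X, y01 = make_classification(n_samples=300, flip_y=0.10, random_state=0)
    y = 2 * y01 - 1  # map labels {0, 1} -> {-1, +1}

    clf = AdaBoostClassifier(estimator=DecisionTreeClassifier(max_depth=1),
                             n_estimators=100, random_state=0).fit(X, y)

    # Normalized ensemble output f(x) = sum_t alpha_t h_t(x) / sum_t alpha_t,
    # so the margin y * f(x) lies in [-1, +1].
    T = len(clf.estimators_)  # boosting may stop early
    alpha = clf.estimator_weights_[:T]
    H = np.array([h.predict(X) for h in clf.estimators_])  # rows in {-1, +1}
    margins = y * (alpha @ H) / alpha.sum()

    # Empirical term of the bound: fraction of points with margin <= rho.
    for rho in (0.0, 0.1, 0.2):
        print(f"margin <= {rho:.1f}: {np.mean(margins <= rho):.3f}")

On noisy data the margin distribution typically has a heavy left tail; a regularized variant of the kind the paper constructs trades a few small or negative margins for a larger margin on the remaining points.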
Dec-31-2000