Logarithmic Time Online Multiclass prediction John Langford Courant Institute of Mathematical Sciences Microsoft Research New York, NY, USA
–Neural Information Processing Systems
We study the problem of multiclass classification with an extremely large number of classes (k), with the goal of obtaining train and test time complexity logarithmic in the number of classes. We develop top-down tree construction approaches for constructing logarithmic depth trees. On the theoretical front, we formulate a new objective function, which is optimized at each node of the tree and creates dynamic partitions of the data which are both pure (in terms of class labels) and balanced. We demonstrate that under favorable conditions, we can construct logarithmic depth trees that have leaves with low label entropy. However, the objective function at the nodes is challenging to optimize computationally. We address the empirical problem with a new online decision tree construction procedure. Experiments demonstrate that this online algorithm quickly achieves improvement in test error compared to more common logarithmic training time approaches, which makes it a plausible method in computationally constrained large-k applications.
Neural Information Processing Systems
Mar-13-2024, 04:30:54 GMT
- Country:
- North America > United States
- New York > New York County
- New York City (0.40)
- Illinois > Cook County
- Chicago (0.04)
- Florida > Palm Beach County
- Boca Raton (0.04)
- New York > New York County
- North America > United States
- Technology: