Large-Margin Convex Polytope Machine
–Neural Information Processing Systems
We present the Convex Polytope Machine (CPM), a novel non-linear learning algorithm for large-scale binary classification tasks. The CPM finds a large margin convex polytope separator which encloses one class. We develop a stochastic gradient descent based algorithm that is amenable to massive datasets, and augment it with a heuristic procedure to avoid sub-optimal local minima. Our experimental evaluations of the CPM on large-scale datasets from distinct domains (MNIST handwritten digit recognition, text topic, and web security) demonstrate that the CPM trains models faster, sometimes several orders of magnitude, than state-ofthe-art similar approaches and kernel-SVM methods while achieving comparable or better classification performance. Our empirical results suggest that, unlike prior similar approaches, we do not need to control the number of sub-classifiers (sides of the polytope) to avoid overfitting.
Neural Information Processing Systems
Mar-13-2024, 12:00:14 GMT
- Country:
- North America > United States
- Virginia > Arlington County
- Arlington (0.04)
- New York > New York County
- New York City (0.05)
- Virginia > Arlington County
- Asia > Japan
- Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- North America > United States
- Genre:
- Research Report > New Finding (0.48)
- Technology: