Feature Selection and Regularization in Multi-Class Classification: An Empirical Study of One-vs-Rest Logistic Regression with Gradient Descent Optimization and L1 Sparsity Constraints
Arafat, Jahidul, Tasmin, Fariha, Poudel, Sanjaya
–arXiv.org Artificial Intelligence
Multi-class wine classification presents fundamental trade-offs between model accuracy, feature dimensionality, and interpretability - critical factors for production deployment in analytical chemistry. This paper presents a comprehensive empirical study of One-vs-Rest logistic regression on the UCI Wine dataset (178 samples, 3 cultivars, 13 chemical features), comparing from-scratch gradient descent implementation against scikit-learn's optimized solvers and quantifying L1 regularization effects on feature sparsity. Manual gradient descent achieves 92.59 percent mean test accuracy with smooth convergence, validating theoretical foundations, though scikit-learn provides 24x training speedup and 98.15 percent accuracy. Class-specific analysis reveals distinct chemical signatures with heterogeneous patterns where color intensity varies dramatically (0.31 to 16.50) across cultivars. L1 regularization produces 54-69 percent feature reduction with only 4.63 percent accuracy decrease, demonstrating favorable interpretability-performance trade-offs. We propose an optimal 5-feature subset achieving 62 percent complexity reduction with estimated 92-94 percent accuracy, enabling cost-effective deployment with 80 dollars savings per sample and 56 percent time reduction. Statistical validation confirms robust generalization with sub-2ms prediction latency suitable for real-time quality control. Our findings provide actionable guidelines for practitioners balancing comprehensive chemical analysis against targeted feature measurement in resource-constrained environments.
arXiv.org Artificial Intelligence
Oct-24-2025
- Country:
- Asia > Bangladesh
- Dhaka Division > Dhaka District > Dhaka (0.04)
- Europe > Germany
- Berlin (0.04)
- North America > United States
- Alabama (0.04)
- California (0.04)
- New Jersey > Hudson County
- Hoboken (0.04)
- New York > New York County
- New York City (0.04)
- Oceania > Australia (0.04)
- Asia > Bangladesh
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis
- Beverages (1.00)
- Education (1.00)
- Government (0.67)
- Health & Medicine
- Diagnostic Medicine (0.67)
- Pharmaceuticals & Biotechnology (0.93)
- Law (0.93)
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis
- Technology: