Optimised Feature Subset Selection via Simulated Annealing
Martínez-García, Fernando, Rubio-García, Álvaro, Fernández-Lorenzo, Samuel, García-Ripoll, Juan José, Porras, Diego
We introduce SA-FDR, a novel algorithm for $\ell_0$-norm feature selection that considers this task as a combinatorial optimisation problem and solves it by using simulated annealing to perform a global search over the space of feature subsets. The optimisation is guided by the Fisher discriminant ratio, which we use as a computationally efficient proxy for model quality in classification tasks. Our experiments, conducted on datasets with up to hundreds of thousands of samples and hundreds of features, demonstrate that SA-FDR consistently selects more compact feature subsets while achieving a high predictive accuracy. This ability to recover informative yet minimal sets of features stems from its capacity to capture inter-feature dependencies often missed by greedy optimisation approaches. As a result, SA-FDR provides a flexible and effective solution for designing interpretable models in high-dimensional settings, particularly when model sparsity, interpretability, and performance are crucial.
Aug-1-2025
- Country:
- Europe
- Spain > Galicia
- Madrid (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Spain > Galicia
- North America > United States
- Wisconsin (0.04)
- Europe
- Genre:
- Research Report (1.00)
- Technology: