Finding Optimal Diverse Feature Sets with Alternative Feature Selection
–arXiv.org Artificial Intelligence
Feature selection is popular for obtaining small, interpretable, yet highly accurate prediction models. Conventional feature-selection methods typically yield one feature set only, which might not suffice in some scenarios. For example, users might be interested in finding alternative feature sets with similar prediction quality, offering different explanations of the data. In this article, we introduce alternative feature selection and formalize it as an optimization problem. In particular, we define alternatives via constraints and enable users to control the number and dissimilarity of alternatives. Next, we analyze the complexity of this optimization problem and show NP-hardness. Further, we discuss how to integrate conventional feature-selection methods as objectives. Finally, we evaluate alternative feature selection with 30 classification datasets. We observe that alternative feature sets may indeed have high prediction quality, and we analyze several factors influencing this outcome.
arXiv.org Artificial Intelligence
Jul-21-2023
- Country:
- Africa > Sudan (0.04)
- Oceania > New Zealand
- North Island > Waikato > Hamilton (0.04)
- North America
- United States
- District of Columbia > Washington (0.04)
- Texas > El Paso County
- El Paso (0.04)
- Tennessee > Davidson County
- Nashville (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Georgia > Fulton County
- Atlanta (0.04)
- Florida > Miami-Dade County
- Miami Beach (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Santa Clara County > Santa Clara (0.04)
- Los Angeles County
- Pasadena (0.04)
- Long Beach (0.04)
- Canada
- United States
- Europe
- Czechia (0.04)
- Hungary > Budapest
- Budapest (0.04)
- Italy > Tuscany
- Florence (0.04)
- Germany
- Berlin (0.04)
- Baden-Württemberg > Karlsruhe Region
- Karlsruhe (0.04)
- Greece > Epirus
- Ioannina (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Belgium > Flanders
- East Flanders > Ghent (0.04)
- Antwerp Province > Antwerp (0.04)
- Netherlands > South Holland
- Leiden (0.04)
- Sweden
- Vaestra Goetaland > Gothenburg (0.04)
- Stockholm > Stockholm (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- City of Aberdeen > Aberdeen (0.04)
- Poland > Lesser Poland Province
- Kraków (0.04)
- Asia
- Thailand > Bangkok
- Bangkok (0.04)
- South Korea > Busan
- Busan (0.04)
- Middle East
- Republic of Türkiye > Istanbul Province
- Istanbul (0.04)
- Israel > Haifa District
- Haifa (0.04)
- Republic of Türkiye > Istanbul Province
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- China
- Thailand > Bangkok
- Genre:
- Research Report (1.00)
- Overview (0.92)
- Technology: