"A 6 or a 9?": Ensemble Learning Through the Multiplicity of Performant Models and Explanations
Zuin, Gianlucca; Veloso, Adriano
arXiv.org Artificial Intelligence
Creating models from past observations and ensuring their effectiveness on new data is the essence of machine learning. However, selecting models that generalize well remains a challenging task. The Rashomon Effect describes a related phenomenon: cases where multiple models perform similarly well on a given learning problem. This often occurs in real-world scenarios, such as manufacturing processes or medical diagnosis, where diverse patterns in the data lead to multiple high-performing solutions. We propose the Rashomon Ensemble, a method that strategically selects models from these diverse high-performing solutions to improve generalization. By grouping models based on both their performance and their explanations, we construct ensembles that maximize diversity while maintaining predictive accuracy. This selection ensures that each model covers a distinct region of the solution space, making the ensemble more robust to distribution shifts and variations in unseen data. We validate our approach on both open and proprietary collaborative real-world datasets, demonstrating AUROC improvements of up to 0.20+ in scenarios where the Rashomon ratio is large. Additionally, we demonstrate tangible benefits for businesses in various real-world applications, highlighting the robustness, practicality, and effectiveness of our approach.
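The selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: the toy data generator, the fixed grid of linear scorers standing in for trained models, the use of the weight vector as a stand-in for a feature-attribution explanation, the tolerance `epsilon`, the greedy cosine-similarity grouping, and the empirical Rashomon ratio computed as the fraction of candidates that fall inside the Rashomon set.

```python
import itertools
import math
import random


def make_data(n, rng):
    """Toy binary task (assumed): two redundant informative features, one noise feature."""
    data = []
    for _ in range(n):
        y = rng.randint(0, 1)
        x = (y + rng.gauss(0, 0.6), y + rng.gauss(0, 0.6), rng.gauss(0, 1.0))
        data.append((x, y))
    return data


def predict(w, x):
    """Linear scorer; the threshold sits halfway between the two class means."""
    score = sum(wi * xi for wi, xi in zip(w, x))
    return 1 if score > 0.5 * (w[0] + w[1]) else 0


def accuracy(w, data):
    return sum(predict(w, x) == y for x, y in data) / len(data)


def cosine(a, b):
    dot = sum(ai * bi for ai, bi in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))


def rashomon_ensemble(candidates, val, epsilon=0.1, max_similarity=0.9):
    """Keep near-optimal models, then pick one representative per explanation group."""
    scored = sorted(((accuracy(w, val), w) for w in candidates), reverse=True)
    best = scored[0][0]
    # Rashomon set: every candidate within epsilon of the best validation score.
    rashomon_set = [(acc, w) for acc, w in scored if acc >= best - epsilon]
    ratio = len(rashomon_set) / len(candidates)  # empirical Rashomon ratio
    # Greedy grouping (assumed): a model joins the ensemble only if its
    # explanation differs enough from every member selected so far.
    members = []
    for _, w in rashomon_set:
        if all(cosine(w, m) < max_similarity for m in members):
            members.append(w)
    return members, rashomon_set, ratio


def ensemble_predict(members, x):
    """Majority vote; ties resolve to class 1."""
    votes = sum(predict(w, x) for w in members)
    return 1 if 2 * votes >= len(members) else 0


rng = random.Random(0)
val, test = make_data(400, rng), make_data(400, rng)
# Hypothetical candidate pool: all non-zero weight vectors on a coarse grid.
candidates = [w for w in itertools.product((0.0, 0.5, 1.0), repeat=3) if any(w)]

members, rashomon_set, ratio = rashomon_ensemble(candidates, val)
ensemble_acc = sum(ensemble_predict(members, x) == y for x, y in test) / len(test)
print(f"Rashomon ratio: {ratio:.2f}, ensemble size: {len(members)}")
print(f"Ensemble test accuracy: {ensemble_acc:.3f}")
```

Picking the best-scoring model from each explanation group keeps every member inside the performance tolerance while forcing members to rely on different features, which is the diversity property the abstract emphasizes. In practice the candidates would be trained models and the explanations would come from an attribution method; both are simplified away here.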
Oct-14-2025
- Country:
  - Asia
    - China > Hubei Province
      - Wuhan (0.04)
    - South Korea > Seoul
      - Seoul (0.04)
  - Europe
  - North America
    - Canada > Ontario
      - Toronto (0.04)
    - United States
      - California
        - Los Angeles County > Long Beach (0.04)
        - Monterey County > Monterey (0.04)
        - San Francisco County > San Francisco (0.14)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - New York > New York County
        - New York City (0.04)
      - Virginia (0.04)
  - Oceania > Australia
    - Queensland (0.04)
  - South America
    - Brazil
      - Amazonas > Manaus (0.04)
      - Minas Gerais > Belo Horizonte (0.04)
      - Rio de Janeiro > Rio de Janeiro (0.04)
      - São Paulo (0.04)
    - Paraguay > Asunción
      - Asunción (0.04)
- Genre:
  - Research Report
    - New Finding (1.00)
    - Promising Solution (0.67)
- Industry:
  - Education (1.00)
  - Energy (1.00)
  - Health & Medicine
    - Diagnostic Medicine (0.87)
    - Epidemiology (1.00)
    - Health Care Providers & Services (1.00)
    - Therapeutic Area
      - Immunology (1.00)
      - Infections and Infectious Diseases (1.00)
      - Pulmonary/Respiratory Diseases (1.00)
- Technology:
  - Information Technology
    - Artificial Intelligence > Machine Learning
      - Decision Tree Learning (0.93)
      - Neural Networks (0.68)
      - Performance Analysis > Accuracy (0.87)
      - Statistical Learning (1.00)
    - Data Science > Data Mining (1.00)