Interpretable Symbolic Regression for Data Science: Analysis of the 2022 Competition
de Franca, F. O., Virgolin, M., Kommenda, M., Majumder, M. S., Cranmer, M., Espada, G., Ingelse, L., Fonseca, A., Landajuela, M., Petersen, B., Glatt, R., Mundhenk, N., Lee, C. S., Hochhalter, J. D., Randall, D. L., Kamienny, P., Zhang, H., Dick, G., Simon, A., Burlacu, B., Kasak, Jaan, Machado, Meera, Wilstrup, Casper, La Cava, W. G.
–arXiv.org Artificial Intelligence
Symbolic regression searches for analytic expressions that accurately describe studied phenomena. The main attraction of this approach is that it returns an interpretable model that can be insightful to users. Historically, the majority of algorithms for symbolic regression have been based on evolutionary algorithms. However, there has been a recent surge of new proposals that instead utilize approaches such as enumeration algorithms, mixed linear integer programming, neural networks, and Bayesian optimization. In order to assess how well these new approaches behave on a set of common challenges often faced in real-world data, we hosted a competition at the 2022 Genetic and Evolutionary Computation Conference consisting of different synthetic and real-world datasets which were blind to entrants. For the real-world track, we assessed interpretability in a realistic way by using a domain expert to judge the trustworthiness of candidate models.We present an in-depth analysis of the results obtained in this competition, discuss current challenges of symbolic regression algorithms and highlight possible improvements for future competitions.
arXiv.org Artificial Intelligence
Jul-3-2023
- Country:
- South America
- Brazil > São Paulo (0.04)
- Chile > Santiago Metropolitan Region
- Santiago Province > Santiago (0.04)
- Oceania > New Zealand
- North Island > Auckland Region > Auckland (0.04)
- North America > United States
- Utah (0.04)
- New York > New York County
- New York City (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Europe
- France (0.04)
- Denmark (0.04)
- Austria > Upper Austria (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Germany > Baden-Württemberg
- Tübingen Region > Tübingen (0.04)
- South America
- Genre:
- Research Report (1.00)
- Industry:
- Technology: