Transformer-based Planning for Symbolic Regression
Shojaee, Parshin, Meidani, Kazem, Farimani, Amir Barati, Reddy, Chandan K.
–arXiv.org Artificial Intelligence
Symbolic regression (SR) is a challenging task in machine learning that involves finding a mathematical expression for a function based on its values. Recent advancements in SR have demonstrated the effectiveness of pre-trained transformer-based models in generating equations as sequences, leveraging large-scale pre-training on synthetic datasets and offering notable advantages in terms of inference time over classical Genetic Programming (GP) methods. However, these models primarily rely on supervised pre-training goals borrowed from text generation and overlook equation discovery objectives like accuracy and complexity. To address this, we propose TPSR, a Transformer-based Planning strategy for Symbolic Regression that incorporates Monte Carlo Tree Search into the transformer decoding process. Unlike conventional decoding strategies, TPSR enables the integration of non-differentiable feedback, such as fitting accuracy and complexity, as external sources of knowledge into the transformer-based equation generation process. Extensive experiments on various datasets show that our approach outperforms state-of-the-art methods, enhancing the model's fitting-complexity trade-off, extrapolation abilities, and robustness to noise.
arXiv.org Artificial Intelligence
Oct-27-2023
- Country:
- South America > Chile
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- United States
- Virginia (0.04)
- Washington > King County
- Seattle (0.04)
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Genre:
- Research Report
- New Finding (1.00)
- Promising Solution (0.66)
- Research Report
- Industry:
- Energy (0.46)
- Technology: