Mambular: A Sequential Model for Tabular Deep Learning
Thielmann, Anton Frederik; Kumar, Manish; Weisser, Christoph; Reuter, Arik; Säfken, Benjamin; Samiee, Soheila
arXiv.org Artificial Intelligence
The analysis of tabular data has traditionally been dominated by gradient-boosted decision trees (GBDTs), known for their proficiency with mixed categorical and numerical features. However, recent deep learning innovations are challenging this dominance. We introduce Mambular, an adaptation of the Mamba architecture optimized for tabular data. We extensively benchmark Mambular against state-of-the-art models, including neural networks and tree-based methods, and demonstrate its competitive performance across diverse datasets. Additionally, we explore various adaptations of Mambular to understand its effectiveness for tabular data. We investigate different pooling strategies, feature interaction mechanisms, and bi-directional processing. Our analysis shows that interpreting features as a sequence and passing them through Mamba layers results in surprisingly performant models.
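To make the idea of "interpreting features as a sequence" concrete, here is a minimal NumPy sketch. It is not the authors' implementation: a plain diagonal linear state-space recurrence stands in for a Mamba layer (which is selective and considerably more involved), and every name, shape, and pooling option below is a hypothetical choice made for illustration.

```python
import numpy as np

def init_params(n_num, cat_sizes, d, seed=0):
    """Random parameters for the sketch (all names are hypothetical)."""
    rng = np.random.default_rng(seed)
    return {
        "num_w": rng.normal(size=(n_num, d)),   # per-feature scale for numerical columns
        "num_b": rng.normal(size=(n_num, d)),   # per-feature bias for numerical columns
        "cat_tables": [rng.normal(size=(k, d)) for k in cat_sizes],  # one table per categorical column
        "a": np.full(d, 0.9),                   # state decay of the stand-in recurrence
        "b": np.ones(d),                        # input gain
        "c": np.ones(d),                        # output gain
        "head": rng.normal(size=d),             # linear prediction head
    }

def embed_features(x_num, x_cat, num_w, num_b, cat_tables):
    """Turn one table row into a sequence of feature tokens, shape (n_features, d)."""
    # Each numerical value is scaled and shifted by its own learned vectors.
    num_tokens = x_num[:, None] * num_w + num_b
    # Each categorical value looks up a row in its own embedding table.
    cat_tokens = np.stack([table[idx] for table, idx in zip(cat_tables, x_cat)])
    return np.concatenate([num_tokens, cat_tokens], axis=0)

def linear_ssm_scan(tokens, a, b, c):
    """Stand-in for a Mamba layer: h_t = a * h_{t-1} + b * u_t, y_t = c * h_t."""
    h = np.zeros(tokens.shape[1])
    outputs = []
    for u in tokens:            # iterate over features as if they were time steps
        h = a * h + b * u
        outputs.append(c * h)
    return np.stack(outputs)

def forward(x_num, x_cat, params, pooling="mean"):
    """Embed a row, scan over its features, pool, and apply a linear head."""
    tokens = embed_features(x_num, x_cat, params["num_w"], params["num_b"], params["cat_tables"])
    hidden = linear_ssm_scan(tokens, params["a"], params["b"], params["c"])
    # Two simple pooling choices, assumed here for illustration only.
    pooled = hidden.mean(axis=0) if pooling == "mean" else hidden[-1]
    return float(pooled @ params["head"])

# Usage: one row with two numerical and two categorical features.
params = init_params(n_num=2, cat_sizes=[3, 5], d=4)
prediction = forward(np.array([0.5, -1.0]), [1, 4], params, pooling="mean")
```

Because the recurrence reads features in a fixed order, the column ordering affects the output; the bi-directional processing mentioned in the abstract is one way to reduce that dependence and is not shown here.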
Aug-12-2024
- Country:
- Europe > Germany
- Bavaria > Upper Bavaria > Munich (0.04)
- North America
- Canada (0.04)
- United States > California (0.05)
- Genre:
- Research Report
- Experimental Study (0.93)
- New Finding (1.00)
- Technology: