Advancing Sequential Numerical Prediction in Autoregressive Models
Fei, Xiang, Lu, Jinghui, Sun, Qi, Feng, Hao, Wang, Yanjie, Shi, Wei, Wang, An-Lan, Tang, Jingqun, Huang, Can
–arXiv.org Artificial Intelligence
Autoregressive models have become the de facto choice for sequence generation tasks, but standard approaches treat digits as independent tokens and apply cross-entropy loss, overlooking the coherent structure of numerical sequences. This paper introduces Numerical Token Integrity Loss (NTIL) to address this gap. NTIL operates at two levels: (1) token-level, where it extends the Earth Mover's Distance (EMD) to preserve ordinal relationships between numerical values, and (2) sequence-level, where it penalizes the overall discrepancy between the predicted and actual sequences. This dual approach improves numerical prediction and integrates effectively with LLMs/MLLMs. Extensive experiments show significant performance improvements with NTIL.
arXiv.org Artificial Intelligence
May-29-2025
- Country:
- Asia
- Europe
- Netherlands > North Holland
- Amsterdam (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Ukraine > Kyiv Oblast
- Kyiv (0.04)
- Netherlands > North Holland
- North America > Canada
- Genre:
- Research Report (0.64)
- Technology: