Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning
Chen, Ming, Tang, Sheng, Tan, Rong-Xi, Li, Ziniu, Chen, Jiacheng, Xue, Ke, Qian, Chao
–arXiv.org Artificial Intelligence
Decoding-based regression, which reformulates regression as a sequence generation task, has emerged as a promising paradigm of applying large language models for numerical prediction. However, its progress is hindered by the misalignment between discrete token-level objectives (e.g., cross-entropy) and continuous numerical values. Existing approaches relying on token-level constraints often fail to capture the global magnitude of the target value, limiting their precision and generalization. In this paper, we propose to unlock the potential of decoding-based regression via Reinforcement Learning (RL). We formulate the generation process as a Markov Decision Process, utilizing sequence-level rewards to enforce global numerical coherence. Extensive experiments on tabular regression and code metric regression demonstrate that our method (specifically with ReMax and GRPO) consistently outperforms both state-of-the-art token-level baselines and traditional regression heads, showing the superiority of introducing sequence-level signals. Our analysis further reveals that RL significantly enhances sampling efficiency and predictive precision, establishing decoding-based regression as a robust and accurate paradigm for general-purpose numerical prediction.
arXiv.org Artificial Intelligence
Dec-9-2025
- Country:
- Asia
- China
- Guangdong Province > Shenzhen (0.04)
- Hong Kong (0.04)
- Jiangsu Province > Nanjing (0.04)
- Singapore (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- China
- Europe
- Austria > Vienna (0.14)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Slovenia > Coastal-Karst
- Municipality of Hrpelje-Kozina > Kozina (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- California
- Los Angeles County > Long Beach (0.04)
- San Diego County > San Diego (0.04)
- San Francisco County > San Francisco (0.14)
- District of Columbia > Washington (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- New Mexico > Bernalillo County
- Albuquerque (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Maryland > Baltimore (0.04)
- Colorado > Denver County
- Denver (0.04)
- California
- Canada > Quebec
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Asia
- Genre:
- Research Report (1.00)
- Technology: