Beyond Least Squares: Robust Regression Transformer (R2T)
Gutierrez, Roman; Tang, Tony Kai; Gutierrez, Isabel
Robust regression techniques rely on least-squares optimization, which works well for Gaussian noise but fails in the presence of asymmetric structured noise. We propose a hybrid neural-symbolic architecture in which a transformer encoder processes numerical sequences, a compression neural network (NN) predicts symbolic parameters, and a fixed symbolic equation reconstructs the original sequence. The model is trained on synthetic data to recover the original sequence after asymmetric structured noise has been added, effectively learning a symbolic fit guided by neural parameter estimation. Our model achieves a median regression MSE of 6e-6 to 3.5e-5 on synthetic wearable data, a 10- to 300-fold improvement over ordinary least-squares fitting and robust regression techniques such as Huber loss or SoftL1.
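To make the baseline comparison concrete, the following is a minimal sketch of the three classical fits the abstract names (ordinary least squares, Huber loss, and SoftL1), using the `loss` option of SciPy's `least_squares`. It does not implement R2T. The straight-line model, the noise levels, the burst positions, and the `f_scale` value are illustrative assumptions, not the paper's data or settings.

```python
import numpy as np
from scipy.optimize import least_squares

rng = np.random.default_rng(0)

# Hypothetical clean signal: a straight line (an assumption, not the paper's model).
x = np.linspace(0.0, 1.0, 200)
clean = 2.0 * x + 1.0

# Small Gaussian noise plus two contiguous, positive-only bursts,
# standing in for "asymmetric structured noise".
noisy = clean + rng.normal(0.0, 0.05, size=x.size)
for start in (40, 120):
    noisy[start:start + 20] += rng.uniform(1.0, 3.0, size=20)


def residuals(params, x, y):
    """Residuals of the line model slope * x + intercept against y."""
    slope, intercept = params
    return slope * x + intercept - y


def fit_mse(loss):
    """Fit the line with the given loss and return the MSE against the clean signal."""
    result = least_squares(
        residuals, x0=[0.0, 0.0], loss=loss, f_scale=0.1, args=(x, noisy)
    )
    slope, intercept = result.x
    return float(np.mean((slope * x + intercept - clean) ** 2))


# "linear" is ordinary least squares; "huber" and "soft_l1" are robust losses.
mse = {loss: fit_mse(loss) for loss in ("linear", "huber", "soft_l1")}
for loss, value in mse.items():
    print(f"{loss:8s} MSE vs clean signal: {value:.2e}")
```

In this toy setting the robust losses reduce the upward bias that the one-sided bursts impose on the least-squares fit, but they do not remove it, since each outlier still contributes a bounded pull in the same direction.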
Aug-6-2025
- Genre:
- Research Report (0.40)
- Instructional Material (0.34)
- Industry:
- Health & Medicine > Therapeutic Area (0.32)
- Technology: