PANTHER: Generative Pretraining Beyond Language for Sequential User Behavior Modeling
–Neural Information Processing Systems
Large language models (LLMs) have shown that generative pretraining can distill vast world knowledge into compact token representations. Yet LLMs remain limited in modeling the behavioral knowledge contained within user interaction histories. User behavior forms a distinct modality in which each action, defined by multi-dimensional attributes such as time, context, and transaction type, constitutes a behavioral token. Modeling these high-cardinality, sparse, and irregular sequences is challenging, and discriminative models often falter under limited supervision. To bridge this gap, we extend generative pretraining to user behavior, learning transferable representations from unlabeled behavioral data much as LLMs learn from text.
Jun-23-2026, 02:03:30 GMT
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.93)
- Industry:
- Banking & Finance (1.00)
- Information Technology > Services
- e-Commerce Services (0.46)
- Technology: