Effective and Efficient Training for Sequential Recommendation using Recency Sampling
Petrov, Aleksandr, Macdonald, Craig
–arXiv.org Artificial Intelligence
Many modern sequential recommender systems use deep neural networks, which can effectively estimate the relevance of items but require a lot of time to train. Slow training increases expenses, hinders product development timescales and prevents the model from being regularly updated to adapt to changing user preferences. Training such sequential models involves appropriately sampling past user interactions to create a realistic training objective. The existing training objectives have limitations. For instance, next item prediction never uses the beginning of the sequence as a learning target, thereby potentially discarding valuable data. On the other hand, the item masking used by BERT4Rec is only weakly related to the goal of the sequential recommendation; therefore, it requires much more time to obtain an effective model. Hence, we propose a novel Recency-based Sampling of Sequences training objective that addresses both limitations. We apply our method to various recent and state-of-the-art model architectures - such as GRU4Rec, Figure 1: The SASRec [18] model trained with our proposed Caser, and SASRec. We show that the models enhanced with our training method outperforms BERT4Rec on the MovieLens-method can achieve performances exceeding or very close to stateof-the-art 20M dataset [14] and requires much less training time.
arXiv.org Artificial Intelligence
Jul-15-2022
- Country:
- North America > United States
- Washington > King County
- Seattle (0.05)
- New York > New York County
- New York City (0.04)
- Washington > King County
- Europe > United Kingdom
- Scotland > City of Glasgow > Glasgow (0.04)
- Asia > China
- Jiangsu Province > Yancheng (0.04)
- North America > United States
- Genre:
- Research Report > Promising Solution (0.34)