JDRec: Practical Actor-Critic Framework for Online Combinatorial Recommender System
Zhao, Xin, Fang, Zhiwei, Guo, Yuchen, He, Jie, Chen, Wenlong, Peng, Changping
–arXiv.org Artificial Intelligence
A combinatorial recommender (CR) system presents a list of items to a user at once on the result page, where user behavior is affected by both the contextual information and the items themselves. CR is formulated as a combinatorial optimization problem whose objective is to maximize the recommendation reward of the whole list. Despite its importance, building a practical CR system remains challenging because of the efficiency, dynamics, and personalization requirements of the online environment. To address this, we split the problem into two sub-problems: list generation and list evaluation. Novel and practical model architectures are designed for these sub-problems, aiming to jointly optimize effectiveness and efficiency. To adapt to the online setting, a bootstrap algorithm that forms an actor-critic reinforcement learning framework is given to explore better recommendation modes over long-term user interaction. Offline and online experimental results demonstrate the efficacy of the proposed JDRec framework. JDRec has been applied in online JD recommendation, improving click-through rate by 2.6% and synthetical value for the platform by 5.03%. We will publish the large-scale dataset used in this study to contribute to the research community.
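The split into list generation and list evaluation described in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the JDRec architecture: the generator samples lists from a softmax over hypothetical per-item scores, the evaluator is a hand-written list-level reward (position-discounted relevance with a penalty for adjacent items of the same category), and the bootstrap training loop that would update either model is omitted.

```python
import math
import random


def generate_list(scores, k, rng):
    """Generator (actor role): sample k distinct items without replacement,
    each draw proportional to exp(score) over the remaining items."""
    remaining = list(range(len(scores)))
    chosen = []
    for _ in range(k):
        weights = [math.exp(scores[i]) for i in remaining]
        pick = rng.choices(remaining, weights=weights, k=1)[0]
        chosen.append(pick)
        remaining.remove(pick)
    return chosen


def evaluate_list(items, relevance, category):
    """Evaluator (critic role): hypothetical list-level reward.
    Relevance is discounted by position, and an item that repeats the
    category of the item directly before it earns only half its gain."""
    reward = 0.0
    for pos, item in enumerate(items):
        gain = relevance[item] / math.log2(pos + 2)
        if pos > 0 and category[items[pos - 1]] == category[item]:
            gain *= 0.5
        reward += gain
    return reward


def recommend(scores, relevance, category, k, n_candidates, seed=0):
    """Generate several candidate lists and return the one the evaluator
    scores highest."""
    rng = random.Random(seed)
    candidates = [generate_list(scores, k, rng) for _ in range(n_candidates)]
    return max(candidates, key=lambda c: evaluate_list(c, relevance, category))
```

Scoring the list as a whole, rather than summing independent item scores, is what lets the evaluator capture interactions between items, such as the category-repetition penalty assumed here.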
Jul-27-2022
- Country:
  - South America > Argentina
    - Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
  - Oceania > Australia
  - North America
    - United States
      - North Carolina > Wake County
        - Raleigh (0.04)
      - Michigan > Washtenaw County
        - Ann Arbor (0.04)
      - Massachusetts > Suffolk County
        - Boston (0.04)
      - California
        - Santa Clara County > Santa Clara (0.04)
        - Los Angeles County > Long Beach (0.04)
    - Canada > Nova Scotia
      - Halifax Regional Municipality > Halifax (0.04)
  - Europe
    - Spain (0.04)
    - France (0.04)
    - United Kingdom > England
      - Greater London > London (0.04)
    - Sweden > Stockholm
      - Stockholm (0.04)
    - Italy
      - Tuscany > Pisa Province
        - Pisa (0.04)
      - Piedmont > Turin Province
        - Turin (0.04)
    - Denmark > Capital Region
      - Copenhagen (0.04)
  - Asia
- Genre:
  - Research Report > New Finding (0.68)
- Technology: