Hierarchical Structured Neural Network for Retrieval
Rangadurai, Kaushik, Yuan, Siyang, Huang, Minhui, Liu, Yiqun, Ghasemiesfeh, Golnaz, Pu, Yunchen, Xie, Xinfeng, He, Xingfeng, Xu, Fangzhou, Cui, Andrew, Viswanathan, Vidhoon, Dong, Yan, Xiong, Liang, Yang, Lin, Wang, Liang, Yang, Jiyan, Sun, Chonglin
arXiv.org Artificial Intelligence
Embedding Based Retrieval (EBR) is a crucial component of the retrieval stage in an (Ads) Recommendation System. It uses Two Tower or Siamese Networks to learn embeddings for both users and items (ads), then employs Approximate Nearest Neighbor (ANN) search to efficiently retrieve the most relevant ads for a specific user. Despite its recent rise in popularity in industry, EBR has two limitations. First, the Two Tower architecture uses a single dot-product interaction which, despite its efficiency, fails to capture the data distribution in practice. Second, the centroid representation and cluster assignment, which are components of ANN, are computed after the training process has completed. As a result, they do not take into account the optimization criteria used for the retrieval model. In this paper, we present Hierarchical Structured Neural Network (HSNN), a deployed, jointly optimized hierarchical clustering and neural network model that can take advantage of the sophisticated interactions and model architectures more common in the ranking stages while maintaining a sub-linear inference cost. We achieve a 6.5% improvement in offline evaluation and demonstrate 1.22% online gains through A/B experiments. HSNN has been successfully deployed in the Ads Recommendation system and currently handles a major portion of the traffic. The paper shares our experience in developing this system, including challenges such as freshness, volatility, cold start recommendations, and cluster collapse, and lessons from deploying the model in a large-scale retrieval production system.
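To make the baseline that the abstract critiques concrete, here is a minimal sketch of conventional EBR retrieval, not of HSNN itself. It assumes user and item embeddings have already been produced by the two towers, and it uses plain k-means as a stand-in for the clustering inside an ANN index. The function names (`build_clusters`, `retrieve`) and the parameters (`num_clusters`, `n_probe`, `top_k`) are hypothetical and chosen for illustration only. Note that the clustering runs after the embeddings are fixed, so it never sees the retrieval training objective; this is the second limitation described above.

```python
import numpy as np


def build_clusters(item_emb, num_clusters, num_iters=10, seed=0):
    """Post hoc k-means over frozen item embeddings (illustrative stand-in
    for the clustering step of an ANN index)."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from randomly chosen items.
    init = rng.choice(len(item_emb), size=num_clusters, replace=False)
    centroids = item_emb[init].copy()
    assign = np.zeros(len(item_emb), dtype=int)
    for _ in range(num_iters):
        # Assign every item to its nearest centroid (squared Euclidean distance).
        dists = ((item_emb[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=-1)
        assign = dists.argmin(axis=1)
        # Move each non-empty centroid to the mean of its members.
        for c in range(num_clusters):
            members = item_emb[assign == c]
            if len(members) > 0:
                centroids[c] = members.mean(axis=0)
    return centroids, assign


def retrieve(user_emb, item_emb, centroids, assign, top_k, n_probe):
    """Score clusters by dot product, then score only the items that belong
    to the n_probe best clusters and return the top_k of them."""
    cluster_scores = centroids @ user_emb
    probed = np.argsort(-cluster_scores)[:n_probe]
    candidates = np.flatnonzero(np.isin(assign, probed))
    # Single dot-product interaction between user and item embeddings.
    scores = item_emb[candidates] @ user_emb
    order = np.argsort(-scores)[:top_k]
    return candidates[order], scores[order]
```

With `n_probe` equal to `num_clusters`, the sketch degenerates to exhaustive dot-product search; smaller values trade recall for fewer scored items, which is where the sub-linear cost comes from.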
Aug-13-2024
- Genre:
- Research Report (0.41)
- Industry:
- Marketing (0.48)