Model Size Reduction Using Frequency Based Double Hashing for Recommender Systems

Zhang, Caojin, Liu, Yicun, Xie, Yuanpu, Ktena, Sofia Ira, Tejani, Alykhan, Gupta, Akshay, Myana, Pranay Kumar, Dilipkumar, Deepak, Paul, Suvadip, Ihara, Ikuhiro, Upadhyaya, Prasang, Huszar, Ferenc, Shi, Wenzhe

Jul-28-2020–arXiv.org Machine Learning

Deep Neural Networks (DNNs) with sparse input features have been widely used in recommender systems in industry. These models have large memory requirements and need a huge amount of training data. The large model size usually entails a cost, in the range of millions of dollars, for storage and communication with the inference services. In this paper, we propose a hybrid hashing method to combine frequency hashing and double hashing techniques for model size reduction, without compromising performance. We evaluate the proposed models on two product surfaces. In both cases, experiment results demonstrated that we can reduce the model size by around 90 % while keeping the performance on par with the original baselines.

artificial intelligence, frequency, machine learning, (16 more...)

arXiv.org Machine Learning

Jul-28-2020

arXiv.org PDF

Add feedback

Country:
- Asia (0.04)
- North America > United States
  - Hawaii (0.04)

Genre:
- Research Report (0.50)

Industry:
- Information Technology > Services (0.46)
- Government > Regional Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Personal Assistant Systems (0.91)
  - Machine Learning
    - Statistical Learning (0.94)
    - Neural Networks > Deep Learning (0.70)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found