Recommender Systems: Missing Data and Statistical Model Estimation
Marlin, Benjamin M. (University of British Columbia) | Zemel, Richard S. (University of Toronto) | Roweis, Sam T. (New York University) | Slaney, Malcolm (Yahoo! Research)
The goal of rating-based recommender systems is to make personalized predictions and recommendations for individual users by leveraging the preferences of a community of users with respect to a collection of items like songs or movies. Recommender systems are often based on intricate statistical models that are estimated from data sets containing a very high proportion of missing ratings. This work describes evidence of a basic incompatibility between the properties of recommender system data sets and the assumptions required for valid estimation and evaluation of statistical models in the presence of missing data. We discuss the ...

The personalization aspect of recommender systems makes them well suited to applications in electronic commerce and entertainment, while the fact that they do not rely on text-based descriptions of items makes them well suited to content like movies and music. In this paper, we focus on a key problem in rating-based collaborative filtering: the possibility of a basic incompatibility between the properties of recommender system data sets and the assumptions required for valid estimation and evaluation of statistical models in the presence of missing data. We describe properties of recommender system data sets and relate them to the statistical theory of model estimation in the presence of nonrandom missing data. We describe an extended modelling framework and a modified set of evaluation protocols for dealing with nonrandom missing data.
Jul-19-2011
- Genre:
  - Research Report (0.93)
- Industry:
  - Media (0.70)
  - Information Technology > Services (0.49)
  - Leisure & Entertainment (0.47)
- Technology: