McRank: Learning to Rank Using Multiple Classification and Gradient Boosting
Li, Ping, Wu, Qiang, Burges, Christopher J.
–Neural Information Processing Systems
We cast the ranking problem as (1) multiple classification ("Mc") (2) multiple ordinal classification, which lead to computationally tractable learning algorithms for relevance ranking in Web search. We consider the DCG criterion (discounted cumulative gain), a standard quality measure in information retrieval. Our approach is motivated by the fact that perfect classifications result in perfect DCG scores and the DCG errors are bounded by classification errors. We propose using the Expected Relevance to convert class probabilities into ranking scores. The class probabilities are learned using a gradient boosting tree algorithm. Evaluations on large-scale datasets show that our approach can improve LambdaRank [5] and the regressions-based ranker [6], in terms of the (normalized) DCG scores. An efficient implementation of the boosting tree algorithm is also presented.
Neural Information Processing Systems
Dec-31-2008
- Country:
- North America > United States > New Jersey > Hudson County > Hoboken (0.04)
- Genre:
- Research Report
- New Finding (0.68)
- Experimental Study (0.47)
- Research Report
- Technology: