McRank: Learning to Rank Using Multiple Classification and Gradient Boosting
Li, Ping, Wu, Qiang, Burges, Christopher J.
–Neural Information Processing Systems
We cast the ranking problem as (1) multiple classification ("Mc") (2) multiple ordinal classification,which lead to computationally tractable learning algorithms for relevance ranking in Web search. We consider the DCG criterion (discounted cumulative gain), a standard quality measure in information retrieval. Our approach ismotivated by the fact that perfect classifications result in perfect DCG scores and the DCG errors are bounded by classification errors. We propose using theExpected Relevance to convert class probabilities into ranking scores. The class probabilities are learned using a gradient boosting tree algorithm. Evaluations onlarge-scale datasets show that our approach can improve LambdaRank [5] and the regressions-based ranker [6], in terms of the (normalized) DCG scores. An efficient implementation of the boosting tree algorithm is also presented.
Neural Information Processing Systems
Dec-31-2008