On the Minimax Regret in Online Ranking with Top-k Feedback
Zhang, Mingyuan, Tewari, Ambuj
In online ranking, a learning algorithm sequentially ranks a set of items and receives feedback on its ranking in the form of relevance scores. Since obtaining relevance scores typically involves human annotation, it is of great interest to consider a partial feedback setting where feedback is restricted to the top-$k$ items in the rankings. Chaudhuri and Tewari [2017] developed a framework to analyze online ranking algorithms with top $k$ feedback. A key element in their work was the use of techniques from partial monitoring. In this paper, we further investigate online ranking with top $k$ feedback and solve some open problems posed by Chaudhuri and Tewari [2017]. We provide a full characterization of minimax regret rates with the top $k$ feedback model for all $k$ and for the following ranking performance measures: Pairwise Loss, Discounted Cumulative Gain, and Precision@n. In addition, we give an efficient algorithm that achieves the minimax regret rate for Precision@n.
Sep-5-2023
- Country:
- North America > United States
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Pennsylvania > Philadelphia County
- Europe
- Netherlands > North Holland
- Amsterdam (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- France > Auvergne-Rhône-Alpes
- Netherlands > North Holland
- Asia
- Middle East > Jordan (0.04)
- Japan > Honshū
- Chūbu > Nagano Prefecture > Nagano (0.04)
- North America > United States
- Genre:
- Research Report (0.50)
- Industry:
- Leisure & Entertainment (0.46)
- Technology: