Adversarial Attacks on Online Learning to Rank with Click Feedback Zhiyong Wang 4 Shuai Li5

Mar-27-2025, 08:37:31 GMT–Neural Information Processing Systems

Online learning to rank (OLTR) is a sequential decision-making problem where a learning agent selects an ordered list of items and receives feedback through user clicks. Although potential attacks against OLTR algorithms may cause serious losses in real-world applications, there is limited knowledge about adversarial attacks on OLTR. This paper studies attack strategies against multiple variants of OLTR. Our first result provides an attack strategy against the UCB algorithm on classical stochastic bandits with binary feedback, which solves the key issues caused by bounded and discrete feedback that previous works cannot handle.

artificial intelligence, data mining, machine learning, (18 more...)

Neural Information Processing Systems

Mar-27-2025, 08:37:31 GMT

Conferences PDF

Add feedback

Country:
- Asia > China (0.28)
- North America > United States (0.28)

Genre:
- Overview (0.34)
- Research Report (0.34)

Industry:
- Education > Educational Setting
  - Online (0.61)
- Government > Military (1.00)
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Data Science > Data Mining
    - Big Data (0.47)
  - Enterprise Applications > Human Resources
    - Learning Management (0.61)
  - Security & Privacy (1.00)

Duplicate Docs Excel Report

Title
Adversarial Attacks on Online Learning to Rank with Click Feedback Zhiyong Wang 4 Shuai Li

Similar Docs Excel Report more

Title	Similarity	Source
None found