User Tampering in Reinforcement Learning Recommender Systems
Charles Evans, Atoosa Kasirzadeh
arXiv.org Artificial Intelligence
This paper provides the first formalisation and empirical demonstration of a particular safety concern in reinforcement learning (RL)-based news and social media recommendation algorithms. This safety concern is what we call "user tampering" -- a phenomenon whereby an RL-based recommender system may manipulate a media user's opinions, preferences and beliefs via its recommendations as part of a policy to increase long-term user engagement. We provide a simulation study of a media recommendation problem constrained to the recommendation of political content, and demonstrate that a Q-learning algorithm consistently learns to exploit its opportunities to 'polarise' simulated 'users' with its early recommendations in order to have more consistent success with later recommendations catering to that polarisation. Finally, we argue that given our findings, designing an RL-based recommender system which cannot learn to exploit user tampering requires making the metric for the recommender's success independent of observable signals of user engagement, and thus that a media recommendation system built solely with RL is necessarily either unsafe, or almost certainly commercially unviable.
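The incentive described in the abstract can be illustrated with a very small tabular Q-learning sketch. This is not the authors' simulation: the three polarisation levels, the two actions (`MODERATE`, `PARTISAN`), the engagement values, and every hyperparameter below are hypothetical, chosen only so that partisan content pays less than moderate content for a neutral user but more once the user has been polarised.

```python
import random

# All names and numbers below are illustrative assumptions, not taken from the paper.
MODERATE, PARTISAN = 0, 1                  # hypothetical recommendation actions
N_LEVELS = 3                               # polarisation level: 0 (neutral) .. 2 (polarised)
MODERATE_ENGAGEMENT = 0.5                  # assumed expected engagement, any level
PARTISAN_ENGAGEMENT = [0.3, 0.6, 0.9]      # assumed expected engagement per level


def step(level, action):
    """Return (reward, next_level) for the toy user model."""
    if action == MODERATE:
        return MODERATE_ENGAGEMENT, level  # moderate content leaves the user unchanged
    # Partisan content pays off more for polarised users and polarises them further.
    return PARTISAN_ENGAGEMENT[level], min(level + 1, N_LEVELS - 1)


def train_q_learning(episodes=5000, horizon=20, alpha=0.1, gamma=0.9,
                     epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration on the toy model."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_LEVELS)]
    for _ in range(episodes):
        level = 0                          # every simulated user starts neutral
        for _ in range(horizon):
            if rng.random() < epsilon:
                action = rng.randrange(2)  # explore
            else:
                action = max((MODERATE, PARTISAN), key=lambda a: q[level][a])
            reward, next_level = step(level, action)
            target = reward + gamma * max(q[next_level])
            q[level][action] += alpha * (target - q[level][action])
            level = next_level
    return q


def greedy_policy(q):
    """Greedy action for each polarisation level."""
    return [max((MODERATE, PARTISAN), key=lambda a: row[a]) for row in q]


q_table = train_q_learning()
print("greedy action per polarisation level:", greedy_policy(q_table))
```

Under these assumed numbers, the immediate reward for partisan content at level 0 (0.3) is lower than for moderate content (0.5), so a purely myopic recommender would never polarise a neutral user. A learner that maximises discounted long-term engagement can still come to prefer it, which is the kind of incentive the abstract calls user tampering.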
Sep-9-2021
- Country:
  - Asia
    - China > Hong Kong (0.04)
    - Myanmar > Tanintharyi Region
      - Dawei (0.04)
    - Taiwan > Taiwan Province
      - Taipei (0.04)
  - Europe
    - Denmark > Capital Region
      - Copenhagen (0.04)
    - France > Auvergne-Rhône-Alpes
    - Netherlands > North Holland
      - Amsterdam (0.05)
    - United Kingdom > England
      - Greater London > London (0.04)
  - North America
    - Canada > British Columbia (0.04)
    - United States
      - California > San Francisco County
        - San Francisco (0.14)
      - Hawaii > Honolulu County
        - Honolulu (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - New Jersey > Mercer County
        - Princeton (0.04)
      - New York > New York County
        - New York City (0.05)
      - North Carolina > Wake County
        - Raleigh (0.04)
      - Texas (0.04)
  - Oceania > Australia
    - Australian Capital Territory > Canberra (0.04)
  - South America
    - Brazil > Ceará
      - Fortaleza (0.04)
    - Chile > Santiago Metropolitan Region
      - Santiago Province > Santiago (0.04)
- Genre:
  - Research Report (0.84)
- Industry:
  - Information Technology (0.67)
  - Leisure & Entertainment (0.46)
  - Media (0.66)
- Technology: