Assessing AI Utility: The Random Guesser Test for Sequential Decision-Making Systems

Ide, Shun, Blunt, Allison, Bouneffouf, Djallel

Aug-11-2024–arXiv.org Artificial Intelligence

We propose a general approach to quantitatively assessing the risk and vulnerability of artificial intelligence (AI) systems to biased decisions. The guiding principle of the proposed approach is that any AI algorithm must outperform a random guesser. This may appear trivial, but empirical results from a simplistic sequential decision-making scenario involving roulette games show that sophisticated AI-based approaches often underperform the random guesser by a significant margin. We highlight that modern recommender systems may exhibit a similar tendency to favor overly low-risk options. We argue that this "random guesser test" can serve as a useful tool for evaluating the utility of AI actions, and also points towards increasing exploration as a potential improvement to such systems.

ai misalignment, algorithm, misalignment, (12 more...)

arXiv.org Artificial Intelligence

Aug-11-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Iowa (0.04)
  - New York
    - Westchester County > Harrison (0.04)
    - New York County > New York City (0.04)
- Europe
  - Portugal (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.05)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Information Technology > Security & Privacy (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found