Risk Alignment in Agentic AI Systems

Clatterbuck, Hayley, Castro, Clinton, Morán, Arvo Muñoz

Oct-2-2024–arXiv.org Artificial Intelligence

Agentic AIs $-$ AIs that are capable and permitted to undertake complex actions with little supervision $-$ mark a new frontier in AI capabilities and raise new questions about how to safely create and align such systems with users, developers, and society. Because agents' actions are influenced by their attitudes toward risk, one key aspect of alignment concerns the risk profiles of agentic AIs. Risk alignment will matter for user satisfaction and trust, but it will also have important ramifications for society more broadly, especially as agentic AIs become more autonomous and are allowed to control key aspects of our lives. AIs with reckless attitudes toward risk (either because they are calibrated to reckless human users or are poorly designed) may pose significant threats. They might also open 'responsibility gaps' in which there is no agent who can be held accountable for harmful actions. What risk attitudes should guide an agentic AI's decision-making? How might we design AI systems that are calibrated to the risk attitudes of their users? What guardrails, if any, should be placed on the range of permissible risk attitudes? What are the ethical considerations involved when designing systems that make risky decisions on behalf of others? We present three papers that bear on key normative and technical aspects of these questions.

agentic ais, developer, risk attitude, (12 more...)

arXiv.org Artificial Intelligence

Oct-2-2024

arXiv.org PDF

Add feedback

Country:
- Asia > India (0.04)
- North America
  - Canada (0.28)
  - Mexico > Oaxaca (0.04)
  - United States
    - Wisconsin > Dane County
      - Madison (0.04)
    - Illinois > Cook County
      - Chicago (0.04)
- Europe > United Kingdom
  - England
    - Oxfordshire > Oxford (0.04)
    - Cambridgeshire > Cambridge (0.04)
- Africa > Eswatini
  - Manzini > Manzini (0.04)

Genre:
- Research Report (1.00)

Industry:
- Information Technology (1.00)
- Banking & Finance > Trading (1.00)
- Automobiles & Trucks (0.92)
- Health & Medicine > Therapeutic Area (0.92)
- Leisure & Entertainment > Games (0.92)
- Consumer Products & Services > Travel (0.67)
- Law > Criminal Law (0.67)
- Government > Regional Government
  - North America Government > United States Government (1.00)
- Transportation
  - Passenger (1.00)
  - Ground > Road (0.92)
  - Air (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Natural Language > Large Language Model (0.68)
  - Robots > Autonomous Vehicles (0.67)
  - Machine Learning
    - Performance Analysis > Accuracy (0.46)
    - Neural Networks > Deep Learning (0.45)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found