Using Stories to Teach Human Values to Artificial Agents

Riedl, Mark O. (Georgia Institute of Technology) | Harrison, Brent (Georgia Institute of Technology)

Apr-12-2016–AAAI Conferences

Value alignment is a property of an intelligent agent indicating that it can only pursue goals that are beneficial to humans. Successful value alignment should ensure that an artificial general intelligence cannot intentionally or unintentionally perform behaviors that adversely affect humans. This is problematic in practice since it is difficult to exhaustively enumerated by human programmers. In order for successful value alignment, we argue that values should be learned. In this paper, we hypothesize that an artificial intelligence that can read and understand stories can learn the values tacitly held by the culture from which the stories originate.We describe preliminary work on using stories to generate a value-aligned reward signal for reinforcement learning agents that prevents psychotic-appearing behavior.

agent, intelligence, plot graph, (15 more...)

AAAI Conferences

Apr-12-2016

Conferences PDF

Add feedback

Country:
- North America > United States
  - Massachusetts (0.04)
  - Georgia > Fulton County
    - Atlanta (0.04)
- Europe > United Kingdom
  - England > Oxfordshire > Oxford (0.04)

Industry:
- Health & Medicine
  - Pharmaceuticals & Biotechnology (0.49)
  - Therapeutic Area (0.35)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found