Learning preferences by looking at the world

Feb-13-2019, 03:59:08 GMT–Robohub

It would be great if we could all have household robots do our chores for us. Chores are tasks that we want done to make our houses cater more to our preferences; they are a way in which we want our house to be different from the way it currently is. However, most "different" states are not very desirable: Surely our robot wouldn't be so dumb as to go around breaking stuff when we ask it to clean our house? Unfortunately, AI systems trained with reinforcement learning only optimize features specified in the reward function and are indifferent to anything we might've inadvertently left out. Generally, it is easy to get the reward wrong by forgetting to include preferences for things that should stay the same, since we are so used to having these preferences satisfied, and there are so many of them.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Robohub

Feb-13-2019, 03:59:08 GMT

News Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Robots (0.86)
  - Machine Learning > Reinforcement Learning (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found