Surprise and Curiosity for Big Data Robotics

White, Adam (University of Alberta) | Modayil, Joseph (University of Alberta) | Sutton, Richard S. (University of Alberta)

Jul-22-2014–AAAI Conferences

This paper introduces a new perspective on curiosity and intrinsic motivation, viewed as the problem of generating behavior data for parallel off-policy learning. We provide 1) the first measure of surprise based on off-policy general value function learning progress, 2) the first investigation of reactive behavior control with parallel gradient temporal difference learning and function approximation, and 3) the first demonstration of using curiosity-driven control to react to a non-stationary learning task, all on a mobile robot. Our approach improves scalability over previous off-policy robot learning systems, which is essential for making progress on the ultimate big-data decision-making problem: life-long robot learning.
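To make the ingredients named in the abstract concrete, here is a minimal sketch of one off-policy learner with linear function approximation and a gradient temporal difference update, plus a surprise signal derived from its recent TD errors. It is illustrative only and is not taken from the paper: the class name `GTDLearner`, the parameters `trace_rate` and `eps`, and the specific surprise formula (magnitude of a running mean of the TD error divided by its running standard deviation) are all assumptions. The update shown is the TDC form of gradient TD learning with no eligibility traces.

```python
import numpy as np


class GTDLearner:
    """Hypothetical linear off-policy value learner (TDC-style gradient TD, no traces)."""

    def __init__(self, n_features, alpha=0.1, beta=0.01, gamma=0.9,
                 trace_rate=0.01, eps=1e-3):
        self.w = np.zeros(n_features)  # primary weights: the value estimate is w @ x
        self.h = np.zeros(n_features)  # secondary weights used by the gradient correction
        self.alpha, self.beta, self.gamma = alpha, beta, gamma
        self.trace_rate = trace_rate   # assumed step size for the running TD-error statistics
        self.eps = eps                 # assumed constant that keeps the ratio finite
        self.mean_delta = 0.0
        self.var_delta = 0.0

    def update(self, x, cumulant, x_next, rho):
        """One learning step; rho is the importance sampling ratio pi(a|s) / b(a|s)."""
        delta = cumulant + self.gamma * (self.w @ x_next) - self.w @ x
        h_dot_x = self.h @ x
        # Primary update: importance-weighted TD step with a gradient correction term.
        self.w += self.alpha * rho * (delta * x - self.gamma * h_dot_x * x_next)
        # Secondary update: track the expected TD error given the features.
        self.h += self.beta * (rho * delta - h_dot_x) * x
        # Running mean and variance of the TD error (an assumed choice of statistics).
        self.var_delta += self.trace_rate * ((delta - self.mean_delta) ** 2 - self.var_delta)
        self.mean_delta += self.trace_rate * (delta - self.mean_delta)
        return delta

    def surprise(self):
        """Assumed surprise signal: large when TD errors are consistently biased."""
        return abs(self.mean_delta) / (np.sqrt(self.var_delta) + self.eps)
```

In a parallel setting, many such learners would share the same behavior data stream, each with its own cumulant and target policy, and a behavior controller could react when any learner's surprise signal rises. How the paper defines surprise and how it selects behavior should be taken from the paper itself.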

data mining, machine learning, reinforcement learning

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.60)
  - Artificial Intelligence
    - Robots (1.00)
    - Machine Learning > Reinforcement Learning (0.53)
