Predictive Information Accelerates Learning in RL

Lee, Kuang-Huei, Fischer, Ian, Liu, Anthony, Guo, Yijie, Lee, Honglak, Canny, John, Guadarrama, Sergio

Jul-24-2020–arXiv.org Artificial Intelligence

The Predictive Information is the mutual information between the past and the future, I(X_past; X_future). We hypothesize that capturing the predictive information is useful in RL, since the ability to model what will happen next is necessary for success on many tasks. To test our hypothesis, we train Soft Actor-Critic (SAC) agents from pixels with an auxiliary task that learns a compressed representation of the predictive information of the RL environment dynamics using a contrastive version of the Conditional Entropy Bottleneck (CEB) objective. We refer to these as Predictive Information SAC (PI-SAC) agents. We show that PI-SAC agents can substantially improve sample efficiency over challenging baselines on tasks from the DM Control suite of continuous control environments. We evaluate PI-SAC agents by comparing against uncompressed PI-SAC agents, other compressed and uncompressed agents, and SAC agents directly trained from pixels.

environment step, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

Jul-24-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Michigan (0.04)
  - Illinois > Cook County
    - Chicago (0.04)
- Europe > Germany
  - North Rhine-Westphalia > Upper Bavaria > Munich (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology
  - Information Management (1.00)
  - Data Science (1.00)
  - Artificial Intelligence > Machine Learning
    - Neural Networks (0.93)
    - Reinforcement Learning (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found