Policy Gradient Coagent Networks

Dec-31-2011, Neural Information Processing Systems

We introduce a novel class of actor-critic algorithms for actors consisting of sets of interacting modules. We present, analyze theoretically, and empirically evaluate an update rule for each module that requires only local information: the module's input, its output, and the TD error broadcast by a critic. Such updates are necessary when computing compatible features becomes prohibitively difficult, and they are also desirable for increasing the biological plausibility of reinforcement learning methods.
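The abstract does not give the update rule itself, so the following is only a minimal sketch of what a "local" module update could look like. It assumes each module is a softmax unit that is linear in its input and takes a likelihood-ratio step scaled by the broadcast TD error. The names SoftmaxCoagent and train_chain, the two-module chain, the one-step task, and the scalar critic are all hypothetical illustrations, not the paper's experimental setup.

```python
import math
import random


class SoftmaxCoagent:
    """Hypothetical module: a stochastic softmax unit, linear in its input."""

    def __init__(self, n_inputs, n_outputs, step_size=0.1, rng=None):
        # One weight row per discrete output.
        self.theta = [[0.0] * n_inputs for _ in range(n_outputs)]
        self.step_size = step_size
        self.rng = rng if rng is not None else random.Random(0)

    def probabilities(self, x):
        prefs = [sum(w * xi for w, xi in zip(row, x)) for row in self.theta]
        top = max(prefs)  # subtract the max for numerical stability
        exps = [math.exp(p - top) for p in prefs]
        total = sum(exps)
        return [e / total for e in exps]

    def act(self, x):
        probs = self.probabilities(x)
        return self.rng.choices(range(len(probs)), weights=probs)[0]

    def update(self, x, u, td_error):
        # Local step: theta += alpha * delta * grad log pi(u | x).
        # Uses only this module's input x, its output u, and the TD error.
        probs = self.probabilities(x)
        for k, row in enumerate(self.theta):
            indicator = 1.0 if k == u else 0.0
            for i, xi in enumerate(x):
                row[i] += self.step_size * td_error * (indicator - probs[k]) * xi


def train_chain(episodes=2000, seed=0):
    """Assumed toy setup: two modules in a chain on a one-step task."""
    rng = random.Random(seed)
    first = SoftmaxCoagent(1, 2, rng=rng)
    second = SoftmaxCoagent(2, 2, rng=rng)
    value = 0.0          # scalar critic for the single start state
    critic_step = 0.1
    for _ in range(episodes):
        x1 = [1.0]                                        # constant state feature
        u1 = first.act(x1)
        x2 = [1.0 if u1 == k else 0.0 for k in range(2)]  # one-hot of u1
        action = second.act(x2)
        reward = 1.0 if action == 1 else 0.0
        td_error = reward - value   # the episode ends, so the next value is 0
        value += critic_step * td_error
        # The same TD error is broadcast to every module.
        first.update(x1, u1, td_error)
        second.update(x2, action, td_error)
    return first, second, value
```

In this sketch no module needs gradients from, or knowledge of, any other module: the second module sees only the first module's sampled output as its input, and both receive the same scalar TD error.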

artificial intelligence, machine learning, reinforcement learning

Country:
- North America > United States > Massachusetts (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (0.68)
  - Machine Learning
    - Reinforcement Learning (0.88)
    - Neural Networks (0.68)
