On the asymptotic equivalence between differential Hebbian and temporal difference learning using a local third factor

Kolodziejski, Christoph, Porr, Bernd, Tamosiunaite, Minija, Wörgötter, Florentin

Neural Information Processing Systems 

In this theoretical contribution we provide mathematical proof that two of the most important classes of network learning - correlation-based differential Hebbian learningand reward-based temporal difference learning - are asymptotically equivalent when timing the learning with a local modulatory signal. This opens the opportunity to consistently reformulate most of the abstract reinforcement learning frameworkfrom a correlation based perspective that is more closely related to the biophysics of neurons.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found