General policy mapping: online continual reinforcement learning inspired on the insect brain

Nov-30-2022–arXiv.org Artificial Intelligence

We have developed a model for online continual or lifelong reinforcement learning (RL) inspired on the insect brain. Our model leverages the offline training of a feature extraction and a common general policy layer to enable the convergence of RL algorithms in online settings. Sharing a common policy layer across tasks leads to positive backward transfer, where the agent continuously improved in older tasks sharing the same underlying general policy. Biologically inspired restrictions to the agent's network are key for the convergence of RL algorithms. This provides a pathway towards efficient online RL in resource-constrained scenarios.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

arXiv.org Artificial Intelligence

Nov-30-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States > Illinois > Cook County > Lemont (0.04)

Genre:
- Research Report (0.64)
- Instructional Material (0.48)

Industry:
- Education > Educational Setting (0.69)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found