Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning

Oct-10-2024, 19:28:15 GMT–Neural Information Processing Systems

Reinforcement learning (RL) agents have long sought to approach the efficiency of human learning. Humans are great observers who can learn by aggregating external knowledge from various sources, including observing how others' policies attempt a task. Prior studies in RL have incorporated external knowledge policies to help agents improve sample efficiency. However, it remains non-trivial to arbitrarily combine and replace those policies, a feature essential for generalization and transferability. We propose a new actor architecture for Knowledge-Grounded RL (KGRL), the Knowledge-Inclusive Attention Network (KIAN), which allows free knowledge rearrangement through embedding-based attentive action prediction.
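The idea of embedding-based attentive fusion can be illustrated with a toy sketch. This is not the KIAN implementation: the function names, the hand-written embeddings, and the simple weighted mixture of discrete action distributions are all assumptions made for illustration. Each policy (the agent's own "inner" policy and each external knowledge policy) is given a key embedding, the current state is mapped to a query embedding, and softmax attention over query-key scores decides how much each policy contributes to the final action distribution.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_policies(query, keys, action_dists):
    """Mix per-policy action distributions with attention weights.

    query        -- state embedding (hypothetical; would be learned in practice)
    keys         -- dict: policy name -> key embedding (hypothetical)
    action_dists -- dict: policy name -> probabilities over discrete actions
    """
    names = list(keys)
    scale = math.sqrt(len(query))
    # Scaled dot-product score between the state query and each policy key.
    scores = [sum(q * k for q, k in zip(query, keys[n])) / scale for n in names]
    weights = softmax(scores)
    n_actions = len(action_dists[names[0]])
    # Convex combination of the policies' action distributions.
    fused = [
        sum(w * action_dists[n][a] for w, n in zip(weights, names))
        for a in range(n_actions)
    ]
    return dict(zip(names, weights)), fused


# Toy setup: one inner policy and two external knowledge policies.
query = [1.0, 0.0]
keys = {"inner": [0.5, 0.5], "go_left": [2.0, 0.0], "go_right": [-2.0, 0.0]}
dists = {"inner": [0.5, 0.5], "go_left": [0.9, 0.1], "go_right": [0.1, 0.9]}
weights, fused = fuse_policies(query, keys, dists)

# Rearranging knowledge: drop one external policy without touching the rest.
keys2 = {n: k for n, k in keys.items() if n != "go_right"}
dists2 = {n: d for n, d in dists.items() if n != "go_right"}
weights2, fused2 = fuse_policies(query, keys2, dists2)
```

Because each policy enters the computation only through its own key embedding and action distribution, adding or removing an entry changes the set being attended over but not the shape of anything else, which is one way to read the "free knowledge rearrangement" described above.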

efficient deep reinforcement learning, flexible attention-based multi-policy fusion, knowledge policy

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)