Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning

Neural Information Processing Systems 

Humans are great observers who can learn by aggregating external knowledge from various sources, including observations of the policies others use when attempting a task.
