Fast deep reinforcement learning using online adjustments from the past

Steven Hansen, Alexander Pritzel, Pablo Sprechmann, Andre Barreto, Charles Blundell

Neural Information Processing Systems 

We propose Ephemeral V alue Adjusments (EV A): a means of allowing deep reinforcement learning agents to rapidly adapt to experience in their replay buffer.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found