Goto

Collaborating Authors

 Markov Models




#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

Neural Information Processing Systems

These counts are then used to compute a reward bonus according to the classic count-based exploration theory. We find that simple hash functions can achieve surprisingly good results on many challenging tasks. Furthermore, we show that a domain-dependent learned hash code may further improve these results.