Randomized Prior Functions for Deep Reinforcement Learning

Ian Osband, John Aslanides, Albin Cassirer

Neural Information Processing Systems 