Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning

Harm Van Seijen, Mehdi Fatemi, Arash Tavakoli

Neural Information Processing Systems (http://nips.cc/)