Near-optimal Reinforcement Learning in Factored MDPs

Ian Osband, Benjamin Van Roy

Neural Information Processing Systems 

Neural Information Processing Systems http://nips.cc/