Difference of Convex Functions Programming for Reinforcement Learning

Bilal Piot, Matthieu Geist, Olivier Pietquin

Neural Information Processing Systems 

Neural Information Processing Systems http://nips.cc/