Simple statistical gradient-following algorithms for connectionist reinforcement learning

Williams, R. J.

Classics 

Machine Learning, 8, 229–256