Finite-Sample Analysis of Contractive Stochastic Approximation Using Smooth Convex Envelopes

Neural Information Processing Systems 

Our result is applicable in Reinforcement Learning (RL).