A Algorithms Algorithm 1: MAP Propagation - Monte-Carlo Policy-Gradient Control 1 Input: differentiable policy function: π

Neural Information Processing Systems 

Random synaptic feedback weights support error backpropagation for deep learning.