Stochastic approximation for speeding up LSTD (and LSPI)

Open in new window