Concurrent Learning with Aggregated States via Randomized Least Squares Value Iteration
