Concurrent Learning with Aggregated States via Randomized Least Squares Value Iteration