Approximate Planning in Large POMDPs via Reusable Trajectories