On Anytime Learning at Macroscale