Curriculum Learning for Cumulative Return Maximization