Average-Reward Learning and Planning with Options
Yi Wan, Richard S. Sutton