A Reduction-based Framework for Sequential Decision Making with Delayed Feedback
Yunchang Yang 1 Han Zhong 1 Tianhao Wu 2 Bin Liu 3

Neural Information Processing Systems 

More examples include but are not limited to robotics (Mahmood et al.,