Harnessing Causality in Reinforcement Learning With Bagged Decision Times