Active Measuring in Reinforcement Learning With Delayed Negative Effects