Reinforcement Learning with Stochastic Reward Machines