Predictable Reinforcement Learning Dynamics through Entropy Rate Minimization