Model-Based Reinforcement Learning Exploiting State-Action Equivalence