Explore Reinforced: Equilibrium Approximation with Reinforcement Learning