Regret Minimization and Convergence to Equilibria in General-sum Markov Games