$ϕ$-Balancing for Mixture-of-Experts Training