Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks