Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts