How to Mitigate Overfitting in Weak-to-strong Generalization?