Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization