Theoretical Modeling of LLM Self-Improvement Training Dynamics Through Solver-Verifier Gap