Leveraging the two-timescale regime to demonstrate convergence of neural networks