Leveraging the two timescale regime to demonstrate convergence of neural networks