Provable Acceleration of Neural Net Training via Polyak's Momentum