Differentially Private Learning Needs Hidden State (Or Much Faster Convergence)