Computational Advantages of Multi-Grade Deep Learning: Convergence Analysis and Performance Insights