Convergence Analysis for Learning Orthonormal Deep Linear Neural Networks