Exponential Graph is Provably Efficient for Decentralized Deep Training