Batch Normalization Provably Avoids Rank Collapse for Randomly Initialised Deep Networks
