Grad Init: Learning to Initialize Neural Networks for Stable and Efficient Training

Open in new window