Adam Induces Implicit Weight Sparsity in Rectifier Neural Networks

Open in new window