Goto

Collaborating Authors

 freezeout


Train your Deep Learning Faster: FreezeOut

@machinelearnbot

Deep neural networks have many, many learnable parameters that are used to make inferences. Often, this poses a problem in two ways: Sometimes, the model does not make very accurate predictions. It also takes a long time to train them. In a previous post, we covered Train your Deep Learning model faster and sharper: Snapshot Ensembling -- M models for the cost of 1. The authors of this paper propose a method to increase training speed by freezing layers.


Train your Deep Learning Faster: FreezeOut

@machinelearnbot

Deep neural networks have many, many learnable parameters that are used to make inferences. Often, this poses a problem in two ways: Sometimes, the model does not make very accurate predictions. It also takes a long time to train them. In a previous post, we covered Train your Deep Learning model faster and sharper: Snapshot Ensembling -- M models for the cost of 1. The authors of this paper propose a method to increase training speed by freezing layers.


FreezeOut: Accelerate Training by Progressively Freezing Layers

arXiv.org Machine Learning

The early layers of a deep neural net have the fewest parameters, but take up the most computation. In this extended abstract, we propose to only train the hidden layers for a set portion of the training run, freezing them out one-by-one and excluding them from the backward pass. Through experiments on CIFAR, we empirically demonstrate that FreezeOut yields savings of up to 20% wall-clock time during training with 3% loss in accuracy for DenseNets, a 20% speedup without loss of accuracy for ResNets, and no improvement for VGG networks. Our code is publicly available at https://github.com/ajbrock/FreezeOut