Lifted Neural Networks
Askari, Armin, Negiar, Geoffrey, Sambharya, Rajiv, Ghaoui, Laurent El
We describe a novel family of models of multi- layer feedforward neural networks in which the activation functions are encoded via penalties in the training problem. Our approach is based on representing a non-decreasing activation function as the argmin of an appropriate convex optimiza- tion problem. The new framework allows for algo- rithms such as block-coordinate descent methods to be applied, in which each step is composed of a simple (no hidden layer) supervised learning problem that is parallelizable across data points and/or layers. Experiments indicate that the pro- posed models provide excellent initial guesses for weights for standard neural networks. In addi- tion, the model provides avenues for interesting extensions, such as robustness against noisy in- puts and optimizing over parameters in activation functions.
May-3-2018
- Country:
- North America > United States > California
- Alameda County > Berkeley (0.14)
- San Francisco County > San Francisco (0.14)
- North America > United States > California
- Genre:
- Research Report (0.64)
- Industry:
- Education (0.34)
- Technology: