Orthogonalized SGD and Nested Architectures for Anytime Neural Networks

Open in new window