Shapeshifter Networks: Cross-layer Parameter Sharing for Scalable and Effective Deep Learning