Learning Compact Representations of Neural Networks using DiscriminAtive Masking (DAM)