Norm-based Generalization Bounds for Compositionally Sparse Neural Networks