Categorical Feature Compression via Submodular Optimization