Balancing Quality and Variation: Spam Filtering Distorts Data Label Distributions
