A Novel Metric for Measuring Data Quality in Classification Applications (extended version)