Statistically Testing Training Data for Unwanted Error Patterns using Rule-Oriented Regression