The quest for better training data