Are fairness metric scores enough to assess discrimination biases in machine learning?