Look out for potential bias in chemical data sets