Data Representativity for Machine Learning and AI Systems