A Survey on Data Collection for Machine Learning: a Big Data - AI Integration Perspective