How to create a machine learning dataset from scratch?