An investigation of licensing of datasets for machine learning based on the GQM model