For AI, data are harder to come by than you think