This is where the data to build AI comes from