Finding Public Data for Your Machine Learning Pipelines