Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis