Data curation via joint example selection further accelerates multimodal learning