TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization