Estimated Audio-Caption Correspondences Improve Language-Based Audio Retrieval

Open in new window