Advancing Natural-Language Based Audio Retrieval with PaSST and Large Audio-Caption Data Sets
