Matching Latent Encoding for Audio-Text based Keyword Spotting

Open in new window