COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations

Open in new window