Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction

Open in new window