Do Discrete Self-Supervised Representations of Speech Capture Tone Distinctions?

Open in new window