DM-Codec: Distilling Multimodal Representations for Speech Tokenization

Open in new window