What do self-supervised speech models know about words?
