How Redundant Is the Transformer Stack in Speech Representation Models?

Open in new window