Explainable Disentanglement on Discrete Speech Representations for Noise-Robust ASR