Taming Modality Entanglement in Continual Audio-Visual Segmentation