Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss
