Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens

Open in new window