Long-Form End-to-End Speech Translation via Latent Alignment Segmentation
