SBAAM! Eliminating Transcript Dependency in Automatic Subtitling
Gaido, Marco, Papi, Sara, Negri, Matteo, Cettolo, Mauro, Bentivogli, Luisa
–arXiv.org Artificial Intelligence
Subtitling plays a crucial role in enhancing the accessibility of audiovisual content and encompasses three primary subtasks: translating spoken dialogue, segmenting translations into concise textual units, and estimating timestamps that govern their on-screen duration. Past attempts to automate this process rely, to varying degrees, on automatic transcripts, employed diversely for the three subtasks. In response to the acknowledged limitations associated with this reliance on transcripts, recent research has shifted towards transcription-free solutions for translation and segmentation, leaving the direct generation of timestamps as uncharted territory. To fill this gap, we introduce the first direct model capable of producing automatic subtitles, entirely eliminating any dependence on intermediate transcripts also for timestamp prediction. Experimental results, backed by manual evaluation, showcase our solution's new state-of-the-art performance across multiple language pairs and diverse conditions.
arXiv.org Artificial Intelligence
May-17-2024
- Country:
- South America > Chile
- Oceania > Australia
- North America
- United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- New York > New York County
- New York City (0.04)
- Colorado > Denver County
- Denver (0.04)
- Pennsylvania > Allegheny County
- Canada > Ontario
- Toronto (0.04)
- United States
- Europe
- Spain (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Portugal > Lisbon
- Lisbon (0.14)
- Italy
- Tuscany > Florence (0.04)
- Liguria > Genoa (0.04)
- Trentino-Alto Adige/Südtirol > Trentino Province
- Trento (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- China (0.04)
- Macao (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Africa > Middle East
- Morocco (0.04)
- Genre:
- Research Report (1.00)
- Industry:
- Media (0.48)
- Leisure & Entertainment (0.48)
- Technology:
- Information Technology > Artificial Intelligence
- Natural Language > Machine Translation (1.00)
- Machine Learning (1.00)
- Speech (0.95)
- Information Technology > Artificial Intelligence