FFSTC: Fongbe to French Speech Translation Corpus

Kponou, D. Fortune, Laleye, Frejus A. A., Ezin, Eugene C.

Mar-8-2024–arXiv.org Artificial Intelligence

In this paper, we introduce the Fongbe to French Speech Translation Corpus (FFSTC) for the first time. This corpus encompasses approximately 31 hours of collected Fongbe language content, featuring both French transcriptions and corresponding Fongbe voice recordings. FFSTC represents a comprehensive dataset compiled through various collection methods and the efforts of dedicated individuals. Furthermore, we conduct baseline experiments using Fairseq's transformer_s and conformer models to evaluate data quality and validity. Our results indicate a score of 8.96 for the transformer_s model and 8.14 for the conformer model, establishing a baseline for the FFSTC corpus.

corpus, dataset, translation, (12 more...)

arXiv.org Artificial Intelligence

Mar-8-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - District of Columbia > Washington (0.04)
  - Washington > King County
    - Seattle (0.04)
  - Pennsylvania > Philadelphia County
    - Philadelphia (0.04)
- Europe > France
  - Île-de-France > Paris
    - Paris (0.04)
  - Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
    - Marseille (0.04)
- Asia
  - Cambodia (0.04)
  - Japan > Kyūshū & Okinawa
    - Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- Africa
  - Benin (0.05)
  - Niger (0.04)
  - Mali (0.04)
  - Togo (0.04)
  - Nigeria (0.04)
  - Middle East > Algeria (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Speech > Speech Recognition (1.00)
  - Natural Language > Machine Translation (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found