A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision

Raude, Charles, Prajwal, K R, Momeni, Liliane, Bull, Hannah, Albanie, Samuel, Zisserman, Andrew, Varol, Gül

May-16-2024–arXiv.org Artificial Intelligence

In this work, our goals are two fold: large-vocabulary continuous sign language recognition (CSLR), and sign language retrieval. To this end, we introduce a multi-task Transformer model, CSLR2, that is able to ingest a signing sequence and output in a joint embedding space between signed language and spoken language text. To enable CSLR evaluation in the large-vocabulary setting, we introduce new dataset annotations that have been manually collected. These provide continuous sign-level annotations for six hours of test videos, and will be made publicly available. We demonstrate that by a careful choice of loss functions, training the model for both the CSLR and retrieval tasks is mutually beneficial in terms of performance -- retrieval improves CSLR performance by providing context, while CSLR improves retrieval with more fine-grained supervision. We further show the benefits of leveraging weak and noisy supervision from large-vocabulary datasets such as BOBSL, namely sign-level pseudo-labels, and English subtitles. Our model significantly outperforms the previous state of the art on both tasks.

annotation, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

May-16-2024

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East
  - Iran (0.04)
  - Jordan (0.04)
- Atlantic Ocean > North Atlantic Ocean
  - Celtic Sea > Bristol Channel (0.04)
  - Irish Sea (0.04)
  - North Sea > West of Shetlands (0.04)
- Europe
  - Belgium (0.04)
  - Denmark (0.04)
  - Faroe Islands (0.04)
  - France (0.04)
  - United Kingdom
    - Bristol Channel (0.04)
    - England
      - Cambridgeshire > Cambridge (0.14)
      - Dorset
        Bournemouth (0.04)
        Lyme Regis (0.04)
      - Isle of Wight (0.04)
      - Oxfordshire > Oxford (0.14)
      - Tyne and Wear > Sunderland (0.04)
    - Irish Sea (0.04)
    - Scotland > West of Shetlands (0.04)
    - Wales (0.04)
- Oceania > French Polynesia (0.04)
- South America > Falkland Islands (0.04)

Genre:
- Research Report (0.63)

Industry:
- Education > Curriculum
  - Subject-Specific Education (0.94)
- Leisure & Entertainment (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Performance Analysis
    - Accuracy (0.93)
  - Natural Language (1.00)
  - Vision (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found