Non-Autoregressive Sign Language Production via Knowledge Distillation

Hwang, Eui Jun, Kim, Jung Ho, Cho, Suk Min, Park, Jong C.

Aug-12-2022–arXiv.org Artificial Intelligence

Sign Language Production (SLP) aims to translate expressions in spoken language into corresponding ones in sign language, such as skeleton-based sign poses or videos. Existing SLP models are either AutoRegressive (AR) or Non-Autoregressive (NAR). However, AR-SLP models suffer from regression to the mean and error propagation during decoding. NSLP-G, a NAR-based model, resolves these issues to some extent but engenders other problems. For example, it does not consider target sign lengths and suffers from false decoding initiation. We propose a novel NAR-SLP model via Knowledge Distillation (KD) to address these problems. First, we devise a length regulator to predict the end of the generated sign pose sequence. We then adopt KD, which distills spatial-linguistic features from a pre-trained pose encoder to alleviate false decoding initiation. Extensive experiments show that the proposed approach significantly outperforms existing SLP models in both Frechet Gesture Distance and Back-Translation evaluation.

language production, sign language production, sign pose, (13 more...)

arXiv.org Artificial Intelligence

Aug-12-2022

arXiv.org PDF

Add feedback

Country:
- North America
  - United States (0.04)
  - Dominican Republic (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > South Korea
  - Daejeon > Daejeon (0.04)

Genre:
- Research Report (0.82)
- Instructional Material > Course Syllabus & Notes (0.46)

Industry:
- Education > Curriculum > Subject-Specific Education (0.85)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language (1.00)
  - Machine Learning > Neural Networks (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found