Visually Grounded Keyword Detection and Localisation for Low-Resource Languages
arXiv.org Artificial Intelligence
This study investigates the use of Visually Grounded Speech (VGS) models for keyword localisation in speech. It focuses on two main research questions: (1) is keyword localisation possible with VGS models, and (2) can keyword localisation be done cross-lingually in a real low-resource setting? Four localisation methods are proposed and evaluated on an English dataset, with the best-performing method achieving an accuracy of 57%. A new dataset containing spoken captions in the Yoruba language is also collected and released for cross-lingual keyword localisation. The cross-lingual model obtains a precision of 16% in actual keyword localisation, and this performance can be improved by initialising from a model pretrained on English data. The study presents a detailed analysis of the model's success and failure modes and highlights the challenges of using VGS models for keyword localisation in low-resource settings.
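A common way to turn a keyword *detector* into a *localiser* is to take the model's per-frame keyword scores (before they are pooled into a single utterance-level detection score) and report the time of the highest-scoring frame. The sketch below illustrates that idea only; the function name, the score array, and the frame rate are hypothetical assumptions, not the paper's actual interface or any of its four proposed methods.

```python
import numpy as np

def localise_keyword(frame_scores, frame_rate_hz=100.0):
    """Locate a keyword from per-frame detection scores.

    frame_scores: sequence of scores, one per speech frame, for a
        single keyword (illustrative; a real VGS model would produce
        these from its output layer before temporal pooling).
    frame_rate_hz: frames per second of the score sequence (assumed).

    Returns (time_seconds, score) of the highest-scoring frame.
    """
    frame_scores = np.asarray(frame_scores, dtype=float)
    best = int(np.argmax(frame_scores))          # index of the peak frame
    return best / frame_rate_hz, float(frame_scores[best])

# Toy usage: scores peak at frame index 3, i.e. 0.03 s at 100 frames/s.
t, s = localise_keyword([0.1, 0.2, 0.4, 0.9, 0.3], frame_rate_hz=100.0)
```

The localisation is then judged correct if the predicted time falls within the keyword's ground-truth interval (obtained, for an evaluation set, from forced alignments).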
Feb-1-2023
- Country:
- Africa
- Asia > China (0.04)
- North America > United States
- New York (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Education (1.00)
- Leisure & Entertainment > Sports (1.00)
- Media (0.67)
- Technology:
- Information Technology
- Artificial Intelligence
- Cognitive Science (0.92)
- Machine Learning
- Learning Graphical Models (0.67)
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (1.00)
- Natural Language
- Information Retrieval (1.00)
- Machine Translation (0.67)
- Representation & Reasoning (1.00)
- Speech > Speech Recognition (1.00)
- Vision (1.00)
- Communications
- Networks (0.67)
- Social Media (1.00)
- Information Management (0.67)
- Sensing and Signal Processing > Image Processing (1.00)