A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

Shen, Huangjun; Shao, Liangying; Li, Wenbo; Lan, Zhibin; Liu, Zhanyu; Su, Jinsong

May-22-2024–arXiv.org Artificial Intelligence

In recent years, multi-modal machine translation has attracted significant interest in both academia and industry due to its superior performance. It takes both textual and visual modalities as inputs, leveraging visual context to resolve ambiguities in the source text. In this paper, we begin with an exhaustive overview of 99 prior works, summarizing representative studies from the perspectives of dominant models, datasets, and evaluation metrics. We then analyze the impact of various factors on model performance and finally discuss possible future research directions for this task. Over time, multi-modal machine translation has developed more task types to meet diverse needs. Unlike previous surveys, which are confined to the early stage of multi-modal machine translation, our survey thoroughly summarizes these emerging types from different aspects, so as to give researchers a better understanding of the field's current state.



Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
  - Victoria > Melbourne (0.04)
- North America
  - Dominican Republic (0.04)
  - United States
    - Maryland > Baltimore (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - New Jersey > Essex County
      - Newark (0.04)
    - Nevada > Clark County
      - Las Vegas (0.04)
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Pennsylvania > Philadelphia County
      - Philadelphia (0.04)
    - Washington > King County
      - Seattle (0.04)
    - Massachusetts
      - Suffolk County > Boston (0.04)
      - Middlesex County > Cambridge (0.04)
    - California
      - Santa Clara County > San Jose (0.04)
      - Los Angeles County > Long Beach (0.04)
    - New York > New York County
      - New York City (0.04)
  - Canada
    - Ontario > Toronto (0.05)
    - Quebec > Montreal (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.14)
- Europe
  - Germany > Berlin (0.04)
  - Austria (0.04)
  - Czechia > Prague (0.04)
  - Switzerland > Vaud
    - Lausanne (0.04)
  - Spain
    - Valencian Community > Valencia Province
      - Valencia (0.04)
    - Catalonia > Barcelona Province
      - Barcelona (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - United Kingdom > England
    - West Midlands > Birmingham (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
  - France
    - Île-de-France > Paris
      - Paris (0.04)
    - Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
      - Marseille (0.04)
  - Italy
    - Tuscany > Florence (0.04)
    - Veneto > Venice (0.04)
    - Lombardy > Milan (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Singapore (0.04)
  - Macao (0.04)
  - Taiwan > Taiwan Province
    - Taipei (0.04)
  - South Korea > Seoul
    - Seoul (0.04)
  - Middle East
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)
    - Qatar > Ad-Dawhah
      - Doha (0.04)
  - Japan
    - Kyūshū & Okinawa > Kyūshū
      - Miyazaki Prefecture > Miyazaki (0.04)
    - Honshū
      - Kantō > Tokyo Metropolis Prefecture
        - Tokyo (0.14)
      - Kansai > Kyoto Prefecture
        - Kyoto (0.04)
  - China > Fujian Province
    - Xiamen (0.04)
- Africa > Ethiopia
  - Addis Ababa > Addis Ababa (0.04)

Genre:
- Overview (1.00)
- Research Report > New Finding (0.67)

Industry:
- Education (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Machine Translation (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.67)
