A Survey on Multi-modal Summarization

Jangra, Anubhav, Mukherjee, Sourajit, Jatowt, Adam, Saha, Sriparna, Hasanuzzaman, Mohammad

Feb-13-2023–arXiv.org Artificial Intelligence

The new era of technology has brought us to the point where it is convenient for people to share their opinions over an abundance of platforms. These platforms have a provision for the users to express themselves in multiple forms of representations, including text, images, videos, and audio. This, however, makes it difficult for users to obtain all the key information about a topic, making the task of automatic multi-modal summarization (MMS) essential. In this paper, we present a comprehensive survey of the existing research in the area of MMS, covering various modalities like text, image, audio, and video. Apart from highlighting the different evaluation metrics and datasets used for the MMS task, our work also discusses the current challenges and future directions in this field.

data mining, machine learning, natural language, (23 more...)

arXiv.org Artificial Intelligence

Feb-13-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > New York County
    - New York City (0.04)
  - New Mexico > Bernalillo County
    - Albuquerque (0.04)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
- Europe
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
  - Norway > Western Norway
    - Rogaland > Stavanger (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Ireland > Munster
    - County Cork > Cork (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - Austria > Tyrol
    - Innsbruck (0.04)
- Asia
  - Singapore (0.04)
  - Middle East > Jordan (0.04)
  - China > Hong Kong (0.04)
  - India > Bihar
    - Patna (0.14)

Genre:
- Overview (1.00)

Industry:
- Information Technology (1.00)
- Health & Medicine (1.00)
- Leisure & Entertainment > Sports
  - Tennis (1.00)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Information Management (1.00)
  - Data Science > Data Mining (1.00)
  - Communications > Social Media (1.00)
  - Human Computer Interaction > Interfaces (0.67)
  - Artificial Intelligence
    - Representation & Reasoning > Optimization (1.00)
    - Cognitive Science (1.00)
    - Vision > Image Understanding (0.67)
    - Natural Language
      - Text Processing (1.00)
      - Information Extraction (0.67)
    - Machine Learning
      - Statistical Learning (1.00)
      - Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found