Alignment Helps Make the Most of Multimodal Data
Arnold, Christian, Küpfer, Andreas
–arXiv.org Artificial Intelligence
When studying political communication, combining the information from text, audio, and video signals promises to reflect the richness of human communication more comprehensively than confining it to individual modalities alone. However, its heterogeneity, connectedness, and interaction are challenging to address when modeling such multimodal data. We argue that aligning the respective modalities can be an essential step in entirely using the potential of multimodal data because it informs the model with human understanding. Taking care of the data-generating process of multimodal data, our framework proposes four principles to organize alignment and, thus, address the challenges of multimodal data. We illustrate the utility of these principles by analyzing how German MPs address members of the far-right AfD in their speeches and predicting the tone of video advertising in the context of the 2020 US presidential race. Our paper offers important insights to all keen to analyze multimodal data effectively.
arXiv.org Artificial Intelligence
Jul-8-2024
- Country:
- North America
- Central America (0.04)
- United States
- Michigan (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Connecticut > Middlexex County
- Middletown (0.04)
- Europe
- Western Europe (0.04)
- Ireland (0.04)
- Belarus (0.04)
- Austria > Vienna (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- Italy > Tuscany
- Florence (0.04)
- Germany
- Baden-Württemberg (0.04)
- Hesse > Darmstadt Region
- Darmstadt (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Asia
- Middle East > Israel (0.04)
- China (0.04)
- North America
- Genre:
- Research Report (1.00)
- Industry:
- Government > Voting & Elections (1.00)
- Technology:
- Information Technology
- Data Science (1.00)
- Communications (0.93)
- Artificial Intelligence
- Vision (1.00)
- Natural Language (1.00)
- Machine Learning > Neural Networks (0.93)
- Information Technology