Multimodal Learning with Augmentation Techniques for Natural Disaster Assessment
Urse, Adrian-Dinu, Cercel, Dumitru-Clementin, Pop, Florin
–arXiv.org Artificial Intelligence
Abstract--Natural disaster assessment relies on accurate and rapid access to information, with social media emerging as a valuable real-time source. However, existing datasets suffer from class imbalance and limited samples, making effective model development a challenging task. This paper explores augmentation techniques to address these issues on the CrisisMMD multimodal dataset. For visual data, we apply diffusion-based methods, namely Real Guidance and DiffuseMix. For text data, we explore back-translation, paraphrasing with transformers, and image caption-based augmentation. We evaluated these across unimodal, multimodal, and multi-view learning setups. Results show that selected augmentations improve classification performance, particularly for underrepresented classes, while multi-view learning introduces potential but requires further refinement. This study highlights effective augmentation strategies for building more robust disaster assessment systems.
arXiv.org Artificial Intelligence
Nov-4-2025
- Country:
- Asia > Pakistan (0.04)
- Europe > Romania
- North America
- Mexico (0.04)
- United States > California (0.14)
- Genre:
- Research Report > New Finding (1.00)
- Technology: