Learning Regional Monsoon Patterns with a Multimodal Attention U-Net
Mazumder, Swaib Ilias, Kumar, Manish, Khan, Aparajita
–arXiv.org Artificial Intelligence
Accurate long-range monsoon rainfall prediction is critical for India's rain-fed agricultural economy and climate resilience planning, yet remains hindered by sparse ground data and complex regional variability. This work proposes a multimodal deep learning framework for gridded precipitation classification using satellite-derived geospatial inputs. Unlike previous rainfall prediction methods relying on coarse-resolution datasets of 5-50 km grid, we curate a high-resolution dataset of projected 1 km grid resolution for five Indian states, integrating seven heterogeneous Earth observation modalities, including land surface temperature, vegetation, soil moisture, humidity, wind speed, elevation, and land use, spanning the June-September 2024 period. We adopt a attention-guided U-Net architecture that captures spatial patterns and temporal dependencies across multi-modalities, and propose a combination of focal and dice loss to address class imbalance and spatial coherence in rainfall categories defined by the India Meteorological Department. Extensive experiments show that the multi-model framework significantly outperforms unimodal baselines and existing deep approaches, especially in underrepresented extreme rainfall zones. The framework demonstrates potential for scalable, region-adaptive monsoon forecasting and Earth observation driven climate risk assessment.
arXiv.org Artificial Intelligence
Sep-30-2025
- Country:
- Asia
- Bangladesh (0.04)
- India
- Assam (0.04)
- Bihar (0.04)
- Himachal Pradesh (0.05)
- Karnataka (0.06)
- Uttarakhand > Roorkee (0.04)
- Europe
- Austria (0.04)
- United Kingdom > North Sea
- Southern North Sea (0.04)
- Indian Ocean (0.04)
- Asia
- Genre:
- Research Report (0.50)
- Industry:
- Government > Regional Government (0.34)
- Information Technology > Security & Privacy (0.34)
- Technology: