Subich, Christopher
Fixing the Double Penalty in Data-Driven Weather Forecasting Through a Modified Spherical Harmonic Loss Function
Subich, Christopher, Husain, Syed Zahid, Separovic, Leo, Yang, Jing
Beginning in 2023, the release of data-driven atmospheric forecasting models powered by deep neural network architectures began a revolution in medium-range weather forecasting, with some commenters [Bauer, 2024] anticipating that data-driven forecasting will soon supplant traditional numerical weather prediction (NWP) systems in all operational contexts. GraphCast [Lam et al., 2023], FourCastNet [Kurth et al., 2023], and Pangu-Weather [Bi et al., 2023] demonstrated forecast skill superior to that of the high-resolution forecast system (IFS) of the European Centre for Medium Range Weather Forecasts (ECMWF) at lead times (forecast lengths) up to 10 days. Since the publication of these models, the field has been joined by many others, including the Artificial Intelligence Forecasting System (AIFS) developed by ECMWF itself [Lang et al., 2024a]. From the standpoint of machine learning, atmospheric forecasting is a large-scale generative problem comparable to predicting the next frame of a video. As a typical example, the version of the GraphCast model deployed experimentally by the National Oceanic and Atmospheric Administration (NOAA) [NOAA, 2024] predicts the 6-hour forecast for six atmospheric variables at each of 13 vertical levels plus five surface variables, on a latitude/longitude grid, for about 86 million output degrees of freedom in aggregate. GraphCast takes two time-levels as input, so the input for this model has about 170 million degrees of freedom. These first-generation data-driven weather models generally act as deterministic forecast systems, where each unique initial condition is mapped to a single forecast and verified against a "ground truth" from a data analysis system. The ERA5 atmospheric reanalysis [Hersbach et al., 2020] of ECWMF is most often used as the source of initial and verifying data for these forecast systems owing to its high quality and consistent behaviour from 1979 to present.
Leveraging data-driven weather models for improving numerical weather prediction skill through large-scale spectral nudging
Husain, Syed Zahid, Separovic, Leo, Caron, Jean-François, Aider, Rabah, Buehner, Mark, Chamberland, Stéphane, Lapalme, Ervig, McTaggart-Cowan, Ron, Subich, Christopher, Vaillancourt, Paul, Yang, Jing, Zadra, Ayrton
Operational meteorological forecasting has long relied on physics-based numerical weather prediction (NWP) models. Recently, this landscape has been disrupted by the advent of data-driven artificial intelligence (AI)-based weather models, which offer tremendous computational performance and competitive forecasting skill. However, data-driven models for medium-range forecasting generally suffer from major limitations, including low effective resolution and a narrow range of predicted variables. This study illustrates the relative strengths and weaknesses of these competing paradigms using the GEM (Global Environmental Multiscale) and GraphCast models to represent physics-based and AI-based approaches, respectively. By analyzing global predictions from these two models against observations and analyses in both physical and spectral spaces, this study demonstrates that GraphCast-predicted large scales outperform GEM, particularly for longer lead times. Building on this insight, a hybrid NWP-AI system is proposed, wherein GEM-predicted large-scale state variables are spectrally nudged toward GraphCast predictions, while allowing GEM to freely generate fine-scale details critical for weather extremes. Results indicate that this hybrid approach is capable of leveraging the strengths of GraphCast to enhance the prediction skill of the GEM model. Importantly, trajectories of tropical cyclones are predicted with enhanced accuracy without significant changes in intensity. Furthermore, this new hybrid system ensures that meteorologists have access to a complete set of forecast variables, including those relevant for high-impact weather events.