WeatherBench 2: A benchmark for the next generation of data-driven global weather models

Rasp, Stephan, Hoyer, Stephan, Merose, Alexander, Langmore, Ian, Battaglia, Peter, Russel, Tyler, Sanchez-Gonzalez, Alvaro, Yang, Vivian, Carver, Rob, Agrawal, Shreya, Chantry, Matthew, Bouallegue, Zied Ben, Dueben, Peter, Bromberg, Carla, Sisk, Jared, Barrington, Luke, Bell, Aaron, Sha, Fei

Jan-26-2024–arXiv.org Artificial Intelligence

WeatherBench 2 is an update to the global, medium-range (1-14 day) weather forecasting benchmark proposed by Rasp et al. (2020), designed with the aim to accelerate progress in data-driven weather modeling. WeatherBench 2 consists of an open-source evaluation framework, publicly available training, ground truth and baseline data as well as a continuously updated website with the latest metrics and state-of-the-art models: https://sites.research.google/weatherbench. This paper describes the design principles of the evaluation framework and presents results for current state-of-the-art physical and data-driven weather models. The metrics are based on established practices for evaluating weather forecasts at leading operational weather centers. We define a set of headline scores to provide an overview of model performance. In addition, we also discuss caveats in the current evaluation setup and challenges for the future of data-driven weather forecasting.

forecast, lead time, resolution, (16 more...)

arXiv.org Artificial Intelligence

Jan-26-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.14)
- Europe
  - Spain (0.04)
  - Iceland (0.04)
  - France (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Atlantic Ocean > North Atlantic Ocean
  - English Channel (0.04)

Genre:
- Research Report (1.00)
- Overview (0.68)

Technology:
- Information Technology
  - Modeling & Simulation (1.00)
  - Artificial Intelligence
    - Natural Language (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.68)