Skip to main content

ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling

    Dan Lu | Oak Ridge National Laboratory
    Xiao Wang | Oak Ridge National Laboratory
    Aristeidis Tsaris | Oak Ridge National Laboratory
    Jong Youl Choi | Oak Ridge National Laboratory
    Moetasim Ashfaq | Oak Ridge National Laboratory
Download Dataset on Globus

Description

This dataset release corresponds to the work conducted in ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling, where large-scale AI methods were applied to improve climate and weather resolution. The collection integrates four widely used, publicly available datasets: ERA5, PRISM, DAYMET, and IMERG. To prepare the data for ORBIT-2 model training and evaluation, we applied a preprocessing pipeline that generates paired low-resolution and high-resolution samples, enabling supervised downscaling experiments. The transformation from coarse to fine scales was performed using bilinear regridding, consistent with the procedures described in WeatherBench2, a community benchmark for weather and climate AI models. This dataset supports the development and evaluation of foundation models designed for weather and climate downscaling at exascale. Additional details on methodology and applications can be found in Wang et al., ORBIT-2 (arXiv:2505.04802, 2025).

Funding Information

DOE Contract Number

AC05-00OR22725

Originating Research Organization

Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)

Sponsoring Organization

Office of Science (SC)

Related Works

Details

Release Date

October 10, 2025

Subject

54 ENVIRONMENTAL SCIENCES, 58 GEOSCIENCES, 97 MATHEMATICS AND COMPUTING

Dataset

Dataset Type

ND Numeric Data

Software

The data can be interpreted and analyzed using widely adopted scientific computing and visualization libraries, including NumPy, Matplotlib, Xarray, and scikit-learn. All datasets are provided in the standard NumPy binary format (.npz or .npy).

Cite This Dataset:

Lu, D., Wang, X., Tsaris, A., Choi, J., Ashfaq, M. (2025). ORBIT-2 Dataset for Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling. Oak Ridge National Laboratory. https://doi.org/10.13139/OLCF/2589526.

Acknowledgements

This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Advanced Scientific Computing Research programs in the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.