Government
SSL4EO-L: Datasets and Foundation Models for Landsat Imagery Adam J. Stewart
The Landsat program is the longest-running Earth observation program in history, with 50+ years of data acquisition by 8 satellites. The multispectral imagery captured by sensors onboard these satellites is critical for a wide range of scientific fields. Despite the increasing popularity of deep learning and remote sensing, the majority of researchers still use decision trees and random forests for Landsat image analysis due to the prevalence of small labeled datasets and lack of foundation models. In this paper, we introduce SSL4EO-L, the first ever dataset designed for Self-Supervised Learning for Earth O bservation for the Landsat family of satellites (including 3 sensors and 2 product levels) and the largest Landsat dataset in history (5M image patches). Additionally, we modernize and re-release the L7 Irish and L8 Biome cloud detection datasets, and introduce the first ML benchmark datasets for Landsats 4-5 TM and Landsat 7 ETM+ SR. Finally, we pre-train the first foundation models for Landsat imagery using SSL4EO-L and evaluate their performance on multiple semantic segmentation tasks.
Supplementary Material
The material provided in this document contains additional information relevant to the dataset. The authors have provided extra details about data collection, annotation clean-up pipeline and evaluation. Finally, we have also provided a datasheet for dataset. In this section, we provide additional details regarding the dataset collection process. The link to access the dataset Bucktales. A guide to use the DarkLabel annotation tool is provided with dataset on Edmond. It can also be searched on Edmond platform of the Max Planck Group. The link to annotation analysis code is here. The video recording was done at sunrise and sunset every day between 2-18 March 2023, in Tal Chhapar Wildlife Sanctuary, India. The images for the object detection dataset are selected from nine different days from this period. Peak activity on the lek occurred between 9-15 March 2023.