• Stars
    star
    178
  • Rank 214,989 (Top 5 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created over 2 years ago
  • Updated 7 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

SSL4EO-S12: a large-scale dataset for self-supervised learning in Earth observation

SSL4EO-S12

The SSL4EO-S12 dataset is a large-scale multimodal multitemporal dataset for unsupervised/self-supervised pre-training in Earth observation. The dataset consists of unlabeled patch triplets (Sentinel-1 dual-pol SAR, Sentinel-2 top-of-atmosphere multispectral, Sentinel-2 surface reflectance multispectral) from 251079 locations across the globe, each patch covering 2640mx2640m and including four seasonal time stamps.

ssl4eo-s12

Access the dataset

  • Raw dataset: The full SSL4EO-S12 dataset (1.5TB, 500GB for each modality) is accessible at mediaTUM. There are some void IDs (gaps in folder names), see data/void_ids.csv. Center coordinates of all locations are available here.
  • Example subset: An example 100-patch subset (600MB) is available at Google Drive.
  • Compressed dataset: A compressed 8-bit version (20-50GB for each modality, including an RGB version) is available at mediaTUM. The raw 16/32-bit values are normalized by mean and std and converted to uint8, plus a default geotiff JPEG compression with quality 75. Note: in our experiments, 8-bit input (without JPEG compression) performs comparably well as 16-bit.
  • A 50k (random) RGB subset (18GB) is available here (link broken). Sample IDs see data/50k_ids_random.csv.

Collect your own data

Check src/download_data for instructions to download sentinel or other products from Google Earth Engine.

Pre-trained models

The pre-trained models with different SSL methods are provided as follows (13 bands of S2-L1C, 100 epochs, input clip to [0,1] by dividing 10000).

SSL method Arch BigEarthNet* EuroSAT So2Sat-LCZ42 Download Usage
MoCo ResNet50 91.8% 99.1% 60.9% full ckpt backbone logs define model, load weights
MoCo ViT-S/16 89.9% 98.6% 61.6% full ckpt backbone logs define model, load weights
DINO ResNet50 90.7% 99.1% 63.6% full ckpt backbone logs define model, load weights
DINO ViT-S/16 90.5% 99.0% 62.2% full ckpt backbone logs define model, load weights
MAE ViT-S/16 88.9% 98.7% 63.9% full ckpt backbone logs define model, load weights
Data2vec ViT-S/16 90.3% 99.1% 64.8% full ckpt backbone logs define model, load weights

* Note the results for BigEarthNet are based on the train/val split following SeCo.

Other pre-trained models:

SSL method Arch Input Download
MoCo ResNet18 S2-L1C 13 bands full ckpt backbone logs
ResNet18 S2-L1C RGB full ckpt, full ckpt ep200 backbone logs
ResNet50 S2-L1C RGB full ckpt backbone logs
ResNet50 S1 SAR 2 bands full ckpt backbone logs
MAE ViT-S/16 S1 SAR 2 bands full ckpt backbone
ViT-B/16 S1 SAR 2 bands full ckpt backbone
ViT-L/16 S1 SAR 2 bands full ckpt backbone
ViT-H/14 S1 SAR 2 bands full ckpt backbone
ViT-B/16 S2-L1C 13 bands full ckpt backbone
ViT-L/16 S2-L1C 13 bands full ckpt backbone
ViT-H/14 S2-L1C 13 bands full ckpt backbone

License

This repository is released under the Apache 2.0 license. The dataset and pretrained model weights are released under the CC-BY-4.0 license.

Citation

@article{wang2022ssl4eo,
  title={SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation},
  author={Wang, Yi and Braham, Nassim Ait Ali and Xiong, Zhitong and Liu, Chenying and Albrecht, Conrad M and Zhu, Xiao Xiang},
  journal={arXiv preprint arXiv:2211.07044},
  year={2022}
}

More Repositories

1

M3R-CR

Multimodal and Multiresolution Data Fusion for High-Resolution Cloud Removal: A Novel Baseline and Benchmark
Python
63
star
2

DOFA

Code for Neural Plasticity-Inspired Foundation Model for Observing the Earth Crossing Modalities
Jupyter Notebook
60
star
3

SSL4EO-Review

Codes for the benchmark experiments in the paper "Self-supervised Learning in Remote Sensing: A Review"
Python
35
star
4

UniDA

Code for TGRS paper "Universal Domain Adaptation for Remote Sensing Image Scene Classification"
Python
34
star
5

augsburg_Multimodal_Data_Set_MDaS

MATLAB
28
star
6

So2Sat-LCZ42

Zhu et al., So2Sat LCZ42: A Benchmark Dataset for Global Local Climate Zones Classification, IEEE Geoscience and Remote Sensing Magazine, in press, 2020.
24
star
7

DINO-MM

Self-supervised vision transformers for joint SAR-optical representation learning
Python
21
star
8

ChatEarthNet

Python
19
star
9

So2Sat-LCZ-Classification-Demo

Python
16
star
10

FGMAE

Feature guided masked Autoencoder for self-supervised learning in remote sensing
Jupyter Notebook
15
star
11

DeCUR

DeCUR: decoupling common and unique representations for multimodal self-supervised learning.
Python
14
star
12

ST-FSOD

Python
14
star
13

ML4Earth-Hackathon-2022

Jupyter Notebook
13
star
14

ML4Earth-Hackathon-2023

Python
13
star
15

UCDFormer

Code for TGRS paper "UCDFormer: Unsupervised Change Detection Using a Transformer-driven Image Translation"
Python
12
star
16

HTC-DC-Net

Official Implementation of submitted TGRS paper "HTC-DC Net: A Network for Monocular Height Estimation from Single Remote Sensing Images"
Python
10
star
17

rrsis

Python
8
star
18

So2Sat-POP-DL

Python
7
star
19

AIO2

Official implementation of TGRS paper - AIO2: Online Correction of Object Labels for Deep Learning with Incomplete Annotation in Remote Sensing Image Segmentation
Python
7
star
20

So2Sat-POP

Python
7
star
21

So2Sat-BuildingType

A large-scale dataset for building type classification using social media and aerial data
Python
6
star
22

softcon

Multi-label guided soft contrastive learning for efficient Earth observation pretraining
Python
6
star
23

A-Relation-Augmented-Fully-Convolutional-Network-for-Semantic-Segmentation-in-Aerial-Scenes

Mou et al., A Relation-Augmented Fully Convolutional Network for Semantic Segmentation in Aerial Scenes, CVPR 2019, accepted.
5
star
24

PFST

Official implementation of TGRS paper: "Pseudo Features Guided Self-training for Domain Adaptive Semantic Segmentation of Satellite Images"
Python
5
star
25

SARptical

Wang et al., β€œFusing Meter-Resolution 4-D InSAR Point Clouds and Optical Images for Semantic Urban Infrastructure Monitoring,” IEEE Transactions on Geoscience and Remote Sensing, 55(1), 14–26, 2017.
4
star
26

Deep-Recurrent-Neural-Networks-for-Hyperspectral-Image-Classification

Mou et al., Deep Recurrent Neural Networks for Hyperspectral Image Classification, IEEE Transactions on Geoscience and Remote Sensing, 55(7), 3639-3655, 2017.
4
star
27

VQA-easy2hard

Code for paper "From easy to hard: Learning language-guided curriculum for visual question answering on remote sensing data"
Python
4
star
28

AdaptMatch

This is official code of "AdaptMatch: Adaptive Matching for Semisupervised Binary Segmentation of Remote Sensing Images," IEEE-TGRS, 2023
Python
3
star
29

Floodcast

3
star
30

Busy-parking-lot-dataset---vehicle-detection-in-UAV-video

L. Mou and X. X. Zhu, "Vehicle instance segmentation from aerial image and video using a multi-task learning residual fully convolutional network," IEEE Transactions on Geoscience and Remote Sensing, vol. 56, no. 11, pp. 6699-6711, 2018.
3
star
31

Benchmark-of-streetview-image-of-individual-building-instance

Kang et al., Building instance classification using street view images, ISPRS Journal of Photogrammetry and Remote Sensing, 145, 44-59, 2018.
3
star
32

ERA-DATASET

A Dataset and Deep Learning Benchmark for Event Recognition in Aerial Videos
2
star
33

SEN12MS

Schmitt et al., SEN12MS – A Curated Dataset of Georeferenced Multi-spectral Sentinel-1/2 Imagery for Deep Learning and Data Fusion, ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., IV-2/W7, 153–160, 2019.
2
star
34

LULC_MSI_QCNN

Land Cover Classification from Sentinel-2 Images with Quantum-Classical Convolutional Neural Networks (FQCNN and MQCNN)
Python
2
star
35

Cross-cityDTL

Python
2
star
36

ForestFormer

Python
2
star
37

BiomassUQ

Data used in the paper "Biomass Estimation and Uncertainty Quantification from Tree Height".
R
2
star
38

UTB_codes

Nationwide urban tree canopy mapping and coverage assessment in Brazil
Python
1
star
39

SEN12MSCR

1
star
40

land_consumption

MATLAB
1
star
41

SEQNN

Hybrid Quantum Deep Learning with Superpixel Encoding for Earth Observation Data Classification
1
star
42

PeFNN

1
star
43

RS-DWL

This is official Pytorch implementation of "Decouple and Weight Semi-supervised Semantic Segmentation of Remote Sensing Images, " ISPRS, 2024.
Python
1
star