Skip to content

whuwuteng/Aerial_Stereo_Dataset

Repository files navigation

An evaluation of Deep Learning based stereo dense matching dataset shift from aerial images and a large scale stereo dataset

This Github will provide the detail information of our paper An evaluation of Deep Learning based stereo dense matching dataset shift from aerial images and a large scale stereo dataset.

Because the length limit of International Journal of Applied Earth Observation and Geoinformation is 8000 words, the original paper can be found on Arxiv, if you have any questions about this long version paper, please contact me.

This is the Github repository for the stereo dense matching benchmark for AI4GEO project.

In order to discuss the transferability of deep learning methods on aerial dataset, we produce 6 aerial dataset covers 4 different area.

All the data and model can be found On Zenodo

History

This work is an extension of our previous work, and the old version dataset is already published. In the ISPRS Conress 2022 in Nice, we presented an extension work as a poster, and the slide and the poster is provided.

Introduction

For stereo dense matching, there are many famous benchmark dataset in Robust Vision, for example, KITTI stereo and middlebury stereo. With the development of machine learning, especially deep learning, these methods usually need a lot of training data(or ground truth). For photogrammetry community, as far as we know, it is not easy to find these training data. We will publish our data as ground truth. The data is produced from original image and LiDAR dataset. To be noticed, the image and LiDAR should be well-registered.

Global information of dataset

For each dataset, the global information of the dataset is listed follow:

Dataset Color GSD(cm) LiDAR($pt/m^2$) Origin orientation ICP refined Outlier remove Difficulty
ISPRS-Vaihingen IR-R-G 8 6.7 x x ++
EuroSDR-Vaihingen R-G-B 20 6.7 x x ++
Toulouse-UMBRA R-G-B 12.5 2-4 x ++++
Toulouse-Métropole R-G-B 5 8 x x +
Enschede R-G-B 10 10 x +++
DublinCity R-G-B 3.4 250-348 x x ++

In the table, the origin orientation accuracy influence the data accuracy, in order to improve the quality of the dataset, an ICP based Image-LiDAR is proposed to refine the orientation.

Dataset structure

The training and evaluation dataset is also provided, the structure of the folder is same with the old version.

Because the whole dataset is too large, so only the used in the paper is uploaded. The data will be hosted by Zenodo, because the limitation is 50GB, so the dataset used be in the paper and the pre-trained models are available on Zenodo. (Don't upload data for the community "Photogrammetry - Geological Remote Sensing", the dataset must be reviewed, and it will takes too long time to wait, and I give up.)

ISPRS-Vaihingen

All the training and testing data can be found on Google Drive, this is a newer version compare to the old version, the origin image and LiDAR used are same with old version. To visualize the data in GoogleEarth, we need know the projection system of the data, for Vaihingen data, the system is WGS 84 / UTM zone 32N.

*Origin  ISPRS-Vaihingen coverage*
Origin ISPRS-Vaihingen coverage

An example is show here :

*Left image* *Left image* *Left image* *Left image* *Left image*
Example for ISPRS-Vaihingen

EuroSDR-Vaihingen

This dataset is collected nearly same time with ISPRS-Vaihingen, the difference is that the resolution, the dataset can be found here, because the LiDAR is from ISPRS-Vaihingen, so only a small part of the data is used.

*Origin  EuroSDR-Vaihingen coverage*
Origin EuroSDR-Vaihingen coverage

An example is show here :

*Left image* *Left image* *Left image* *Left image* *Left image*
Example for EuroSDR-Vaihingen

Toulouse-UMBRA

This dataset is collected by IGN(French map agency) in 2012, and the camera is produced by IGN, the origin image is 16bit, this dataset is for remote sensing use, to make the training data, we use auto just. And in the experiment, we found that the image is quite different from other dataset.

*Origin Toulouse-UMBRA coverage*
Origin Toulouse-UMBRA coverage

An example is show here :

*Left image* *Left image* *Left image* *Left image* *Left image*
Example for Toulouse-UMBRA

Toulouse-Métropole

This dataset is collect by AI4GEO in 2019, the camera is UltraCam Osprey Prime M3, and the LiDAR is ALS70. The origin dataset is too large, only the area same with the Toulous-UMBRA is used in the paper for produce the dataset.

*Origin Toulouse-Métropole coverage*
Origin Toulouse-Métropole coverage

An example is show here :

*Left image* *Left image* *Left image* *Left image* *Left image*
Example for Toulouse-Métropole

Enschede

This dataset is a dataset collected from ITC Faculty Geo-Information Science and Earth Observation in 2011, the LiDAR is AHN2 in 2012. The origin device has 5 cameras, only the nadir camera is used in the experiment.

To visualize the data in GoogleEarth, we need know the projection system of the data, for Enschede data, the system is Amersfoort.

*Origin  Enschede coverage*
Origin Enschede coverage

An example is show here :

*Left image* *Left image* *Left image* *Left image* *Left image*
Example for Enschede

DublinCity

DublinCity is an open dataset, the original aerial and LiDAR point cloud can be downloaded, the origin dataset is very large.

You can find the training and testing dataset from another paper. To save the disk, we do not upload this time, more information can be found on Github also.

An example is show here :

*Left image* *Left image* *Left image* *Left image* *Left image*
Example for DublinCity

Method

In the paper, we evaluate the state of the art methods of deep learning on stereo dense matching before 2020.

MicMac

MicMac can is a open source code, the code can be found from Github. In the experiment, the command line is :

mm3d MM1P left.tif right.tif NONE DefCor=0.2 HasSBG=false HasVeg=true 

SGM(GPU)

This method is revised during the experiment, because the origin disparity range is too small, i.e 128, in our experiment, 256 is used. The code can be found in folder.

GraphCuts

This origin code can be found from Github, to make the code run in Linux, a new version can be found here.

CBMV

The code can be found from Github.

MC-CNN

The code can be found from Github.

DeepFeature

The code can be found here.

PSM net

The origin code can be found from Github.

HRS net

The origin code can be found from Github.

DeepPruner

The origin code can be found from Github.

GANet

The origin code can be found from Github.

LEAStereo

The origin code can be found from Github.

Pretrained models

The pretrained models are important in the paper, so we will also share the pretrained models and training setting in the paper.

CBMV

The pre-trained model for the 6 dataset are provide in CBMV_Model.zip :

Model Name training data images
CBMV_model_ISPRS-Vaihingen.rf ISPRS-Vaihingen 200
CBMV_model_EuroSDR-Vaihingen.rf EuroSDR-Vaihingen 200
CBMV_model_Toulouse-UMBRA.rf Toulouse-UMBRA 200
CBMV_model_Toulouse-Metropole.rf Toulouse-Metropole 200
CBMV_model_Enschede.rf Enschede 200
CBMV_model_DublinCity.rf DublinCity 200

MC-CNN

The pre-trained model for the 6 dataset are provide in MC-CNN_Model.zip, and a model trained on all the image :

Model Name training data images
MC-CNN_model_ISPRS-Vaihingen.t7 ISPRS-Vaihingen 1200
MC-CNN_model_EuroSDR-Vaihingen.t7 EuroSDR-Vaihingen 1200
MC-CNN_model_Toulouse-UMBRA.t7 Toulouse-UMBRA 1200
MC-CNN_model_Toulouse-Metropole.t7 Toulouse-Metropole 1200
MC-CNN_model_Enschede.t7 Enschede 1200
MC-CNN_model_DublinCity.t7 DublinCity 1200
MC-CNN_model_All.t7 6 dataset 1200

DeepFeature

The pre-trained model for the 6 dataset are provide in DeepFeature_Model.zip, and a model trained on all the image :

Model Name training data images
bn_meanvar_ISPRS-Vaihingen.t7 and param_ISPRS-Vaihingen.t7 ISPRS-Vaihingen 1200
bn_meanvar_EuroSDR-Vaihingen.t7 and param_EuroSDR-Vaihingen.t7 EuroSDR-Vaihingen images1200
bn_meanvar_Toulouse-UMBRA.t7 and param_Toulouse-UMBRA.t7 Toulouse-UMBRA 1200
bn_meanvar_Toulouse-Metropole.t7 and param_Toulouse-Metropole.t7 Toulouse-Metropole 1200
bn_meanvar_Enschede.t7 and param_Enschede.t7 Enschede 1200
bn_meanvar_DublinCity.t7 and param_DublinCity.t7 DublinCity 1200
bn_meanvar_all.t7 and param_all.t7 6 dataset 1200

PSM net

The pre-trained model for the 6 dataset are provide in PSMNet_Model.zip, and a model trained on all the image :

Model Name training data images
PSMNet_Model_ISPRS-Vaihingen.tar ISPRS-Vaihingen 1200
PSMNet_Model_EuroSDR-Vaihingen.tar EuroSDR-Vaihingen 1200
PSMNet_Model_Toulouse-UMBRA.tar Toulouse-UMBRA 1200
PSMNet_Model_Toulouse-Metropole.tar Toulouse-Metropole 1200
PSMNet_Model_Enschede.tar Enschede 1200
PSMNet_Model_DublinCity.tar DublinCity 1200
PSMNet_Model_All.tar 6 dataset 1200

HRS net

The pre-trained model for the 6 dataset are provide in HRSNet_Model.zip, and a model trained on all the image :

Model Name training data images
HRSNet_Model_ISPRS-Vaihingen.tar ISPRS-Vaihingen 1200
HRSNet_Model_EuroSDR-Vaihingen.tar EuroSDR-Vaihingen 1200
HRSNet_Model_Toulouse-UMBRA.tar Toulouse-UMBRA 1200
HRSNet_Model_Toulouse-Metropole.tar Toulouse-Metropole 1200
HRSNet_Model_Enschede.tar Enschede 1200
HRSNet_Model_DublinCity.tar DublinCity 1200
HRSNet_Model_All.tar 6 dataset 1200

DeepPruner

The pre-trained model for the 6 dataset are provide in DeepPruner_Model.zip, and a model trained on all the image :

Model Name training data images
DeepPruner_model_ISPRS-Vaihingen.tar ISPRS-Vaihingen 1200
DeepPruner_model_EuroSDR-Vaihingen.tar EuroSDR-Vaihingen 1200
DeepPruner_model_Toulouse-UMBRA.tar Toulouse-UMBRA 1200
DeepPruner_model_Toulouse-Metropole.tar Toulouse-Metropole 1200
DeepPruner_model_Enschede.tar Enschede 1200
DeepPruner_model_DublinCity.tar DublinCity 1200
DeepPruner_model_All.tar 6 dataset 1200

GANet

The pre-trained model for the 6 dataset are provide in GANet_Model.zip, and a model trained on all the image :

Model Name training data images
GANet_model_ISPRS-Vaihingen.pth ISPRS-Vaihingen 1200
GANet_model_EuroSDR-Vaihingen.pth EuroSDR-Vaihingen 1200
GANet_model_Toulouse-UMBRA.pth Toulouse-UMBRA 1200
GANet_model_Toulouse-Metropole.pth Toulouse-Metropole 1200
GANet_model_Enschede.pth Enschede 1200
GANet_model_DublinCity.pth DublinCity 1200
GANet_model_All.pth 6 dataset 1200

LEAStereo

The pre-trained model for the 6 dataset are provide in LEAStereo_Model.zip, and a model trained on all the image :

Model Name training data images
LEAStereo_model_ISPRS-Vaihingen.pth ISPRS-Vaihingen 1200
LEAStereo_model_EuroSDR-Vaihingen.pth EuroSDR-Vaihingen 1200
LEAStereo_model_Toulouse-UMBRA.pth Toulouse-UMBRA 1200
LEAStereo_model_Toulouse-Metropole.pth Toulouse-Metropole 1200
LEAStereo_model_Enschede.pth Enschede 1200
LEAStereo_model_DublinCity.pth DublinCity 1200
LEAStereo_model_All.pth 6 dataset 1200

TODO

  • Image-LiDAR process
  • Publish dataset V1 (use in the paper)
  • Publish the long paper on Arxiv
  • Publish pretrained models
  • Publish full dataset (we don't have the host, the full dataset can be provided after required)

Stereo-LiDAR fusion

Based on the data generation, we also generate the Toulouse2020 data from IGN, and this data can be found in our CVPR photogrammetry and computer vision workshop paper. The Github site can be found here.

Citation

If you think you have any problem, contact [Teng Wu]whuwuteng@gmail.com

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published