GitHub - isaaccorley/resize-is-all-you-need: The official repository for the paper "Revisiting pre-trained remote sensing model benchmarks: resizing and normalization matters"

Revisiting pre-trained remote sensing model benchmarks: resizing and normalization matters

Isaac Corley¹ · Caleb Robinson² · Rahul Dodhia² · Juan M. Lavista Ferres² · Peyman Najafirad (Paul Rad)¹

¹University of Texas at San Antonio ²Microsoft AI for Good Research Lab

Figure 1. Difference in downstream task metrics, Overall Accuracy (OA) (multiclass) or mean Average Precision (mAP) (multilabel), after resizing images to 224 × 224 from the original, smaller, image sizes. ImageNet pretrained models often are trained with 224 x 224 inputs and therefore do not produce useful embeddings with smaller image patches.

This is the official repository for the paper, "Revisiting pre-trained remote sensing model benchmarks: resizing and normalization matters" presented at the 2024 CVPR PBVS Workshop.

In this paper, we find that simply resizing and normalizing remote sensing imagery correctly provides a significant boost, particlarly when transferring ImageNet pretrained models to the remote sensing domain

Resizing

Remote sensing benchmark datasets, e.g. EuroSAT -- 64 x 64, commonly have small image sizes due to patches being extracted from large satellite tiles. However, we find that recently, evaluation is being performed at these small image sizes while being trained at larger image sizes.

Figure 2. The effect of input image size on EuroSAT downstream performance (overall accuracy) across different ResNet models. By default, EuroSAT images are 64 × 64 pixels, however resizing to larger image sizes before embedding increases downstream accuracy under a KNN (k = 5) classification model in all cases.

Normalization

Furthermore, we find that many pretrained geospatial foundation models are sensitive to the standard normalization used during inference. Blindly using ImageNet statistics can significantly degrade representation ability

Figure 3. t-SNE plots of EuroSAT test set embeddings extracted using a ResNet50 pretrained on ImageNet with different preprocessing. (left to right: 32 × 32 with normalization, 224 × 224 without normalization, 224 × 224 with normalization)

Extracting Features

We have provided a sample script for extracting features using various models from the paper from your own folder of remote sensing imagery. Please modify the script to your use case (for best performance you will need the mean/std of your dataset). The extracted features will be saved to output_directory/model_features.npy

python embed.py --model resnet50_pretrained_moco --output-dir outputs --root path/to/your/folder --image-size 224 --batch-size 32 --workers 8 --device cuda:0

Cite

If this work inspired you to properly resize and normalize your images in benchmarking please consider citing our paper

@InProceedings{Corley_2024_CVPR, author = {Corley, Isaac and Robinson, Caleb and Dodhia, Rahul and Ferres, Juan M. Lavista and Najafirad, Peyman}, title = {Revisiting Pre-trained Remote Sensing Model Benchmarks: Resizing and Normalization Matters}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2024}, pages = {3162-3172} }

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
figures		figures
notebooks		notebooks
results		results
scripts		scripts
src		src
.flake8		.flake8
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
embed.py		embed.py
eurosat_size_vs_performance.py		eurosat_size_vs_performance.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Revisiting pre-trained remote sensing model benchmarks: resizing and normalization matters

In this paper, we find that simply resizing and normalizing remote sensing imagery correctly provides a significant boost, particlarly when transferring ImageNet pretrained models to the remote sensing domain

Resizing

Normalization

Extracting Features

Cite

About

Releases

Packages

Contributors 2

Languages

License

isaaccorley/resize-is-all-you-need

Folders and files

Latest commit

History

Repository files navigation

Revisiting pre-trained remote sensing model benchmarks: resizing and normalization matters

In this paper, we find that simply resizing and normalizing remote sensing imagery correctly provides a significant boost, particlarly when transferring ImageNet pretrained models to the remote sensing domain

Resizing

Normalization

Extracting Features

Cite

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages