Skip to content

Latest commit

 

History

History
88 lines (68 loc) · 5.89 KB

README.md

File metadata and controls

88 lines (68 loc) · 5.89 KB

Parameter Efficient Self-Supervised Geospatial Domain Adaptation

Paper | Poster | Slides | Video | Blog | CVPR Page

This repository contains code supporting the CVPR 2024 paper Parameter Efficient Self-supervised Geospatial Domain Adaptation.

Authors: Linus Scheibenreif Michael Mommert Damian Borth

Overview image

Background

This work proposes a three-step approach to adapt geospatial foundation models to new dataset modalities (see figure above):

  1. Add SLR adapter parameters to the pre-trained foundation model.
  2. Train the adapters via self-supervised masked image modeling on unlabeled data from the target domain.
  3. Fine-tune the adapters supervised for the target task.

This codebase includes scripts to train and evaluate different geospatial foundation models on a number of remote sensing datasets.

Getting Started

This codebase provides scripts to add SLR adapters to existing, trained, visual foundation models before fine-tuning them on different downstream tasks. To get started, make sure that the trained weights for a visual foundation model are available in the checkpoints/ directory and download a dataset for training (either via the torchgeo library or through the links provided below). For each foundation model, the pre-trained weights should be stored in a different sub-directory (checkpoints/{mae, sat-mae, scale-mae}) See below for the models and datasets used in the paper.

Training SLR adapters (Steps 1 + 2)

Follow these steps to add and train SLR adapters for one of the supported foundation model / dataset combinations.

  • Update the configuration at configs/mae/experiment.yaml with the following values:
    • wandb.entity: your wandb id
    • data.datamodule: class name of the desired datamodule (see src/datamodules for all supported options)
    • data.modality if the chosen dataset contains multiple modalities, choose one here.
    • data.size similarly, if the dataset contains images of different sizes, choose one here (e.g., TreeSatAI).
    • data.root set to the parent directory of the dataset.
    • model.name sets the foundation model, supported options are mae, sat_mae, and scale_mae.
    • to change the bottleneck dimensionality of the low-rank adapters (called r in the paper), set model.adapter_hidden_dim.
    • finally, change all other settings (e.g., learning rate or patch size) as you wish.
  • Start the training run with python main_mae.py.

Details about the training run will be logged to weights and biases and stored in the outputs/ directory.

Fine-tuning / Linear eval (Step 3)

To evaluate a model with trained SLR adapters, follow these steps.

  • Update the configuration at configs/lin_eval/experiment.yaml:
    • continual_pretrain_run set the wandb run id of the SLR adapter pre-training run (see step above).
    • data.* make sure the data configuration matches your run from Step 2 above (now you also need to set the number of classes in the dataset). Optionally, few-shot experiments can be run by setting few_shot_k and a few_shot_seed.
    • model.* settings need to match the pre-training run.
  • Start the training run with python main_linear_eval.py.

Evaluation on the test set

To test fine-tuned models (i.e., after Step 3), create a file containing all wandb run ids that should be tested (see configs/run_ids.csv for a template). This can also only contain one line in addition to the column names. Then poin the run_id_file variable in main_test_checkpoints.py to that file. Finally, run python main_test_checkpoints.py to run the tests. Results will be written to files in the logs/tests/ directory.

Models

MAE

SatMAE

Scale-MAE

Datasets

Where possible, use torchgeo implementations of remote sensing datasets. Please download the other dataset from these locations:

Dependencies

  • see requirements.txt
  • torchgeo==0.5.0
  • torch==2.0.1

Acknowledgements

If you would like to reference our work, please use the following reference:

@InProceedings{Scheibenreif_2024_CVPR,
    author    = {Scheibenreif, Linus and Mommert, Michael and Borth, Damian},
    title     = {Parameter Efficient Self-Supervised Geospatial Domain Adaptation},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2024},
    pages     = {27841-27851}
}