GitHub - CAMMA-public/ScalingSurgicalSSL: Official repository for "Jumpstarting Surgical Computer Vision"

Jumpstarting Surgical Computer Vision

Deepak Alapatt, Aditya Murali, Vinkle Srivastav, Pietro Mascagni, AI4SafeChole Consortium, Nicolas Padoy, MICCAI, 2024

Code coming soon

Introduction

Consensus amongst researchers and industry points to a lack of large, representative annotated datasets as the biggest obstacle to progress in the field of surgical data science. Advances in Self-Supervised Learning (SSL) represent a solution, reducing the dependence on large labeled datasets by providing task-agnostic initializations. However, the robustness of current self-supervised learning methods to domain shifts remains unclear, limiting our understanding of its utility for leveraging diverse sources of surgical data. Shifting the focus from methods to data, we demonstrate that the downstream value of SSL-based initializations is intricately intertwined with the composition of pre-training datasets. These results underscore an important gap that needs to be filled as we scale self-supervised approaches toward building general-purpose ``foundation models'' that enable diverse use-cases within the surgical domain. Through several stages of controlled experimentation, we develop recommendations for pretraining dataset composition evidenced through over 300 experiments spanning 20 pre-training datasets, 9 surgical procedures, 7 centers (hospitals), 3 labeled-data settings, 3 downstream tasks, and multiple runs. Using the approaches here described, we outperform state-of-the-art pre-trainings on two public benchmarks for phase recognition: up to 2.2% on Cholec80 and 5.1% on AutoLaparo.

Citation

@article{alapatt2023jumpstarting,
  title={Jumpstarting Surgical Computer Vision},
  author={Alapatt, Deepak and Murali, Aditya and Srivastav, Vinkle and Mascagni, Pietro and Consortium, AI4SafeChole and Padoy, Nicolas},
  booktitle={International conference on medical image computing and computer-assisted intervention},
  year={2024},
  organization={Springer}
}

@article{ramesh2023dissecting,
  title={Dissecting self-supervised learning methods for surgical computer vision},
  author={Ramesh, Sanat and Srivastav, Vinkle and Alapatt, Deepak and Yu, Tong and Murali, Aditya and Sestini, Luca and Nwoye, Chinedu Innocent and Hamoud, Idris and Sharma, Saurav and Fleurentin, Antoine and others},
  journal={Medical Image Analysis},
  pages={102844},
  year={2023},
  publisher={Elsevier}
}

References

The project uses VISSL. We thank the authors of VISSL for releasing the library. If you use VISSL, consider citing it using the following BibTeX entry.

@misc{goyal2021vissl,
  author =       {Priya Goyal and Quentin Duval and Jeremy Reizenstein and Matthew Leavitt and Min Xu and
                  Benjamin Lefaudeux and Mannat Singh and Vinicius Reis and Mathilde Caron and Piotr Bojanowski and
                  Armand Joulin and Ishan Misra},
  title =        {VISSL},
  howpublished = {\url{https://github.com/facebookresearch/vissl}},
  year =         {2021}
}

License

This code, models, and datasets are available for non-commercial scientific research purposes as defined in the CC BY-NC-SA 4.0. By downloading and using this code you agree to the terms in the LICENSE. Third-party codes are subject to their respective licenses.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
checkpoints/defaults/resnet_50		checkpoints/defaults/resnet_50
configs/config/hparams/cholec80		configs/config/hparams/cholec80
datasets/cholec80		datasets/cholec80
downstream_phase_tcn		downstream_phase_tcn
downstream_triplet		downstream_triplet
ext_libs		ext_libs
static		static
utils		utils
vissl/vissl		vissl/vissl
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
main_ft_phase_tcn.py		main_ft_phase_tcn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Jumpstarting Surgical Computer Vision

Code coming soon

Introduction

Citation

References

License

About

Releases

Packages

Languages

License

CAMMA-public/ScalingSurgicalSSL

Folders and files

Latest commit

History

Repository files navigation

Jumpstarting Surgical Computer Vision

Code coming soon

Introduction

Citation

References

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages