Transformer Based Multi-model Fusion for Medical Image Segmentation

Authors

  • Bo Dong Zhejiang University
  • Wenhai Wang
  • Jinpeng Li

DOI:

https://doi.org/10.5617/nmi.9171

Keywords:

polyp segmentation, surgical instrument segmentation, vision transformer

Abstract

We present our solutions to the MedAI for all three tasks: polyp segmentation task, instrument segmentation task, and transparency task. We use the same framework to process the two segmentation tasks of polyps and instruments. The key improvement over last year is new state-of-the-art vision architectures, especially transformers which significantly outperform ConvNets for the medical image segmentation tasks. Our solution consists of multiple segmentation models, and each model uses a transformer as the backbone network. we get the best IoU score of 0.915 on the instrument segmentation task and 0.836 on polyp segmentation task after submitting. Meanwhile, we provide complete solutions in https://github.com/dongbo811/MedAI-2021.

Downloads

Published

2021-11-01