Scalable Swin Transformer network for brain tumor segmentation from incomplete MRI modalities.

in Artificial intelligence in medicine by Dongsong Zhang, Changjian Wang, Tianhua Chen, Weidao Chen, Yiqing Shen

TLDR

  • IMSTrans is a new way to look at brain MRI scans to find tumors. It uses a single encoder to extract information from all available modalities, which helps it share information and work more efficiently. The study found that IMSTrans worked better than other methods and was more scalable.

Abstract

Deep learning methods have shown great potential in processing multi-modal Magnetic Resonance Imaging (MRI) data, enabling improved accuracy in brain tumor segmentation. However, the performance of these methods can suffer when dealing with incomplete modalities, which is a common issue in clinical practice. Existing solutions, such as missing modality synthesis, knowledge distillation, and architecture-based methods, suffer from drawbacks such as long training times, high model complexity, and poor scalability. This paper proposes IMSTrans, a novel lightweight scalable Swin Transformer network by utilizing a single encoder to extract latent feature maps from all available modalities. This unified feature extraction process enables efficient information sharing and fusion among the modalities, resulting in efficiency without compromising segmentation performance even in the presence of missing modalities. Two datasets, BraTS 2018 and BraTS 2020, containing incomplete modalities for brain tumor segmentation are evaluated against popular benchmarks. On the BraTS 2018 dataset, our model achieved higher average Dice similarity coefficient (DSC) scores for the whole tumor, tumor core, and enhancing tumor regions (86.57, 75.67, and 58.28, respectively), in comparison with a state-of-the-art model, i.e. mmFormer (86.45, 75.51, and 57.79, respectively). Similarly, on the BraTS 2020 dataset, our model scored higher DSC scores in these three brain tumor regions (87.33, 79.09, and 62.11, respectively) compared to mmFormer (86.17, 78.34, and 60.36, respectively). We also conducted a Wilcoxon test on the experimental results, and the generated p-value confirmed that our model's performance was statistically significant. Moreover, our model exhibits significantly reduced complexity with only 4.47 M parameters, 121.89G FLOPs, and a model size of 77.13 MB, whereas mmFormer comprises 34.96 M parameters, 265.79 G FLOPs, and a model size of 559.74 MB. These indicate our model, being light-weighted with significantly reduced parameters, is still able to achieve better performance than a state-of-the-art model. By leveraging a single encoder for processing the available modalities, IMSTrans offers notable scalability advantages over methods that rely on multiple encoders. This streamlined approach eliminates the need for maintaining separate encoders for each modality, resulting in a lightweight and scalable network architecture. The source code of IMSTrans and the associated weights are both publicly available at https://github.com/hudscomdz/IMS2Trans.

Overview

  • The study focuses on developing a novel lightweight scalable Swin Transformer network, IMSTrans, for processing multi-modal Magnetic Resonance Imaging (MRI) data for brain tumor segmentation. The hypothesis being tested is that IMSTrans can achieve better performance than existing solutions in the presence of missing modalities, while maintaining scalability and efficiency. The methodology used for the experiment includes two datasets, BraTS 2018 and BraTS 2020, containing incomplete modalities for brain tumor segmentation. The primary objective of the study is to evaluate the performance of IMSTrans against popular benchmarks and demonstrate its scalability and efficiency advantages over existing solutions.

Comparative Analysis & Findings

  • The study compares the performance of IMSTrans with popular benchmarks on two datasets, BraTS 2018 and BraTS 2020, containing incomplete modalities for brain tumor segmentation. The results show that IMSTrans achieved higher average Dice similarity coefficient (DSC) scores for the whole tumor, tumor core, and enhancing tumor regions compared to a state-of-the-art model, i.e. mmFormer. The Wilcoxon test confirmed that the performance of IMSTrans was statistically significant.

Implications and Future Directions

  • The study's findings demonstrate the potential of IMSTrans for improving the accuracy of brain tumor segmentation in the presence of missing modalities while maintaining scalability and efficiency. Future research directions could include exploring the use of IMSTrans for other multi-modal medical imaging tasks, such as lesion detection and classification, and investigating the impact of different hyperparameters on the performance of the network.