BSA-Seg: A Bi-level sparse attention network combining narrow band loss for multi-target medical image segmentation.

in Neural networks : the official journal of the International Neural Network Society by Zhiyong Zhou, Zhechen Zhou, Xusheng Qian, Jisu Hu, Bo Peng, Chen Geng, Bin Dai, He Huang, Wenbin Zhang, Yakang Dai

TLDR

  • This study proposes a bi-level sparse attention network and narrow-band loss function for accurate and efficient multi-target medical image segmentation, outperforming state-of-the-art methods in a series of comprehensive experiments.
  • The proposed method addresses issues of over-segmentation and under-segmentation by guiding the network to perform boundary-aware segmentation, resulting in enhanced segmentation accuracy and reduced computational complexity.

Abstract

Segmentation of multiple targets of varying sizes within medical images is of significant importance for the diagnosis of disease and pathological research. Transformer-based methods are emerging in the medical image segmentation, leveraging the powerful yet computationally intensive self-attention mechanism. A variety of attention mechanisms have been proposed to reduce computation at the cost of accuracy loss, utilizing handcrafted patterns within local or artificially defined receptive fields. Furthermore, the common region-based loss functions are insufficient for guiding the transformer to focus on tissue regions, resulting in their unsuitability for the segmentation of tissues with intricate boundaries. This paper presents the development of a bi-level sparse attention network and a narrow band (NB) loss function for the accurate and efficient multi-target segmentation of medical images. In particular, we introduce a bi-level sparse attention module (BSAM) and formulate a segmentation network based on this module. The BSAM consists of coarse-grained patch-level attention and fine-grained pixel-level attention, which captures fine-grained contextual features in adaptive receptive fields learned by patch-level attention. This results in enhanced segmentation accuracy while simultaneously reducing computational complexity. The proposed narrow-band (NB) loss function constructs a target region in close proximity to the tissue boundary. The network is thus guided to perform boundary-aware segmentation, thereby simultaneously alleviating the issues of over-segmentation and under-segmentation. A series of comprehensive experiments on whole brains, brain tumors and abdominal organs, demonstrate that our method outperforms other state-of-the-art segmentation methods. Furthermore, the BSAM and NB loss can be applied flexibly to a variety of network frameworks.

Overview

  • The study proposes a bi-level sparse attention network (BSAN) and a narrow-band (NB) loss function for accurate and efficient multi-target segmentation of medical images.
  • The BSAN consists of coarse-grained patch-level attention and fine-grained pixel-level attention, capturing fine-grained contextual features in adaptive receptive fields learned by patch-level attention.
  • The proposed method aims to overcome the issues of over-segmentation and under-segmentation by guiding the network to perform boundary-aware segmentation.

Comparative Analysis & Findings

  • The proposed method outperforms other state-of-the-art segmentation methods in a series of comprehensive experiments on whole brains, brain tumors, and abdominal organs.
  • The BSAN and NB loss function can be applied flexibly to various network frameworks, offering high adaptability.
  • The study demonstrates enhanced segmentation accuracy and reduced computational complexity using the BSAN and NB loss function.

Implications and Future Directions

  • The proposed method has the potential to significantly impact clinical practice in the diagnosis of disease and pathological research, particularly for complex medical image segmentation tasks.
  • Future research directions may involve exploring the application of the proposed method to other medical image segmentation tasks, such as cardiac or pulmonary imaging.
  • The study highlights the importance of developing novel and efficient attention mechanisms and loss functions for transformer-based medical image segmentation methods.