Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical Image Segmentation

Hu, Yiyue; Zhang, Lei; Mu, Nan; Liu, Lei

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2211.09533 (eess)

COVID-19 e-print

Important: e-prints posted on arXiv are not peer-reviewed by arXiv; they should not be relied upon without context to guide clinical practice or health-related behavior and should not be reported in news media as established information without consulting multiple experts in the field.

[Submitted on 17 Nov 2022]

Title:Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical Image Segmentation

Authors:Yiyue Hu, Lei Zhang, Nan Mu, Lei Liu

View PDF

Abstract:Transformers have achieved remarkable success in medical image analysis owing to their powerful capability to use flexible self-attention mechanism. However, due to lacking intrinsic inductive bias in modeling visual structural information, they generally require a large-scale pre-training schedule, limiting the clinical applications over expensive small-scale medical data. To this end, we propose a parameter-efficient transformer to explore intrinsic inductive bias via position information for medical image segmentation. Specifically, we empirically investigate how different position encoding strategies affect the prediction quality of the region of interest (ROI), and observe that ROIs are sensitive to the position encoding strategies. Motivated by this, we present a novel Hybrid Axial-Attention (HAA), a form of position self-attention that can be equipped with spatial pixel-wise information and relative position information as inductive bias. Moreover, we introduce a gating mechanism to alleviate the burden of training schedule, resulting in efficient feature selection over small-scale datasets. Experiments on the BraTS and Covid19 datasets prove the superiority of our method over the baseline and previous works. Internal workflow visualization with interpretability is conducted to better validate our success.

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2211.09533 [eess.IV]
	(or arXiv:2211.09533v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2211.09533

Submission history

From: Lei Liu [view email]
[v1] Thu, 17 Nov 2022 13:54:55 UTC (6,265 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical Image Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical Image Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators