COMPARATIVE ANALYSIS OF U-NET, U-NET++, TRANSUNET AND SWIN-UNET FOR LUNG X-RAY SEGMENTATION

ӨКПЕНІҢ РЕНТГЕНДІК КЕСКІНДЕРІН СЕГМЕНТТЕУ МӘСЕЛЕСІНДЕ U-NET, U-NET++, TRANSUNET ЖӘНЕ SWIN-UNET МОДЕЛЬДЕРІН САЛЫСТЫРМАЛЫ ТАЛДАУ
СРАВНИТЕЛЬНЫЙ АНАЛИЗ МОДЕЛЕЙ U-NET, U-NET++, TRANSUNET AND SWIN-UNET В ЗАДАЧЕ СЕГМЕНТАЦИИ РЕНТГЕН-СНИМКОВ ЛЕГКОГО
Nam D. Pak A.
2024 Kazakh-British Technical University

Herald of the Kazakh British Technical UNiversity
2024 #21 Issue 2 42 - 53 pp.

Medical image segmentation is a widely used task in medical image processing. It allows us to receive the location and size of the required instance. Several critical factors should be considered. First, the model should provide an accurate prediction of the mask. Second, the model should not require a lot of computational resources. Finally, the distribution between the false positive and false negative predictions should be considered. We provide the comparative analysis between four deep learning models, base U-Net and its extension U-Net++, TranUNet, and Swin-UNet for lung X-ray segmentation based on trainable parameters, DICE, IoU, Hausdorff Distance, Precision and Recall. CNN models with the smallest number of parameters show the highest DICE and IoU scores than their descendants on the limited-size dataset. Based on the experiment results provided in the article U-Nethas maximum DICE, IoU, and precision. It makes the model the most appropriate for medical image segmentation. SwinU-Net is the model with minimum Hausdorff Distance. U-Net++ has the maximum Recall.

CNN , medical image processing , segmentation , transformers

Text of the article Перейти на текст статьи

Kazakh-British Technical University, Almaty, 050000, Kazakhstan

Kazakh-British Technical University

10 лет помогаем публиковать статьи Международный издатель

Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026