Carotid Intima-Media Thickness Segmentation using Attention Mechanism based Convolutional Neural Network with Domain-Specific Objective Function
DOI:
https://doi.org/10.47611/jsrhs.v11i3.2708Keywords:
Attention Mechanism, CIMT, Encoder-Decoder System, Connection Loss FunctionAbstract
CIMT (Carotid Intima-Media Thickness) has been proven to be both a significant and reliable marker for the evaluation of the risk of cardiovascular disease. Cardiovascular disease is the leading cause of mortality globally, and yet could be easily treated if detected in its early stages. A prime indicator of Cardiovascular disease, CIMT has previously been measured through manual examination of ultrasound videos for the gap between the Lumen-Intima and the Media-Adventitia interfaces, the two inner layers of the Carotid Artery. However, this method is not only inconvenient, but also time consuming. There has been a significant number of previous deep learning approaches to this issue, which have yielded substantial results. However, as this problem concerns the morality of patients directly, medical professionals have been hesitant to be dependent on these approaches, as the current accuracy of the state-of-the-art model still falls short to human observations. Furthermore, high performing models come at high computational costs. CIMT can actually be determined by a miniscule region of the Carotid Ultrasonic image, which many past researches have not taken into consideration. This paper proposes to use an attention mechanism to determine the region of interest and an encoder-decoder system which significantly reduces computational trade off while maintaining comparable accuracy. We also propose a novel connection loss to solve the disconnection problem in the prediction. The proposed model yields an unprecedented accuracy in terms of IoU and ACC of 0.78 and 0.99 respectively, substantially higher than previous state-of-the-art models by 18% and 8.8% on average.
Downloads
References or Bibliography
O'Leary, D. H., & Bots, M. L. (2010). Imaging of atherosclerosis: carotid intima-media thickness. European heart journal, 31(14), 1682–1689. https://doi.org/10.1093/eurheartj/ehq185
Shin, Jae, et al. "Automating carotid intima-media thickness video interpretation with convolutional neural networks." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016.
Al-Mohannadi, A., Al-Maadeed, S., Elharrouss, O., & Sadasivuni, K. K. (2021). Encoder-Decoder Architecture for Ultrasound IMC Segmentation and cIMT Measurement. Sensors (Basel, Switzerland), 21(20), 6839. https://doi.org/10.3390/s21206839
Ronneberger, Olaf, Philipp Fischer, and Thomas Brox. "U-net: Convolutional networks for biomedical image segmentation." International Conference on Medical image computing and computer-assisted intervention. Springer, Cham, 2015. https://doi.org/10.48550/arXiv.1505.04597
Badrinarayanan, Vijay, Alex Kendall, and Roberto Cipolla. "Segnet: A deep convolutional encoder-decoder architecture for image segmentation." IEEE transactions on pattern analysis and machine intelligence 39.12 (2017): 2481-2495. https://doi.org/10.48550/arXiv.1511.00561
Chen, Liang-Chieh, et al. "Encoder-decoder with atrous separable convolution for semantic image segmentation." Proceedings of the European conference on computer vision (ECCV). 2018. https://doi.org/10.48550/arXiv.1802.02611
He, Kaiming, et al. "Deep residual learning for image recognition." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016. https://doi.org/10.48550/arXiv.1512.03385
Howard, Andrew G., et al. "Mobilenets: Efficient convolutional neural networks for mobile vision applications." arXiv preprint arXiv:1704.04861 (2017). https://doi.org/10.48550/arXiv.1704.04861
Huang, Gao, et al. "Densely connected convolutional networks." Proceedings of the IEEE conference on computer vision and pattern recognition. 2017. https://doi.org/10.48550/arXiv.1608.06993
Szegedy, Christian, et al. "Going deeper with convolutions." Proceedings of the IEEE conference on computer vision and pattern recognition. 2015. https://doi.org/10.48550/arXiv.1409.4842
Kingma, Diederik P., and Jimmy Ba. "Adam: A method for stochastic optimization." arXiv preprint arXiv:1412.6980 (2014). https://doi.org/10.48550/arXiv.1412.6980
J. Deng, W. Dong, R. Socher, L. -J. Li, Kai Li and Li Fei-Fei, "ImageNet: A large-scale hierarchical image database," 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009, pp. 248-255, doi: 10.1109/CVPR.2009.5206848.
Jain, Jitesh, et al. "SeMask: Semantically Masked Transformers for Semantic Segmentation." arXiv preprint arXiv:2112.12782 (2021). https://doi.org/10.48550/arXiv.2112.12782
Cheng, Bowen, et al. "Masked-attention mask transformer for universal image segmentation." arXiv preprint arXiv:2112.01527 (2021). https://doi.org/10.48550/arXiv.2112.01527
Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." arXiv preprint arXiv:1409.1556 (2014). https://doi.org/10.48550/arXiv.1409.1556
Published
How to Cite
Issue
Section
Copyright (c) 2022 Juheon Rhee; Pyae Sone Kway
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distriute & display this article.