Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization

An accurate whole heart segmentation (WHS) on medical images, including computed tomography (CT) and magnetic resonance (MR) images, plays a crucial role in many clinical applications, such as cardiovascular disease diagnosis, pre-surgical planning, and intraoperative treatment. Manual whole-heart s...

Full description

Bibliographic Details
Main Authors:	Marija Habijan, Irena Galić, Hrvoje Leventić, Krešimir Romić
Format:	Article
Language:	English
Published:	MDPI AG 2021-04-01
Series:	Applied Sciences
Subjects:	artificial intelligence cardiac CT cardiac MRI deep learning ResNet variational autoencoder
Online Access:	https://www.mdpi.com/2076-3417/11/9/3912

id	doaj-ffa6dc8320a8424e99d6f30ff091b71b
record_format	Article
spelling	doaj-ffa6dc8320a8424e99d6f30ff091b71b2021-04-26T23:02:09ZengMDPI AGApplied Sciences2076-34172021-04-01113912391210.3390/app11093912Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder RegularizationMarija Habijan0Irena Galić1Hrvoje Leventić2Krešimir Romić3Faculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaFaculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaFaculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaFaculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaAn accurate whole heart segmentation (WHS) on medical images, including computed tomography (CT) and magnetic resonance (MR) images, plays a crucial role in many clinical applications, such as cardiovascular disease diagnosis, pre-surgical planning, and intraoperative treatment. Manual whole-heart segmentation is a time-consuming process, prone to subjectivity and error. Therefore, there is a need to develop a quick, automatic, and accurate whole heart segmentation systems. Nowadays, convolutional neural networks (CNNs) emerged as a robust approach for medical image segmentation. In this paper, we first introduce a novel connectivity structure of residual unit that we refer to as a feature merge residual unit (FM-Pre-ResNet). The proposed connectivity allows the creation of distinctly deep models without an increase in the number of parameters compared to the pre-activation residual units. Second, we propose a three-dimensional (3D) encoder–decoder based architecture that successfully incorporates FM-Pre-ResNet units and variational autoencoder (VAE). In an encoding stage, FM-Pre-ResNet units are used for learning a low-dimensional representation of the input. After that, the variational autoencoder (VAE) reconstructs the input image from the low-dimensional latent space to provide a strong regularization of all model weights, simultaneously preventing overfitting on the training data. Finally, the decoding stage creates the final whole heart segmentation. We evaluate our method on the 40 test subjects of the MICCAI Multi-Modality Whole Heart Segmentation (MM-WHS) Challenge. The average dice values of whole heart segmentation are 90.39% (CT images) and 89.50% (MRI images), which are both highly comparable to the state-of-the-art.https://www.mdpi.com/2076-3417/11/9/3912artificial intelligencecardiac CTcardiac MRIdeep learningResNetvariational autoencoder
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Marija Habijan Irena Galić Hrvoje Leventić Krešimir Romić
spellingShingle	Marija Habijan Irena Galić Hrvoje Leventić Krešimir Romić Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization Applied Sciences artificial intelligence cardiac CT cardiac MRI deep learning ResNet variational autoencoder
author_facet	Marija Habijan Irena Galić Hrvoje Leventić Krešimir Romić
author_sort	Marija Habijan
title	Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
title_short	Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
title_full	Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
title_fullStr	Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
title_full_unstemmed	Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
title_sort	whole heart segmentation using 3d fm-pre-resnet encoder–decoder based architecture with variational autoencoder regularization
publisher	MDPI AG
series	Applied Sciences
issn	2076-3417
publishDate	2021-04-01
description	An accurate whole heart segmentation (WHS) on medical images, including computed tomography (CT) and magnetic resonance (MR) images, plays a crucial role in many clinical applications, such as cardiovascular disease diagnosis, pre-surgical planning, and intraoperative treatment. Manual whole-heart segmentation is a time-consuming process, prone to subjectivity and error. Therefore, there is a need to develop a quick, automatic, and accurate whole heart segmentation systems. Nowadays, convolutional neural networks (CNNs) emerged as a robust approach for medical image segmentation. In this paper, we first introduce a novel connectivity structure of residual unit that we refer to as a feature merge residual unit (FM-Pre-ResNet). The proposed connectivity allows the creation of distinctly deep models without an increase in the number of parameters compared to the pre-activation residual units. Second, we propose a three-dimensional (3D) encoder–decoder based architecture that successfully incorporates FM-Pre-ResNet units and variational autoencoder (VAE). In an encoding stage, FM-Pre-ResNet units are used for learning a low-dimensional representation of the input. After that, the variational autoencoder (VAE) reconstructs the input image from the low-dimensional latent space to provide a strong regularization of all model weights, simultaneously preventing overfitting on the training data. Finally, the decoding stage creates the final whole heart segmentation. We evaluate our method on the 40 test subjects of the MICCAI Multi-Modality Whole Heart Segmentation (MM-WHS) Challenge. The average dice values of whole heart segmentation are 90.39% (CT images) and 89.50% (MRI images), which are both highly comparable to the state-of-the-art.
topic	artificial intelligence cardiac CT cardiac MRI deep learning ResNet variational autoencoder
url	https://www.mdpi.com/2076-3417/11/9/3912
work_keys_str_mv	AT marijahabijan wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization AT irenagalic wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization AT hrvojeleventic wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization AT kresimirromic wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization
_version_	1721507316007698432

Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization

Similar Items