Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization

An accurate whole heart segmentation (WHS) on medical images, including computed tomography (CT) and magnetic resonance (MR) images, plays a crucial role in many clinical applications, such as cardiovascular disease diagnosis, pre-surgical planning, and intraoperative treatment. Manual whole-heart s...

Full description

Bibliographic Details
Main Authors: Marija Habijan, Irena Galić, Hrvoje Leventić, Krešimir Romić
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/9/3912
id doaj-ffa6dc8320a8424e99d6f30ff091b71b
record_format Article
spelling doaj-ffa6dc8320a8424e99d6f30ff091b71b2021-04-26T23:02:09ZengMDPI AGApplied Sciences2076-34172021-04-01113912391210.3390/app11093912Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder RegularizationMarija Habijan0Irena Galić1Hrvoje Leventić2Krešimir Romić3Faculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaFaculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaFaculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaFaculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaAn accurate whole heart segmentation (WHS) on medical images, including computed tomography (CT) and magnetic resonance (MR) images, plays a crucial role in many clinical applications, such as cardiovascular disease diagnosis, pre-surgical planning, and intraoperative treatment. Manual whole-heart segmentation is a time-consuming process, prone to subjectivity and error. Therefore, there is a need to develop a quick, automatic, and accurate whole heart segmentation systems. Nowadays, convolutional neural networks (CNNs) emerged as a robust approach for medical image segmentation. In this paper, we first introduce a novel connectivity structure of residual unit that we refer to as a feature merge residual unit (FM-Pre-ResNet). The proposed connectivity allows the creation of distinctly deep models without an increase in the number of parameters compared to the pre-activation residual units. Second, we propose a three-dimensional (3D) encoder–decoder based architecture that successfully incorporates FM-Pre-ResNet units and variational autoencoder (VAE). In an encoding stage, FM-Pre-ResNet units are used for learning a low-dimensional representation of the input. After that, the variational autoencoder (VAE) reconstructs the input image from the low-dimensional latent space to provide a strong regularization of all model weights, simultaneously preventing overfitting on the training data. Finally, the decoding stage creates the final whole heart segmentation. We evaluate our method on the 40 test subjects of the MICCAI Multi-Modality Whole Heart Segmentation (MM-WHS) Challenge. The average dice values of whole heart segmentation are 90.39% (CT images) and 89.50% (MRI images), which are both highly comparable to the state-of-the-art.https://www.mdpi.com/2076-3417/11/9/3912artificial intelligencecardiac CTcardiac MRIdeep learningResNetvariational autoencoder
collection DOAJ
language English
format Article
sources DOAJ
author Marija Habijan
Irena Galić
Hrvoje Leventić
Krešimir Romić
spellingShingle Marija Habijan
Irena Galić
Hrvoje Leventić
Krešimir Romić
Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
Applied Sciences
artificial intelligence
cardiac CT
cardiac MRI
deep learning
ResNet
variational autoencoder
author_facet Marija Habijan
Irena Galić
Hrvoje Leventić
Krešimir Romić
author_sort Marija Habijan
title Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
title_short Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
title_full Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
title_fullStr Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
title_full_unstemmed Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
title_sort whole heart segmentation using 3d fm-pre-resnet encoder–decoder based architecture with variational autoencoder regularization
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2021-04-01
description An accurate whole heart segmentation (WHS) on medical images, including computed tomography (CT) and magnetic resonance (MR) images, plays a crucial role in many clinical applications, such as cardiovascular disease diagnosis, pre-surgical planning, and intraoperative treatment. Manual whole-heart segmentation is a time-consuming process, prone to subjectivity and error. Therefore, there is a need to develop a quick, automatic, and accurate whole heart segmentation systems. Nowadays, convolutional neural networks (CNNs) emerged as a robust approach for medical image segmentation. In this paper, we first introduce a novel connectivity structure of residual unit that we refer to as a feature merge residual unit (FM-Pre-ResNet). The proposed connectivity allows the creation of distinctly deep models without an increase in the number of parameters compared to the pre-activation residual units. Second, we propose a three-dimensional (3D) encoder–decoder based architecture that successfully incorporates FM-Pre-ResNet units and variational autoencoder (VAE). In an encoding stage, FM-Pre-ResNet units are used for learning a low-dimensional representation of the input. After that, the variational autoencoder (VAE) reconstructs the input image from the low-dimensional latent space to provide a strong regularization of all model weights, simultaneously preventing overfitting on the training data. Finally, the decoding stage creates the final whole heart segmentation. We evaluate our method on the 40 test subjects of the MICCAI Multi-Modality Whole Heart Segmentation (MM-WHS) Challenge. The average dice values of whole heart segmentation are 90.39% (CT images) and 89.50% (MRI images), which are both highly comparable to the state-of-the-art.
topic artificial intelligence
cardiac CT
cardiac MRI
deep learning
ResNet
variational autoencoder
url https://www.mdpi.com/2076-3417/11/9/3912
work_keys_str_mv AT marijahabijan wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization
AT irenagalic wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization
AT hrvojeleventic wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization
AT kresimirromic wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization
_version_ 1721507316007698432