Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization
An accurate whole heart segmentation (WHS) on medical images, including computed tomography (CT) and magnetic resonance (MR) images, plays a crucial role in many clinical applications, such as cardiovascular disease diagnosis, pre-surgical planning, and intraoperative treatment. Manual whole-heart s...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-04-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/11/9/3912 |
id |
doaj-ffa6dc8320a8424e99d6f30ff091b71b |
---|---|
record_format |
Article |
spelling |
doaj-ffa6dc8320a8424e99d6f30ff091b71b2021-04-26T23:02:09ZengMDPI AGApplied Sciences2076-34172021-04-01113912391210.3390/app11093912Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder RegularizationMarija Habijan0Irena Galić1Hrvoje Leventić2Krešimir Romić3Faculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaFaculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaFaculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaFaculty of Electrical Engineering, Computer Science and Information Technology Osijek, 31000 Osijek, CroatiaAn accurate whole heart segmentation (WHS) on medical images, including computed tomography (CT) and magnetic resonance (MR) images, plays a crucial role in many clinical applications, such as cardiovascular disease diagnosis, pre-surgical planning, and intraoperative treatment. Manual whole-heart segmentation is a time-consuming process, prone to subjectivity and error. Therefore, there is a need to develop a quick, automatic, and accurate whole heart segmentation systems. Nowadays, convolutional neural networks (CNNs) emerged as a robust approach for medical image segmentation. In this paper, we first introduce a novel connectivity structure of residual unit that we refer to as a feature merge residual unit (FM-Pre-ResNet). The proposed connectivity allows the creation of distinctly deep models without an increase in the number of parameters compared to the pre-activation residual units. Second, we propose a three-dimensional (3D) encoder–decoder based architecture that successfully incorporates FM-Pre-ResNet units and variational autoencoder (VAE). In an encoding stage, FM-Pre-ResNet units are used for learning a low-dimensional representation of the input. After that, the variational autoencoder (VAE) reconstructs the input image from the low-dimensional latent space to provide a strong regularization of all model weights, simultaneously preventing overfitting on the training data. Finally, the decoding stage creates the final whole heart segmentation. We evaluate our method on the 40 test subjects of the MICCAI Multi-Modality Whole Heart Segmentation (MM-WHS) Challenge. The average dice values of whole heart segmentation are 90.39% (CT images) and 89.50% (MRI images), which are both highly comparable to the state-of-the-art.https://www.mdpi.com/2076-3417/11/9/3912artificial intelligencecardiac CTcardiac MRIdeep learningResNetvariational autoencoder |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Marija Habijan Irena Galić Hrvoje Leventić Krešimir Romić |
spellingShingle |
Marija Habijan Irena Galić Hrvoje Leventić Krešimir Romić Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization Applied Sciences artificial intelligence cardiac CT cardiac MRI deep learning ResNet variational autoencoder |
author_facet |
Marija Habijan Irena Galić Hrvoje Leventić Krešimir Romić |
author_sort |
Marija Habijan |
title |
Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization |
title_short |
Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization |
title_full |
Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization |
title_fullStr |
Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization |
title_full_unstemmed |
Whole Heart Segmentation Using 3D FM-Pre-ResNet Encoder–Decoder Based Architecture with Variational Autoencoder Regularization |
title_sort |
whole heart segmentation using 3d fm-pre-resnet encoder–decoder based architecture with variational autoencoder regularization |
publisher |
MDPI AG |
series |
Applied Sciences |
issn |
2076-3417 |
publishDate |
2021-04-01 |
description |
An accurate whole heart segmentation (WHS) on medical images, including computed tomography (CT) and magnetic resonance (MR) images, plays a crucial role in many clinical applications, such as cardiovascular disease diagnosis, pre-surgical planning, and intraoperative treatment. Manual whole-heart segmentation is a time-consuming process, prone to subjectivity and error. Therefore, there is a need to develop a quick, automatic, and accurate whole heart segmentation systems. Nowadays, convolutional neural networks (CNNs) emerged as a robust approach for medical image segmentation. In this paper, we first introduce a novel connectivity structure of residual unit that we refer to as a feature merge residual unit (FM-Pre-ResNet). The proposed connectivity allows the creation of distinctly deep models without an increase in the number of parameters compared to the pre-activation residual units. Second, we propose a three-dimensional (3D) encoder–decoder based architecture that successfully incorporates FM-Pre-ResNet units and variational autoencoder (VAE). In an encoding stage, FM-Pre-ResNet units are used for learning a low-dimensional representation of the input. After that, the variational autoencoder (VAE) reconstructs the input image from the low-dimensional latent space to provide a strong regularization of all model weights, simultaneously preventing overfitting on the training data. Finally, the decoding stage creates the final whole heart segmentation. We evaluate our method on the 40 test subjects of the MICCAI Multi-Modality Whole Heart Segmentation (MM-WHS) Challenge. The average dice values of whole heart segmentation are 90.39% (CT images) and 89.50% (MRI images), which are both highly comparable to the state-of-the-art. |
topic |
artificial intelligence cardiac CT cardiac MRI deep learning ResNet variational autoencoder |
url |
https://www.mdpi.com/2076-3417/11/9/3912 |
work_keys_str_mv |
AT marijahabijan wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization AT irenagalic wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization AT hrvojeleventic wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization AT kresimirromic wholeheartsegmentationusing3dfmpreresnetencoderdecoderbasedarchitecturewithvariationalautoencoderregularization |
_version_ |
1721507316007698432 |