Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks
Early Barrett’s neoplasia are often missed due to subtle visual features and inexperience of the non-expert endoscopist with such lesions. While promising results have been reported on the automated detection of this type of early cancer in still endoscopic images, video-based detection using the te...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-07-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/20/15/4133 |
id |
doaj-1948d2c3497a4c5d81758d0d182fcddd |
---|---|
record_format |
Article |
spelling |
doaj-1948d2c3497a4c5d81758d0d182fcddd2020-11-25T03:02:15ZengMDPI AGSensors1424-82202020-07-01204133413310.3390/s20154133Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural NetworksTim Boers0Joost van der Putten1Maarten Struyvenberg2Kiki Fockens3Jelmer Jukema4Erik Schoon5Fons van der Sommen6Jacques Bergman7Peter de With8Eindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The NetherlandsEindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The NetherlandsAmsterdam University Medical Center, Meibergdreef 9, 1105 AZ Amsterdam, The NetherlandsAmsterdam University Medical Center, Meibergdreef 9, 1105 AZ Amsterdam, The NetherlandsAmsterdam University Medical Center, Meibergdreef 9, 1105 AZ Amsterdam, The NetherlandsCatharina Hospital, Michelangelolaan 2, 5623 EJ Eindhoven, The NetherlandsEindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The NetherlandsAmsterdam University Medical Center, Meibergdreef 9, 1105 AZ Amsterdam, The NetherlandsEindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The NetherlandsEarly Barrett’s neoplasia are often missed due to subtle visual features and inexperience of the non-expert endoscopist with such lesions. While promising results have been reported on the automated detection of this type of early cancer in still endoscopic images, video-based detection using the temporal domain is still open. The temporally stable nature of video data in endoscopic examinations enables to develop a framework that can diagnose the imaged tissue class over time, thereby yielding a more robust and improved model for spatial predictions. We show that the introduction of Recurrent Neural Network nodes offers a more stable and accurate model for tissue classification, compared to classification on individual images. We have developed a customized Resnet18 feature extractor with four types of classifiers: Fully Connected (FC), Fully Connected with an averaging filter (FC Avg(n = 5)), Long Short Term Memory (LSTM) and a Gated Recurrent Unit (GRU). Experimental results are based on 82 pullback videos of the esophagus with 46 high-grade dysplasia patients. Our results demonstrate that the LSTM classifier outperforms the FC, FC Avg(n = 5) and GRU classifier with an average accuracy of 85.9% compared to 82.2%, 83.0% and 85.6%, respectively. The benefit of our novel implementation for endoscopic tissue classification is the inclusion of spatio-temporal information for improved and robust decision making, and it is the first step towards full temporal learning of esophageal cancer detection in endoscopic video.https://www.mdpi.com/1424-8220/20/15/4133Barrett neoplasiatissue detectionrecurrent neural networksupper GI tract |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Tim Boers Joost van der Putten Maarten Struyvenberg Kiki Fockens Jelmer Jukema Erik Schoon Fons van der Sommen Jacques Bergman Peter de With |
spellingShingle |
Tim Boers Joost van der Putten Maarten Struyvenberg Kiki Fockens Jelmer Jukema Erik Schoon Fons van der Sommen Jacques Bergman Peter de With Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks Sensors Barrett neoplasia tissue detection recurrent neural networks upper GI tract |
author_facet |
Tim Boers Joost van der Putten Maarten Struyvenberg Kiki Fockens Jelmer Jukema Erik Schoon Fons van der Sommen Jacques Bergman Peter de With |
author_sort |
Tim Boers |
title |
Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks |
title_short |
Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks |
title_full |
Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks |
title_fullStr |
Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks |
title_full_unstemmed |
Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks |
title_sort |
improving temporal stability and accuracy for endoscopic video tissue classification using recurrent neural networks |
publisher |
MDPI AG |
series |
Sensors |
issn |
1424-8220 |
publishDate |
2020-07-01 |
description |
Early Barrett’s neoplasia are often missed due to subtle visual features and inexperience of the non-expert endoscopist with such lesions. While promising results have been reported on the automated detection of this type of early cancer in still endoscopic images, video-based detection using the temporal domain is still open. The temporally stable nature of video data in endoscopic examinations enables to develop a framework that can diagnose the imaged tissue class over time, thereby yielding a more robust and improved model for spatial predictions. We show that the introduction of Recurrent Neural Network nodes offers a more stable and accurate model for tissue classification, compared to classification on individual images. We have developed a customized Resnet18 feature extractor with four types of classifiers: Fully Connected (FC), Fully Connected with an averaging filter (FC Avg(n = 5)), Long Short Term Memory (LSTM) and a Gated Recurrent Unit (GRU). Experimental results are based on 82 pullback videos of the esophagus with 46 high-grade dysplasia patients. Our results demonstrate that the LSTM classifier outperforms the FC, FC Avg(n = 5) and GRU classifier with an average accuracy of 85.9% compared to 82.2%, 83.0% and 85.6%, respectively. The benefit of our novel implementation for endoscopic tissue classification is the inclusion of spatio-temporal information for improved and robust decision making, and it is the first step towards full temporal learning of esophageal cancer detection in endoscopic video. |
topic |
Barrett neoplasia tissue detection recurrent neural networks upper GI tract |
url |
https://www.mdpi.com/1424-8220/20/15/4133 |
work_keys_str_mv |
AT timboers improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks AT joostvanderputten improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks AT maartenstruyvenberg improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks AT kikifockens improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks AT jelmerjukema improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks AT erikschoon improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks AT fonsvandersommen improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks AT jacquesbergman improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks AT peterdewith improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks |
_version_ |
1724690579611189248 |