Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks

Early Barrett’s neoplasia are often missed due to subtle visual features and inexperience of the non-expert endoscopist with such lesions. While promising results have been reported on the automated detection of this type of early cancer in still endoscopic images, video-based detection using the te...

Full description

Bibliographic Details
Main Authors: Tim Boers, Joost van der Putten, Maarten Struyvenberg, Kiki Fockens, Jelmer Jukema, Erik Schoon, Fons van der Sommen, Jacques Bergman, Peter de With
Format: Article
Language:English
Published: MDPI AG 2020-07-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/20/15/4133
id doaj-1948d2c3497a4c5d81758d0d182fcddd
record_format Article
spelling doaj-1948d2c3497a4c5d81758d0d182fcddd2020-11-25T03:02:15ZengMDPI AGSensors1424-82202020-07-01204133413310.3390/s20154133Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural NetworksTim Boers0Joost van der Putten1Maarten Struyvenberg2Kiki Fockens3Jelmer Jukema4Erik Schoon5Fons van der Sommen6Jacques Bergman7Peter de With8Eindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The NetherlandsEindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The NetherlandsAmsterdam University Medical Center, Meibergdreef 9, 1105 AZ Amsterdam, The NetherlandsAmsterdam University Medical Center, Meibergdreef 9, 1105 AZ Amsterdam, The NetherlandsAmsterdam University Medical Center, Meibergdreef 9, 1105 AZ Amsterdam, The NetherlandsCatharina Hospital, Michelangelolaan 2, 5623 EJ Eindhoven, The NetherlandsEindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The NetherlandsAmsterdam University Medical Center, Meibergdreef 9, 1105 AZ Amsterdam, The NetherlandsEindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The NetherlandsEarly Barrett’s neoplasia are often missed due to subtle visual features and inexperience of the non-expert endoscopist with such lesions. While promising results have been reported on the automated detection of this type of early cancer in still endoscopic images, video-based detection using the temporal domain is still open. The temporally stable nature of video data in endoscopic examinations enables to develop a framework that can diagnose the imaged tissue class over time, thereby yielding a more robust and improved model for spatial predictions. We show that the introduction of Recurrent Neural Network nodes offers a more stable and accurate model for tissue classification, compared to classification on individual images. We have developed a customized Resnet18 feature extractor with four types of classifiers: Fully Connected (FC), Fully Connected with an averaging filter (FC Avg(n = 5)), Long Short Term Memory (LSTM) and a Gated Recurrent Unit (GRU). Experimental results are based on 82 pullback videos of the esophagus with 46 high-grade dysplasia patients. Our results demonstrate that the LSTM classifier outperforms the FC, FC Avg(n = 5) and GRU classifier with an average accuracy of 85.9% compared to 82.2%, 83.0% and 85.6%, respectively. The benefit of our novel implementation for endoscopic tissue classification is the inclusion of spatio-temporal information for improved and robust decision making, and it is the first step towards full temporal learning of esophageal cancer detection in endoscopic video.https://www.mdpi.com/1424-8220/20/15/4133Barrett neoplasiatissue detectionrecurrent neural networksupper GI tract
collection DOAJ
language English
format Article
sources DOAJ
author Tim Boers
Joost van der Putten
Maarten Struyvenberg
Kiki Fockens
Jelmer Jukema
Erik Schoon
Fons van der Sommen
Jacques Bergman
Peter de With
spellingShingle Tim Boers
Joost van der Putten
Maarten Struyvenberg
Kiki Fockens
Jelmer Jukema
Erik Schoon
Fons van der Sommen
Jacques Bergman
Peter de With
Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks
Sensors
Barrett neoplasia
tissue detection
recurrent neural networks
upper GI tract
author_facet Tim Boers
Joost van der Putten
Maarten Struyvenberg
Kiki Fockens
Jelmer Jukema
Erik Schoon
Fons van der Sommen
Jacques Bergman
Peter de With
author_sort Tim Boers
title Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks
title_short Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks
title_full Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks
title_fullStr Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks
title_full_unstemmed Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks
title_sort improving temporal stability and accuracy for endoscopic video tissue classification using recurrent neural networks
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2020-07-01
description Early Barrett’s neoplasia are often missed due to subtle visual features and inexperience of the non-expert endoscopist with such lesions. While promising results have been reported on the automated detection of this type of early cancer in still endoscopic images, video-based detection using the temporal domain is still open. The temporally stable nature of video data in endoscopic examinations enables to develop a framework that can diagnose the imaged tissue class over time, thereby yielding a more robust and improved model for spatial predictions. We show that the introduction of Recurrent Neural Network nodes offers a more stable and accurate model for tissue classification, compared to classification on individual images. We have developed a customized Resnet18 feature extractor with four types of classifiers: Fully Connected (FC), Fully Connected with an averaging filter (FC Avg(n = 5)), Long Short Term Memory (LSTM) and a Gated Recurrent Unit (GRU). Experimental results are based on 82 pullback videos of the esophagus with 46 high-grade dysplasia patients. Our results demonstrate that the LSTM classifier outperforms the FC, FC Avg(n = 5) and GRU classifier with an average accuracy of 85.9% compared to 82.2%, 83.0% and 85.6%, respectively. The benefit of our novel implementation for endoscopic tissue classification is the inclusion of spatio-temporal information for improved and robust decision making, and it is the first step towards full temporal learning of esophageal cancer detection in endoscopic video.
topic Barrett neoplasia
tissue detection
recurrent neural networks
upper GI tract
url https://www.mdpi.com/1424-8220/20/15/4133
work_keys_str_mv AT timboers improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks
AT joostvanderputten improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks
AT maartenstruyvenberg improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks
AT kikifockens improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks
AT jelmerjukema improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks
AT erikschoon improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks
AT fonsvandersommen improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks
AT jacquesbergman improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks
AT peterdewith improvingtemporalstabilityandaccuracyforendoscopicvideotissueclassificationusingrecurrentneuralnetworks
_version_ 1724690579611189248