Semi-Automatic Classification Of Histopathological Images: Dealing With Inter-Slide Variations

Introduction/ Background The large size and high resolution of histopathological whole slide images renders their manual annotation time-consuming and costly. State-of-the-art computer-based segmentation approaches are generally able to classify tissue reliably, but strong inter-slide variations bet...

Full description

Bibliographic Details
Main Authors: Michael Gadermayr, M. Strauch, J. Unger, P. Boor, B.M. Klinkhammer, S. Djudjaj, D. Merhof
Format: Article
Language:English
Published: DiagnomX 2016-06-01
Series:Diagnostic Pathology
Online Access:http://www.diagnosticpathology.eu/content/index.php/dpath/article/view/161
id doaj-270cd509b3a84eaf9b5165e162d0b636
record_format Article
spelling doaj-270cd509b3a84eaf9b5165e162d0b6362020-11-24T21:08:00ZengDiagnomXDiagnostic Pathology2364-48932016-06-011810.17629/www.diagnosticpathology.eu-2016-8:161 Semi-Automatic Classification Of Histopathological Images: Dealing With Inter-Slide VariationsMichael Gadermayr0M. Strauch1J. Unger2P. Boor3B.M. Klinkhammer4S. Djudjaj5D. Merhof6RWTH Aachen University, Aachen, GermanyRWTH Aachen University, Aachen, GermanyRWTH Aachen University, Aachen, GermanyRWTH Aachen University, Aachen, GermanyRWTH Aachen University, Aachen, GermanyRWTH Aachen University, Aachen, GermanyRWTH Aachen University, Aachen, GermanyIntroduction/ Background The large size and high resolution of histopathological whole slide images renders their manual annotation time-consuming and costly. State-of-the-art computer-based segmentation approaches are generally able to classify tissue reliably, but strong inter-slide variations between training and evaluation data can cause significant decreases in classification accuracy. Aims In this study, we focus on alpha-SMA stainings of the mouse kidney, and in particular on the classification of glomerular vs. non-glomerular regions. Even though all slides had been recorded using a common staining protocol, inter-slide variations could be observed. We investigate the impact of these variations as well as methods of resolution. Methods We propose an interactive, semi-automatic tissue classification approach [1] which adapts a pre-trained classification model to the new image on which classification should be performed. Image patches for which the class (glomerular/non-glomerular) is uncertain are automatically selected and presented to the user to determine the class label. The user interaction step is repeated several times to iteratively adjust the model to the characteristics of the new image. For image representation and classification, well known methods from the literature are utilized. Specifically, we combine Local Binary Patters with the support vector classifier. Results In case of 50 available labelled sample patches of a certain whole slide image, the overall classification rate increased from 92 % to 98 % through including the interactive labelling step. Even with only 20 labelled patches, accuracy already increased to 97 %. Without a pre-trained model, if training is performed on target domain data only, 88 % (20 labelled samples) and 95 % (50 labelled samples) accuracy, respectively, were obtained. If enough target domain data was available (about 20 images), the amount of source domain data was of minor relevance. The difference in outcome between a source domain training data set containing 100 patches from one whole slide image and a set containing 700 patches from seven images was lower than 1 %. Contrarily, without target domain data, the difference in accuracy was 10 % (82 % compared to 92 %) between these two settings. Execution runtime between two interaction steps is significantly below one second (0.23 s), which is an important usability criterion. It proved to be beneficial to select specific target domain data in an active learning sense based on the currently available trained model. While experimental evaluation provided strong empirical evidence for increased classification performance with the proposed method, the additional manual effort can be kept at a low level. The labelling of e.g. 20 images per slide is surely less time consuming than the validation of a complete whole slide image processed with a fully automatic, but less reliable, segmentation approach. Finally, it should be highlighted that the proposed interaction protocol could easily be adapted to other histopathological classification or segmentation tasks, also for implementation in a clinical system.   http://www.diagnosticpathology.eu/content/index.php/dpath/article/view/161
collection DOAJ
language English
format Article
sources DOAJ
author Michael Gadermayr
M. Strauch
J. Unger
P. Boor
B.M. Klinkhammer
S. Djudjaj
D. Merhof
spellingShingle Michael Gadermayr
M. Strauch
J. Unger
P. Boor
B.M. Klinkhammer
S. Djudjaj
D. Merhof
Semi-Automatic Classification Of Histopathological Images: Dealing With Inter-Slide Variations
Diagnostic Pathology
author_facet Michael Gadermayr
M. Strauch
J. Unger
P. Boor
B.M. Klinkhammer
S. Djudjaj
D. Merhof
author_sort Michael Gadermayr
title Semi-Automatic Classification Of Histopathological Images: Dealing With Inter-Slide Variations
title_short Semi-Automatic Classification Of Histopathological Images: Dealing With Inter-Slide Variations
title_full Semi-Automatic Classification Of Histopathological Images: Dealing With Inter-Slide Variations
title_fullStr Semi-Automatic Classification Of Histopathological Images: Dealing With Inter-Slide Variations
title_full_unstemmed Semi-Automatic Classification Of Histopathological Images: Dealing With Inter-Slide Variations
title_sort semi-automatic classification of histopathological images: dealing with inter-slide variations
publisher DiagnomX
series Diagnostic Pathology
issn 2364-4893
publishDate 2016-06-01
description Introduction/ Background The large size and high resolution of histopathological whole slide images renders their manual annotation time-consuming and costly. State-of-the-art computer-based segmentation approaches are generally able to classify tissue reliably, but strong inter-slide variations between training and evaluation data can cause significant decreases in classification accuracy. Aims In this study, we focus on alpha-SMA stainings of the mouse kidney, and in particular on the classification of glomerular vs. non-glomerular regions. Even though all slides had been recorded using a common staining protocol, inter-slide variations could be observed. We investigate the impact of these variations as well as methods of resolution. Methods We propose an interactive, semi-automatic tissue classification approach [1] which adapts a pre-trained classification model to the new image on which classification should be performed. Image patches for which the class (glomerular/non-glomerular) is uncertain are automatically selected and presented to the user to determine the class label. The user interaction step is repeated several times to iteratively adjust the model to the characteristics of the new image. For image representation and classification, well known methods from the literature are utilized. Specifically, we combine Local Binary Patters with the support vector classifier. Results In case of 50 available labelled sample patches of a certain whole slide image, the overall classification rate increased from 92 % to 98 % through including the interactive labelling step. Even with only 20 labelled patches, accuracy already increased to 97 %. Without a pre-trained model, if training is performed on target domain data only, 88 % (20 labelled samples) and 95 % (50 labelled samples) accuracy, respectively, were obtained. If enough target domain data was available (about 20 images), the amount of source domain data was of minor relevance. The difference in outcome between a source domain training data set containing 100 patches from one whole slide image and a set containing 700 patches from seven images was lower than 1 %. Contrarily, without target domain data, the difference in accuracy was 10 % (82 % compared to 92 %) between these two settings. Execution runtime between two interaction steps is significantly below one second (0.23 s), which is an important usability criterion. It proved to be beneficial to select specific target domain data in an active learning sense based on the currently available trained model. While experimental evaluation provided strong empirical evidence for increased classification performance with the proposed method, the additional manual effort can be kept at a low level. The labelling of e.g. 20 images per slide is surely less time consuming than the validation of a complete whole slide image processed with a fully automatic, but less reliable, segmentation approach. Finally, it should be highlighted that the proposed interaction protocol could easily be adapted to other histopathological classification or segmentation tasks, also for implementation in a clinical system.  
url http://www.diagnosticpathology.eu/content/index.php/dpath/article/view/161
work_keys_str_mv AT michaelgadermayr semiautomaticclassificationofhistopathologicalimagesdealingwithinterslidevariations
AT mstrauch semiautomaticclassificationofhistopathologicalimagesdealingwithinterslidevariations
AT junger semiautomaticclassificationofhistopathologicalimagesdealingwithinterslidevariations
AT pboor semiautomaticclassificationofhistopathologicalimagesdealingwithinterslidevariations
AT bmklinkhammer semiautomaticclassificationofhistopathologicalimagesdealingwithinterslidevariations
AT sdjudjaj semiautomaticclassificationofhistopathologicalimagesdealingwithinterslidevariations
AT dmerhof semiautomaticclassificationofhistopathologicalimagesdealingwithinterslidevariations
_version_ 1716761261503414272