Reliable computational quantification of liver fibrosis is compromised by inherent staining variation

Abstract Biopsy remains the gold‐standard measure for staging liver disease, both to inform prognosis and to assess the response to a given treatment. Semiquantitative scores such as the Ishak fibrosis score are used for evaluation. These scores are utilised in clinical trials, with the US Food and...

Full description

Bibliographic Details
Main Authors: Stuart Astbury, Jane I Grove, David A Dorward, Indra N Guha, Jonathan A Fallowfield, Timothy J Kendall
Format: Article
Language:English
Published: Wiley 2021-09-01
Series:The Journal of Pathology: Clinical Research
Subjects:
Online Access:https://doi.org/10.1002/cjp2.227
id doaj-26ed8a88a22d4260a6d1f9a5f383f1ce
record_format Article
spelling doaj-26ed8a88a22d4260a6d1f9a5f383f1ce2021-08-14T09:52:24ZengWileyThe Journal of Pathology: Clinical Research2056-45382021-09-017547148110.1002/cjp2.227Reliable computational quantification of liver fibrosis is compromised by inherent staining variationStuart Astbury0Jane I Grove1David A Dorward2Indra N Guha3Jonathan A Fallowfield4Timothy J Kendall5NIHR Nottingham Biomedical Research Centre Nottingham University Hospitals NHS Trust and the University of Nottingham Nottingham UKNIHR Nottingham Biomedical Research Centre Nottingham University Hospitals NHS Trust and the University of Nottingham Nottingham UKUniversity of Edinburgh Centre for Inflammation Research University of Edinburgh Edinburgh UKNIHR Nottingham Biomedical Research Centre Nottingham University Hospitals NHS Trust and the University of Nottingham Nottingham UKUniversity of Edinburgh Centre for Inflammation Research University of Edinburgh Edinburgh UKUniversity of Edinburgh Centre for Inflammation Research University of Edinburgh Edinburgh UKAbstract Biopsy remains the gold‐standard measure for staging liver disease, both to inform prognosis and to assess the response to a given treatment. Semiquantitative scores such as the Ishak fibrosis score are used for evaluation. These scores are utilised in clinical trials, with the US Food and Drug Administration mandating particular scores as inclusion criteria for participants and using the change in score as evidence of treatment efficacy. There is an urgent need for improved, quantitative assessment of liver biopsies to detect small incremental changes in liver architecture over the course of a clinical trial. Artificial intelligence (AI) methods have been proposed as a way to increase the amount of information extracted from a biopsy and to potentially remove bias introduced by manual scoring. We have trained and evaluated an AI tool for measuring the amount of scarring in sections of picrosirius red‐stained liver. The AI methodology was compared with both manual scoring and widely available colour space thresholding. Four sequential sections from each case were stained on two separate occasions by two independent clinical laboratories using routine protocols to study the effect of inter‐ and intra‐laboratory staining variation on these tools. Finally, we compared these methods to second harmonic generation (SHG) imaging, a stain‐free quantitative measure of collagen. Although AI methods provided a modest improvement over simpler computer‐assisted measures, staining variation both within and between laboratories had a dramatic effect on quantitation, with manual assignment of scar proportion being the most consistent. Manual assessment also most strongly correlated with collagen measured by SHG. In conclusion, results suggest that computational measures of liver scarring from stained sections are compromised by inter‐ and intra‐laboratory staining. Stain‐free quantitative measurement using SHG avoids staining‐related variation and may prove more accurate in detecting small changes in scarring that may occur in therapeutic trials.https://doi.org/10.1002/cjp2.227liver fibrosishistological scoringartificial intelligencedigital pathology
collection DOAJ
language English
format Article
sources DOAJ
author Stuart Astbury
Jane I Grove
David A Dorward
Indra N Guha
Jonathan A Fallowfield
Timothy J Kendall
spellingShingle Stuart Astbury
Jane I Grove
David A Dorward
Indra N Guha
Jonathan A Fallowfield
Timothy J Kendall
Reliable computational quantification of liver fibrosis is compromised by inherent staining variation
The Journal of Pathology: Clinical Research
liver fibrosis
histological scoring
artificial intelligence
digital pathology
author_facet Stuart Astbury
Jane I Grove
David A Dorward
Indra N Guha
Jonathan A Fallowfield
Timothy J Kendall
author_sort Stuart Astbury
title Reliable computational quantification of liver fibrosis is compromised by inherent staining variation
title_short Reliable computational quantification of liver fibrosis is compromised by inherent staining variation
title_full Reliable computational quantification of liver fibrosis is compromised by inherent staining variation
title_fullStr Reliable computational quantification of liver fibrosis is compromised by inherent staining variation
title_full_unstemmed Reliable computational quantification of liver fibrosis is compromised by inherent staining variation
title_sort reliable computational quantification of liver fibrosis is compromised by inherent staining variation
publisher Wiley
series The Journal of Pathology: Clinical Research
issn 2056-4538
publishDate 2021-09-01
description Abstract Biopsy remains the gold‐standard measure for staging liver disease, both to inform prognosis and to assess the response to a given treatment. Semiquantitative scores such as the Ishak fibrosis score are used for evaluation. These scores are utilised in clinical trials, with the US Food and Drug Administration mandating particular scores as inclusion criteria for participants and using the change in score as evidence of treatment efficacy. There is an urgent need for improved, quantitative assessment of liver biopsies to detect small incremental changes in liver architecture over the course of a clinical trial. Artificial intelligence (AI) methods have been proposed as a way to increase the amount of information extracted from a biopsy and to potentially remove bias introduced by manual scoring. We have trained and evaluated an AI tool for measuring the amount of scarring in sections of picrosirius red‐stained liver. The AI methodology was compared with both manual scoring and widely available colour space thresholding. Four sequential sections from each case were stained on two separate occasions by two independent clinical laboratories using routine protocols to study the effect of inter‐ and intra‐laboratory staining variation on these tools. Finally, we compared these methods to second harmonic generation (SHG) imaging, a stain‐free quantitative measure of collagen. Although AI methods provided a modest improvement over simpler computer‐assisted measures, staining variation both within and between laboratories had a dramatic effect on quantitation, with manual assignment of scar proportion being the most consistent. Manual assessment also most strongly correlated with collagen measured by SHG. In conclusion, results suggest that computational measures of liver scarring from stained sections are compromised by inter‐ and intra‐laboratory staining. Stain‐free quantitative measurement using SHG avoids staining‐related variation and may prove more accurate in detecting small changes in scarring that may occur in therapeutic trials.
topic liver fibrosis
histological scoring
artificial intelligence
digital pathology
url https://doi.org/10.1002/cjp2.227
work_keys_str_mv AT stuartastbury reliablecomputationalquantificationofliverfibrosisiscompromisedbyinherentstainingvariation
AT janeigrove reliablecomputationalquantificationofliverfibrosisiscompromisedbyinherentstainingvariation
AT davidadorward reliablecomputationalquantificationofliverfibrosisiscompromisedbyinherentstainingvariation
AT indranguha reliablecomputationalquantificationofliverfibrosisiscompromisedbyinherentstainingvariation
AT jonathanafallowfield reliablecomputationalquantificationofliverfibrosisiscompromisedbyinherentstainingvariation
AT timothyjkendall reliablecomputationalquantificationofliverfibrosisiscompromisedbyinherentstainingvariation
_version_ 1721207502330134528