Prospector: A web-based tool for rapid acquisition of gold standard data for pathology research and image analysis

Background: Obtaining ground truth for pathological images is essential for various experiments, especially for training and testing image analysis algorithms. However, obtaining pathologist input is often difficult, time consuming and expensive. This leads to algorithms being over-fitted to small d...

Full description

Bibliographic Details
Main Authors: Alexander I Wright, Derek R Magee, Philip Quirke, Darren E Treanor
Format: Article
Language:English
Published: Wolters Kluwer Medknow Publications 2015-01-01
Series:Journal of Pathology Informatics
Subjects:
Online Access:http://www.jpathinformatics.org/article.asp?issn=2153-3539;year=2015;volume=6;issue=1;spage=21;epage=21;aulast=Wright
id doaj-66f4d4fe5e354b4fa93531778e6b576a
record_format Article
spelling doaj-66f4d4fe5e354b4fa93531778e6b576a2020-11-24T23:53:10ZengWolters Kluwer Medknow PublicationsJournal of Pathology Informatics2153-35392015-01-0161212110.4103/2153-3539.157785Prospector: A web-based tool for rapid acquisition of gold standard data for pathology research and image analysisAlexander I WrightDerek R MageePhilip QuirkeDarren E TreanorBackground: Obtaining ground truth for pathological images is essential for various experiments, especially for training and testing image analysis algorithms. However, obtaining pathologist input is often difficult, time consuming and expensive. This leads to algorithms being over-fitted to small datasets, and inappropriate validation, which causes poor performance on real world data. There is a great need to gather data from pathologists in a simple and efficient manner, in order to maximise the amount of data obtained. Methods: We present a lightweight, web-based HTML5 system for administering and participating in data collection experiments. The system is designed for rapid input with minimal effort, and can be accessed from anywhere in the world with a reliable internet connection. Results: We present two case studies that use the system to assess how limitations on fields of view affect pathologist agreement, and to what extent poorly stained slides affect judgement. In both cases, the system collects pathologist scores at a rate of less than two seconds per image. Conclusions: The system has multiple potential applications in pathology and other domains.http://www.jpathinformatics.org/article.asp?issn=2153-3539;year=2015;volume=6;issue=1;spage=21;epage=21;aulast=WrightData acquisition, gold standard, ground truth, training data, web experiment system
collection DOAJ
language English
format Article
sources DOAJ
author Alexander I Wright
Derek R Magee
Philip Quirke
Darren E Treanor
spellingShingle Alexander I Wright
Derek R Magee
Philip Quirke
Darren E Treanor
Prospector: A web-based tool for rapid acquisition of gold standard data for pathology research and image analysis
Journal of Pathology Informatics
Data acquisition, gold standard, ground truth, training data, web experiment system
author_facet Alexander I Wright
Derek R Magee
Philip Quirke
Darren E Treanor
author_sort Alexander I Wright
title Prospector: A web-based tool for rapid acquisition of gold standard data for pathology research and image analysis
title_short Prospector: A web-based tool for rapid acquisition of gold standard data for pathology research and image analysis
title_full Prospector: A web-based tool for rapid acquisition of gold standard data for pathology research and image analysis
title_fullStr Prospector: A web-based tool for rapid acquisition of gold standard data for pathology research and image analysis
title_full_unstemmed Prospector: A web-based tool for rapid acquisition of gold standard data for pathology research and image analysis
title_sort prospector: a web-based tool for rapid acquisition of gold standard data for pathology research and image analysis
publisher Wolters Kluwer Medknow Publications
series Journal of Pathology Informatics
issn 2153-3539
publishDate 2015-01-01
description Background: Obtaining ground truth for pathological images is essential for various experiments, especially for training and testing image analysis algorithms. However, obtaining pathologist input is often difficult, time consuming and expensive. This leads to algorithms being over-fitted to small datasets, and inappropriate validation, which causes poor performance on real world data. There is a great need to gather data from pathologists in a simple and efficient manner, in order to maximise the amount of data obtained. Methods: We present a lightweight, web-based HTML5 system for administering and participating in data collection experiments. The system is designed for rapid input with minimal effort, and can be accessed from anywhere in the world with a reliable internet connection. Results: We present two case studies that use the system to assess how limitations on fields of view affect pathologist agreement, and to what extent poorly stained slides affect judgement. In both cases, the system collects pathologist scores at a rate of less than two seconds per image. Conclusions: The system has multiple potential applications in pathology and other domains.
topic Data acquisition, gold standard, ground truth, training data, web experiment system
url http://www.jpathinformatics.org/article.asp?issn=2153-3539;year=2015;volume=6;issue=1;spage=21;epage=21;aulast=Wright
work_keys_str_mv AT alexanderiwright prospectorawebbasedtoolforrapidacquisitionofgoldstandarddataforpathologyresearchandimageanalysis
AT derekrmagee prospectorawebbasedtoolforrapidacquisitionofgoldstandarddataforpathologyresearchandimageanalysis
AT philipquirke prospectorawebbasedtoolforrapidacquisitionofgoldstandarddataforpathologyresearchandimageanalysis
AT darrenetreanor prospectorawebbasedtoolforrapidacquisitionofgoldstandarddataforpathologyresearchandimageanalysis
_version_ 1725470907184447488