Ontology-based information extraction from pathology reports for cancer registration

This research project develops an ontology-based technique to exploit the information contained in free-text surgical pathology reports for breast cancer patients. A novel ontology for the domain is designed and several tools for information extraction and reasoning are developed, supported by machi...

Full description

Bibliographic Details
Main Author: Napolitano, Giulio
Published: Queen's University Belfast 2014
Subjects:
Online Access:http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.675419
id ndltd-bl.uk-oai-ethos.bl.uk-675419
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-6754192016-08-04T04:20:22ZOntology-based information extraction from pathology reports for cancer registrationNapolitano, Giulio2014This research project develops an ontology-based technique to exploit the information contained in free-text surgical pathology reports for breast cancer patients. A novel ontology for the domain is designed and several tools for information extraction and reasoning are developed, supported by machine learning algorithms aiding the identification of the relevant information within the documents. The research shows that information extraction from surgical pathology reports can be significantly enhanced by machine learning pre-processing, which will select the appropriate extraction technique for the report layout and filter out irrelevant portions of text. Also, such a system can be coupled with clearly defined, formal semantic models of both the reality, which will support the information extraction tasks, and of coding systems, which will enable to automatically assign clinical codes with complex rules. As a whole, it can alleviate the burden for cancer registry staff, researchers or clinicians of reading pathology reports, calculating cancer staging codes' and entering information on a database. The main benefits of this research will result in cost savings and in the augmented completeness and accuracy of both routine cancer registrations and study-specific cancer data collection for cancer registries. The outcomes of this research will also be appreciated by the management of pathology laboratories. Increasing their awareness of the reports' use in automated contexts will hopefully induce relevant modifications in the writing styles of the documents or, even better, encourage the adoption of structured collection of information for, at least, the essential data items used for cancer epidemiology.615.5Queen's University Belfasthttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.675419Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 615.5
spellingShingle 615.5
Napolitano, Giulio
Ontology-based information extraction from pathology reports for cancer registration
description This research project develops an ontology-based technique to exploit the information contained in free-text surgical pathology reports for breast cancer patients. A novel ontology for the domain is designed and several tools for information extraction and reasoning are developed, supported by machine learning algorithms aiding the identification of the relevant information within the documents. The research shows that information extraction from surgical pathology reports can be significantly enhanced by machine learning pre-processing, which will select the appropriate extraction technique for the report layout and filter out irrelevant portions of text. Also, such a system can be coupled with clearly defined, formal semantic models of both the reality, which will support the information extraction tasks, and of coding systems, which will enable to automatically assign clinical codes with complex rules. As a whole, it can alleviate the burden for cancer registry staff, researchers or clinicians of reading pathology reports, calculating cancer staging codes' and entering information on a database. The main benefits of this research will result in cost savings and in the augmented completeness and accuracy of both routine cancer registrations and study-specific cancer data collection for cancer registries. The outcomes of this research will also be appreciated by the management of pathology laboratories. Increasing their awareness of the reports' use in automated contexts will hopefully induce relevant modifications in the writing styles of the documents or, even better, encourage the adoption of structured collection of information for, at least, the essential data items used for cancer epidemiology.
author Napolitano, Giulio
author_facet Napolitano, Giulio
author_sort Napolitano, Giulio
title Ontology-based information extraction from pathology reports for cancer registration
title_short Ontology-based information extraction from pathology reports for cancer registration
title_full Ontology-based information extraction from pathology reports for cancer registration
title_fullStr Ontology-based information extraction from pathology reports for cancer registration
title_full_unstemmed Ontology-based information extraction from pathology reports for cancer registration
title_sort ontology-based information extraction from pathology reports for cancer registration
publisher Queen's University Belfast
publishDate 2014
url http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.675419
work_keys_str_mv AT napolitanogiulio ontologybasedinformationextractionfrompathologyreportsforcancerregistration
_version_ 1718373549813530624