Ontology-based information extraction from pathology reports for cancer registration
This research project develops an ontology-based technique to exploit the information contained in free-text surgical pathology reports for breast cancer patients. A novel ontology for the domain is designed and several tools for information extraction and reasoning are developed, supported by machi...
Main Author: | Napolitano, Giulio |
---|---|
Published: | Queen's University Belfast, 2014 |
Subjects: | 615.5 |
Online Access: | http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.675419 |
id |
ndltd-bl.uk-oai-ethos.bl.uk-675419 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-bl.uk-oai-ethos.bl.uk-675419; 2016-08-04T04:20:22Z; Ontology-based information extraction from pathology reports for cancer registration; Napolitano, Giulio; 2014; 615.5; Queen's University Belfast; http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.675419; Electronic Thesis or Dissertation |
collection |
NDLTD |
sources |
NDLTD |
topic |
615.5 |
description |
This research project develops an ontology-based technique to exploit the information contained in free-text surgical pathology reports for breast cancer patients. A novel ontology for the domain is designed and several tools for information extraction and reasoning are developed, supported by machine learning algorithms that aid the identification of the relevant information within the documents. The research shows that information extraction from surgical pathology reports can be significantly enhanced by machine learning pre-processing, which selects the appropriate extraction technique for the report layout and filters out irrelevant portions of text. Such a system can also be coupled with clearly defined, formal semantic models both of the reality, which support the information extraction tasks, and of coding systems, which enable the automatic assignment of clinical codes with complex rules. As a whole, it can relieve cancer registry staff, researchers or clinicians of the burden of reading pathology reports, calculating cancer staging codes and entering information into a database. The main benefits of this research are cost savings and greater completeness and accuracy of both routine cancer registrations and study-specific cancer data collection for cancer registries. The outcomes of this research will also be of interest to the management of pathology laboratories: increasing their awareness of the reports' use in automated contexts will hopefully induce relevant modifications in the writing styles of the documents or, even better, encourage the adoption of structured collection of information for, at least, the essential data items used for cancer epidemiology. |
author |
Napolitano, Giulio |
title |
Ontology-based information extraction from pathology reports for cancer registration |
publisher |
Queen's University Belfast |
publishDate |
2014 |
url |
http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.675419 |