Personalized Medicine through Automatic Extraction of Information from Medical Texts

The wealth of medical-related information available today gives rise to a multidimensional source of knowledge. Research discoveries published in prestigious venues, electronic-health records data, discharge summaries, clinical notes, etc., all represent important medical information that can a...

Full description

Bibliographic Details
Main Author:	Frunza, Oana Magdalena
Other Authors:	Inkpen, Diana
Language:	en
Published:	Université d'Ottawa / University of Ottawa 2012
Subjects:	Natural Language Processing Machine Learning Text Mining Medical Informatics
Online Access:	http://hdl.handle.net/10393/22724 http://dx.doi.org/10.20381/ruor-5599

id	ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-22724
record_format	oai_dc
spelling	ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-227242018-01-05T19:01:14Z Personalized Medicine through Automatic Extraction of Information from Medical Texts Frunza, Oana Magdalena Inkpen, Diana Natural Language Processing Machine Learning Text Mining Medical Informatics The wealth of medical-related information available today gives rise to a multidimensional source of knowledge. Research discoveries published in prestigious venues, electronic-health records data, discharge summaries, clinical notes, etc., all represent important medical information that can assist in the medical decision-making process. The challenge that comes with accessing and using such vast and diverse sources of data stands in the ability to distil and extract reliable and relevant information. Computer-based tools that use natural language processing and machine learning techniques have proven to help address such challenges. This current work proposes automatic reliable solutions for solving tasks that can help achieve a personalized-medicine, a medical practice that brings together general medical knowledge and case-specific medical information. Phenotypic medical observations, along with data coming from test results, are not enough when assessing and treating a medical case. Genetic, life-style, background and environmental data also need to be taken into account in the medical decision process. This thesis’s goal is to prove that natural language processing and machine learning techniques represent reliable solutions for solving important medical-related problems. From the numerous research problems that need to be answered when implementing personalized medicine, the scope of this thesis is restricted to four, as follows: 1. Automatic identification of obesity-related diseases by using only textual clinical data; 2. Automatic identification of relevant abstracts of published research to be used for building systematic reviews; 3. Automatic identification of gene functions based on textual data of published medical abstracts; 4. Automatic identification and classification of important medical relations between medical concepts in clinical and technical data. This thesis investigation on finding automatic solutions for achieving a personalized medicine through information identification and extraction focused on individual specific problems that can be later linked in a puzzle-building manner. A diverse representation technique that follows a divide-and-conquer methodological approach shows to be the most reliable solution for building automatic models that solve the above mentioned tasks. The methodologies that I propose are supported by in-depth research experiments and thorough discussions and conclusions. 2012-04-17T18:56:02Z 2012-04-17T18:56:02Z 2012 2012 Thesis http://hdl.handle.net/10393/22724 http://dx.doi.org/10.20381/ruor-5599 en Université d'Ottawa / University of Ottawa
collection	NDLTD
language	en
sources	NDLTD
topic	Natural Language Processing Machine Learning Text Mining Medical Informatics
spellingShingle	Natural Language Processing Machine Learning Text Mining Medical Informatics Frunza, Oana Magdalena Personalized Medicine through Automatic Extraction of Information from Medical Texts
description	The wealth of medical-related information available today gives rise to a multidimensional source of knowledge. Research discoveries published in prestigious venues, electronic-health records data, discharge summaries, clinical notes, etc., all represent important medical information that can assist in the medical decision-making process. The challenge that comes with accessing and using such vast and diverse sources of data stands in the ability to distil and extract reliable and relevant information. Computer-based tools that use natural language processing and machine learning techniques have proven to help address such challenges. This current work proposes automatic reliable solutions for solving tasks that can help achieve a personalized-medicine, a medical practice that brings together general medical knowledge and case-specific medical information. Phenotypic medical observations, along with data coming from test results, are not enough when assessing and treating a medical case. Genetic, life-style, background and environmental data also need to be taken into account in the medical decision process. This thesis’s goal is to prove that natural language processing and machine learning techniques represent reliable solutions for solving important medical-related problems. From the numerous research problems that need to be answered when implementing personalized medicine, the scope of this thesis is restricted to four, as follows: 1. Automatic identification of obesity-related diseases by using only textual clinical data; 2. Automatic identification of relevant abstracts of published research to be used for building systematic reviews; 3. Automatic identification of gene functions based on textual data of published medical abstracts; 4. Automatic identification and classification of important medical relations between medical concepts in clinical and technical data. This thesis investigation on finding automatic solutions for achieving a personalized medicine through information identification and extraction focused on individual specific problems that can be later linked in a puzzle-building manner. A diverse representation technique that follows a divide-and-conquer methodological approach shows to be the most reliable solution for building automatic models that solve the above mentioned tasks. The methodologies that I propose are supported by in-depth research experiments and thorough discussions and conclusions.
author2	Inkpen, Diana
author_facet	Inkpen, Diana Frunza, Oana Magdalena
author	Frunza, Oana Magdalena
author_sort	Frunza, Oana Magdalena
title	Personalized Medicine through Automatic Extraction of Information from Medical Texts
title_short	Personalized Medicine through Automatic Extraction of Information from Medical Texts
title_full	Personalized Medicine through Automatic Extraction of Information from Medical Texts
title_fullStr	Personalized Medicine through Automatic Extraction of Information from Medical Texts
title_full_unstemmed	Personalized Medicine through Automatic Extraction of Information from Medical Texts
title_sort	personalized medicine through automatic extraction of information from medical texts
publisher	Université d'Ottawa / University of Ottawa
publishDate	2012
url	http://hdl.handle.net/10393/22724 http://dx.doi.org/10.20381/ruor-5599
work_keys_str_mv	AT frunzaoanamagdalena personalizedmedicinethroughautomaticextractionofinformationfrommedicaltexts
_version_	1718597507279224832

Personalized Medicine through Automatic Extraction of Information from Medical Texts

Similar Items