Personalized Medicine through Automatic Extraction of Information from Medical Texts

The wealth of medical-related information available today gives rise to a multidimensional source of knowledge. Research discoveries published in prestigious venues, electronic-health records data, discharge summaries, clinical notes, etc., all represent important medical information that can a...

Full description

Bibliographic Details
Main Author: Frunza, Oana Magdalena
Other Authors: Inkpen, Diana
Language:en
Published: Université d'Ottawa / University of Ottawa 2012
Subjects:
Online Access:http://hdl.handle.net/10393/22724
http://dx.doi.org/10.20381/ruor-5599
id ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-22724
record_format oai_dc
spelling ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-227242018-01-05T19:01:14Z Personalized Medicine through Automatic Extraction of Information from Medical Texts Frunza, Oana Magdalena Inkpen, Diana Natural Language Processing Machine Learning Text Mining Medical Informatics The wealth of medical-related information available today gives rise to a multidimensional source of knowledge. Research discoveries published in prestigious venues, electronic-health records data, discharge summaries, clinical notes, etc., all represent important medical information that can assist in the medical decision-making process. The challenge that comes with accessing and using such vast and diverse sources of data stands in the ability to distil and extract reliable and relevant information. Computer-based tools that use natural language processing and machine learning techniques have proven to help address such challenges. This current work proposes automatic reliable solutions for solving tasks that can help achieve a personalized-medicine, a medical practice that brings together general medical knowledge and case-specific medical information. Phenotypic medical observations, along with data coming from test results, are not enough when assessing and treating a medical case. Genetic, life-style, background and environmental data also need to be taken into account in the medical decision process. This thesis’s goal is to prove that natural language processing and machine learning techniques represent reliable solutions for solving important medical-related problems. From the numerous research problems that need to be answered when implementing personalized medicine, the scope of this thesis is restricted to four, as follows: 1. Automatic identification of obesity-related diseases by using only textual clinical data; 2. Automatic identification of relevant abstracts of published research to be used for building systematic reviews; 3. Automatic identification of gene functions based on textual data of published medical abstracts; 4. Automatic identification and classification of important medical relations between medical concepts in clinical and technical data. This thesis investigation on finding automatic solutions for achieving a personalized medicine through information identification and extraction focused on individual specific problems that can be later linked in a puzzle-building manner. A diverse representation technique that follows a divide-and-conquer methodological approach shows to be the most reliable solution for building automatic models that solve the above mentioned tasks. The methodologies that I propose are supported by in-depth research experiments and thorough discussions and conclusions. 2012-04-17T18:56:02Z 2012-04-17T18:56:02Z 2012 2012 Thesis http://hdl.handle.net/10393/22724 http://dx.doi.org/10.20381/ruor-5599 en Université d'Ottawa / University of Ottawa
collection NDLTD
language en
sources NDLTD
topic Natural Language Processing
Machine Learning
Text Mining
Medical Informatics
spellingShingle Natural Language Processing
Machine Learning
Text Mining
Medical Informatics
Frunza, Oana Magdalena
Personalized Medicine through Automatic Extraction of Information from Medical Texts
description The wealth of medical-related information available today gives rise to a multidimensional source of knowledge. Research discoveries published in prestigious venues, electronic-health records data, discharge summaries, clinical notes, etc., all represent important medical information that can assist in the medical decision-making process. The challenge that comes with accessing and using such vast and diverse sources of data stands in the ability to distil and extract reliable and relevant information. Computer-based tools that use natural language processing and machine learning techniques have proven to help address such challenges. This current work proposes automatic reliable solutions for solving tasks that can help achieve a personalized-medicine, a medical practice that brings together general medical knowledge and case-specific medical information. Phenotypic medical observations, along with data coming from test results, are not enough when assessing and treating a medical case. Genetic, life-style, background and environmental data also need to be taken into account in the medical decision process. This thesis’s goal is to prove that natural language processing and machine learning techniques represent reliable solutions for solving important medical-related problems. From the numerous research problems that need to be answered when implementing personalized medicine, the scope of this thesis is restricted to four, as follows: 1. Automatic identification of obesity-related diseases by using only textual clinical data; 2. Automatic identification of relevant abstracts of published research to be used for building systematic reviews; 3. Automatic identification of gene functions based on textual data of published medical abstracts; 4. Automatic identification and classification of important medical relations between medical concepts in clinical and technical data. This thesis investigation on finding automatic solutions for achieving a personalized medicine through information identification and extraction focused on individual specific problems that can be later linked in a puzzle-building manner. A diverse representation technique that follows a divide-and-conquer methodological approach shows to be the most reliable solution for building automatic models that solve the above mentioned tasks. The methodologies that I propose are supported by in-depth research experiments and thorough discussions and conclusions.
author2 Inkpen, Diana
author_facet Inkpen, Diana
Frunza, Oana Magdalena
author Frunza, Oana Magdalena
author_sort Frunza, Oana Magdalena
title Personalized Medicine through Automatic Extraction of Information from Medical Texts
title_short Personalized Medicine through Automatic Extraction of Information from Medical Texts
title_full Personalized Medicine through Automatic Extraction of Information from Medical Texts
title_fullStr Personalized Medicine through Automatic Extraction of Information from Medical Texts
title_full_unstemmed Personalized Medicine through Automatic Extraction of Information from Medical Texts
title_sort personalized medicine through automatic extraction of information from medical texts
publisher Université d'Ottawa / University of Ottawa
publishDate 2012
url http://hdl.handle.net/10393/22724
http://dx.doi.org/10.20381/ruor-5599
work_keys_str_mv AT frunzaoanamagdalena personalizedmedicinethroughautomaticextractionofinformationfrommedicaltexts
_version_ 1718597507279224832