Personalized Medicine through Automatic Extraction of Information from Medical Texts
The wealth of medical-related information available today gives rise to a multidimensional source of knowledge. Research discoveries published in prestigious venues, electronic-health records data, discharge summaries, clinical notes, etc., all represent important medical information that can a...
Main Author: | |
---|---|
Other Authors: | |
Language: | en |
Published: |
Université d'Ottawa / University of Ottawa
2012
|
Subjects: | |
Online Access: | http://hdl.handle.net/10393/22724 http://dx.doi.org/10.20381/ruor-5599 |
id |
ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-22724 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-227242018-01-05T19:01:14Z Personalized Medicine through Automatic Extraction of Information from Medical Texts Frunza, Oana Magdalena Inkpen, Diana Natural Language Processing Machine Learning Text Mining Medical Informatics The wealth of medical-related information available today gives rise to a multidimensional source of knowledge. Research discoveries published in prestigious venues, electronic-health records data, discharge summaries, clinical notes, etc., all represent important medical information that can assist in the medical decision-making process. The challenge that comes with accessing and using such vast and diverse sources of data stands in the ability to distil and extract reliable and relevant information. Computer-based tools that use natural language processing and machine learning techniques have proven to help address such challenges. This current work proposes automatic reliable solutions for solving tasks that can help achieve a personalized-medicine, a medical practice that brings together general medical knowledge and case-specific medical information. Phenotypic medical observations, along with data coming from test results, are not enough when assessing and treating a medical case. Genetic, life-style, background and environmental data also need to be taken into account in the medical decision process. This thesis’s goal is to prove that natural language processing and machine learning techniques represent reliable solutions for solving important medical-related problems. From the numerous research problems that need to be answered when implementing personalized medicine, the scope of this thesis is restricted to four, as follows: 1. Automatic identification of obesity-related diseases by using only textual clinical data; 2. Automatic identification of relevant abstracts of published research to be used for building systematic reviews; 3. Automatic identification of gene functions based on textual data of published medical abstracts; 4. Automatic identification and classification of important medical relations between medical concepts in clinical and technical data. This thesis investigation on finding automatic solutions for achieving a personalized medicine through information identification and extraction focused on individual specific problems that can be later linked in a puzzle-building manner. A diverse representation technique that follows a divide-and-conquer methodological approach shows to be the most reliable solution for building automatic models that solve the above mentioned tasks. The methodologies that I propose are supported by in-depth research experiments and thorough discussions and conclusions. 2012-04-17T18:56:02Z 2012-04-17T18:56:02Z 2012 2012 Thesis http://hdl.handle.net/10393/22724 http://dx.doi.org/10.20381/ruor-5599 en Université d'Ottawa / University of Ottawa |
collection |
NDLTD |
language |
en |
sources |
NDLTD |
topic |
Natural Language Processing Machine Learning Text Mining Medical Informatics |
spellingShingle |
Natural Language Processing Machine Learning Text Mining Medical Informatics Frunza, Oana Magdalena Personalized Medicine through Automatic Extraction of Information from Medical Texts |
description |
The wealth of medical-related information available today gives rise to a multidimensional source of knowledge. Research discoveries published in prestigious venues, electronic-health records data, discharge summaries, clinical notes, etc., all represent important medical information that can assist in the medical decision-making process. The challenge that comes with accessing and using such vast and diverse sources of data stands in the ability to distil and extract reliable and relevant information. Computer-based tools that use natural language processing and machine learning techniques have proven to help address such challenges. This current work proposes automatic reliable solutions for solving tasks that can help achieve a personalized-medicine, a medical practice that brings together general medical knowledge and case-specific medical information. Phenotypic medical observations, along with data coming from test results, are not enough when assessing and treating a medical case. Genetic, life-style, background and environmental data also need to be taken into
account in the medical decision process. This thesis’s goal is to prove that natural
language processing and machine learning techniques represent reliable solutions for
solving important medical-related problems.
From the numerous research problems that need to be answered when implementing
personalized medicine, the scope of this thesis is restricted to four, as follows:
1. Automatic identification of obesity-related diseases by using only textual clinical
data;
2. Automatic identification of relevant abstracts of published research to be used for
building systematic reviews;
3. Automatic identification of gene functions based on textual data of published medical abstracts;
4. Automatic identification and classification of important medical relations between medical concepts in clinical and technical data. This thesis investigation on finding automatic solutions for achieving a personalized medicine through information identification and extraction focused on individual specific problems that can be later linked in a puzzle-building manner. A diverse representation technique that follows a divide-and-conquer methodological approach shows to be the most reliable solution for building automatic models that solve the above mentioned
tasks. The methodologies that I propose are supported by in-depth research experiments
and thorough discussions and conclusions. |
author2 |
Inkpen, Diana |
author_facet |
Inkpen, Diana Frunza, Oana Magdalena |
author |
Frunza, Oana Magdalena |
author_sort |
Frunza, Oana Magdalena |
title |
Personalized Medicine through Automatic Extraction of Information from Medical Texts |
title_short |
Personalized Medicine through Automatic Extraction of Information from Medical Texts |
title_full |
Personalized Medicine through Automatic Extraction of Information from Medical Texts |
title_fullStr |
Personalized Medicine through Automatic Extraction of Information from Medical Texts |
title_full_unstemmed |
Personalized Medicine through Automatic Extraction of Information from Medical Texts |
title_sort |
personalized medicine through automatic extraction of information from medical texts |
publisher |
Université d'Ottawa / University of Ottawa |
publishDate |
2012 |
url |
http://hdl.handle.net/10393/22724 http://dx.doi.org/10.20381/ruor-5599 |
work_keys_str_mv |
AT frunzaoanamagdalena personalizedmedicinethroughautomaticextractionofinformationfrommedicaltexts |
_version_ |
1718597507279224832 |