Biomedical Literature Mining with Transitive Closure and Maximum Network Flow

This thesis examines biomedical text mining with an application in bone biology. A special thanks is extended to Anita Park and Mark Jaeger from the Purdue University Graduate School Office, who acted as invaluable assets in the formatting of the thesis. IUPUI and every other university would be fo...

Full description

Bibliographic Details
Main Author: Hoblitzell, Andrew P.
Other Authors: Mukhopadhyay, Snehasis
Language:en_US
Published: http://doi.acm.org/10.1145/1851476.1851552 2011
Subjects:
Online Access:Andrew Hoblitzell, Snehasis Mukhopadhyay, Qian You, Shiaofen Fang, Yuni Xia, and Joseph Bidwell. 2010. Text mining for bone biology. In Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (HPDC '10). ACM, New York, NY, USA, 522-530. DOI=10.1145/1851476.1851552 http://doi.acm.org/10.1145/1851476.1851552
http://hdl.handle.net/1805/2609
id ndltd-IUPUI-oai-scholarworks.iupui.edu-1805-2609
record_format oai_dc
spelling ndltd-IUPUI-oai-scholarworks.iupui.edu-1805-26092019-05-10T15:21:00Z Biomedical Literature Mining with Transitive Closure and Maximum Network Flow Hoblitzell, Andrew P. Mukhopadhyay, Snehasis Xia, Yuni Fang, Shiafoen Biomedical text mining Bioinformatics Hypergraphs Information storage and retrieval systems -- Biology Data mining Biological literature Hypergraphs This thesis examines biomedical text mining with an application in bone biology. A special thanks is extended to Anita Park and Mark Jaeger from the Purdue University Graduate School Office, who acted as invaluable assets in the formatting of the thesis. IUPUI and every other university would be fortunate to have staff that respond in such a timely, corteous, and professional manner. Indiana University-Purdue University Indianapolis (IUPUI) The biological literature is a huge and constantly increasing source of information which the biologist may consult for information about their field, but the vast amount of data can sometimes become overwhelming. Medline, which makes a great amount of biological journal data available online, makes the development of automated text mining systems and hence “data-driven discovery” possible. This thesis examines current work in the field of text mining and biological literature, and then aims to mine documents pertaining to bone biology. The documents are retrieved from PubMed, and then direct associations between the terms are computers. Potentially novel transitive associations among biological objects are then discovered using the transitive closure algorithm and the maximum flow algorithm. The thesis discusses in detail the extraction of biological objects from the collected documents and the co-occurrence based text mining algorithm, the transitive closure algorithm, and the maximum network flow which were then run to extract the potentially novel biological associations. Generated hypotheses (novel associations) were assigned with significance scores for further validation by a bone biologist expert. Extension of the work in to hypergraphs for enhanced meaning and accuracy is also examined in the thesis. 2011-07-11T20:19:06Z 2011-07-11T20:19:06Z 2011-05-15 Andrew Hoblitzell, Snehasis Mukhopadhyay, Qian You, Shiaofen Fang, Yuni Xia, and Joseph Bidwell. 2010. Text mining for bone biology. In Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (HPDC '10). ACM, New York, NY, USA, 522-530. DOI=10.1145/1851476.1851552 http://doi.acm.org/10.1145/1851476.1851552 http://hdl.handle.net/1805/2609 en_US http://doi.acm.org/10.1145/1851476.1851552
collection NDLTD
language en_US
sources NDLTD
topic Biomedical text mining
Bioinformatics
Hypergraphs
Information storage and retrieval systems -- Biology
Data mining
Biological literature
Hypergraphs
spellingShingle Biomedical text mining
Bioinformatics
Hypergraphs
Information storage and retrieval systems -- Biology
Data mining
Biological literature
Hypergraphs
Hoblitzell, Andrew P.
Biomedical Literature Mining with Transitive Closure and Maximum Network Flow
description This thesis examines biomedical text mining with an application in bone biology. A special thanks is extended to Anita Park and Mark Jaeger from the Purdue University Graduate School Office, who acted as invaluable assets in the formatting of the thesis. IUPUI and every other university would be fortunate to have staff that respond in such a timely, corteous, and professional manner. === Indiana University-Purdue University Indianapolis (IUPUI) === The biological literature is a huge and constantly increasing source of information which the biologist may consult for information about their field, but the vast amount of data can sometimes become overwhelming. Medline, which makes a great amount of biological journal data available online, makes the development of automated text mining systems and hence “data-driven discovery” possible. This thesis examines current work in the field of text mining and biological literature, and then aims to mine documents pertaining to bone biology. The documents are retrieved from PubMed, and then direct associations between the terms are computers. Potentially novel transitive associations among biological objects are then discovered using the transitive closure algorithm and the maximum flow algorithm. The thesis discusses in detail the extraction of biological objects from the collected documents and the co-occurrence based text mining algorithm, the transitive closure algorithm, and the maximum network flow which were then run to extract the potentially novel biological associations. Generated hypotheses (novel associations) were assigned with significance scores for further validation by a bone biologist expert. Extension of the work in to hypergraphs for enhanced meaning and accuracy is also examined in the thesis.
author2 Mukhopadhyay, Snehasis
author_facet Mukhopadhyay, Snehasis
Hoblitzell, Andrew P.
author Hoblitzell, Andrew P.
author_sort Hoblitzell, Andrew P.
title Biomedical Literature Mining with Transitive Closure and Maximum Network Flow
title_short Biomedical Literature Mining with Transitive Closure and Maximum Network Flow
title_full Biomedical Literature Mining with Transitive Closure and Maximum Network Flow
title_fullStr Biomedical Literature Mining with Transitive Closure and Maximum Network Flow
title_full_unstemmed Biomedical Literature Mining with Transitive Closure and Maximum Network Flow
title_sort biomedical literature mining with transitive closure and maximum network flow
publisher http://doi.acm.org/10.1145/1851476.1851552
publishDate 2011
url Andrew Hoblitzell, Snehasis Mukhopadhyay, Qian You, Shiaofen Fang, Yuni Xia, and Joseph Bidwell. 2010. Text mining for bone biology. In Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (HPDC '10). ACM, New York, NY, USA, 522-530. DOI=10.1145/1851476.1851552 http://doi.acm.org/10.1145/1851476.1851552
http://hdl.handle.net/1805/2609
work_keys_str_mv AT hoblitzellandrewp biomedicalliteratureminingwithtransitiveclosureandmaximumnetworkflow
_version_ 1719079815210860544