Using Machine Reading to Understand Alzheimer’s and Related Diseases from the Literature

Purpose: This paper aims to better understand a large number of papers in the medical domain of Alzheimer’s disease (AD) and related diseases using the machine reading approach. Design/methodology/approach: The study uses the topic modeling method to obtain an overview of the field, and employs ope...

Full description

Bibliographic Details
Main Authors: Satoshi Tsutsui, Yi Bu, Ying Ding
Format: Article
Language:English
Published: Chinese Academy of Sciences 2017-12-01
Series:Journal of Data and Information Science
Subjects:
Online Access:http://manu47.magtech.com.cn/Jwk3_jdis/article/2017/2096-157X/2096-157X-2-4-81.shtml
id doaj-72c08ef7c79b4d6f87017e7f28364e72
record_format Article
spelling doaj-72c08ef7c79b4d6f87017e7f28364e722020-11-24T20:59:35ZengChinese Academy of SciencesJournal of Data and Information Science2096-157X2096-157X2017-12-0124819410.1515/jdis-2017-0021Using Machine Reading to Understand Alzheimer’s and Related Diseases from the LiteratureSatoshi Tsutsui0Yi Bu1 Ying Ding2School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN 47408, USASchool of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN 47408, USASchool of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN 47408, USAPurpose: This paper aims to better understand a large number of papers in the medical domain of Alzheimer’s disease (AD) and related diseases using the machine reading approach. Design/methodology/approach: The study uses the topic modeling method to obtain an overview of the field, and employs open information extraction to further comprehend the field at a specific fact level. Findings: Several topics within the AD research field are identified, such as the Human Immunodeficiency Virus (HIV)/Acquired Immune Deficiency Syndrome (AIDS), which can help answer the question of how AIDS/HIV and AD are very different yet related diseases. Research limitations: Some manual data cleaning could improve the study, such as removing incorrect facts found by open information extraction. Practical implications: This study uses the literature to answer specific questions on a scientific domain, which can help domain experts find interesting and meaningful relations among entities in a similar manner, such as to discover relations between AD and AIDS/HIV. Originality/value: Both the overview and specific information from the literature are obtained using two distinct methods in a complementary manner. This combination is novel because previous work has only focused on one of them, and thus provides a better way to understand an important scientific field using data-driven methods.http://manu47.magtech.com.cn/Jwk3_jdis/article/2017/2096-157X/2096-157X-2-4-81.shtmlMachine readingAlzheimer's diseaseKnowledge discovery
collection DOAJ
language English
format Article
sources DOAJ
author Satoshi Tsutsui
Yi Bu
Ying Ding
spellingShingle Satoshi Tsutsui
Yi Bu
Ying Ding
Using Machine Reading to Understand Alzheimer’s and Related Diseases from the Literature
Journal of Data and Information Science
Machine reading
Alzheimer's disease
Knowledge discovery
author_facet Satoshi Tsutsui
Yi Bu
Ying Ding
author_sort Satoshi Tsutsui
title Using Machine Reading to Understand Alzheimer’s and Related Diseases from the Literature
title_short Using Machine Reading to Understand Alzheimer’s and Related Diseases from the Literature
title_full Using Machine Reading to Understand Alzheimer’s and Related Diseases from the Literature
title_fullStr Using Machine Reading to Understand Alzheimer’s and Related Diseases from the Literature
title_full_unstemmed Using Machine Reading to Understand Alzheimer’s and Related Diseases from the Literature
title_sort using machine reading to understand alzheimer’s and related diseases from the literature
publisher Chinese Academy of Sciences
series Journal of Data and Information Science
issn 2096-157X
2096-157X
publishDate 2017-12-01
description Purpose: This paper aims to better understand a large number of papers in the medical domain of Alzheimer’s disease (AD) and related diseases using the machine reading approach. Design/methodology/approach: The study uses the topic modeling method to obtain an overview of the field, and employs open information extraction to further comprehend the field at a specific fact level. Findings: Several topics within the AD research field are identified, such as the Human Immunodeficiency Virus (HIV)/Acquired Immune Deficiency Syndrome (AIDS), which can help answer the question of how AIDS/HIV and AD are very different yet related diseases. Research limitations: Some manual data cleaning could improve the study, such as removing incorrect facts found by open information extraction. Practical implications: This study uses the literature to answer specific questions on a scientific domain, which can help domain experts find interesting and meaningful relations among entities in a similar manner, such as to discover relations between AD and AIDS/HIV. Originality/value: Both the overview and specific information from the literature are obtained using two distinct methods in a complementary manner. This combination is novel because previous work has only focused on one of them, and thus provides a better way to understand an important scientific field using data-driven methods.
topic Machine reading
Alzheimer's disease
Knowledge discovery
url http://manu47.magtech.com.cn/Jwk3_jdis/article/2017/2096-157X/2096-157X-2-4-81.shtml
work_keys_str_mv AT satoshitsutsui usingmachinereadingtounderstandalzheimersandrelateddiseasesfromtheliterature
AT yibu usingmachinereadingtounderstandalzheimersandrelateddiseasesfromtheliterature
AT yingding usingmachinereadingtounderstandalzheimersandrelateddiseasesfromtheliterature
_version_ 1716782241891221504