Normalizing Biomedical Name Entities by Inference Network Weighting and De-ambiguity Matching
碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 96 === In recent years the number of biomedical literatures is increased dramatically and the related experts cannot efficiently manage and extract knowledge from literatures so that much useful information would be lost. In order to construct an intelligent biomedic...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2008
|
Online Access: | http://ndltd.ncl.edu.tw/handle/61339962853464456004 |
id |
ndltd-TW-096NCKU5392032 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-096NCKU53920322015-11-23T04:02:51Z http://ndltd.ncl.edu.tw/handle/61339962853464456004 Normalizing Biomedical Name Entities by Inference Network Weighting and De-ambiguity Matching 利用推論網路加權與反模糊比對之生物醫學文獻詞彙正規化 I-Chin Huang 黃奕欽 碩士 國立成功大學 資訊工程學系碩博士班 96 In recent years the number of biomedical literatures is increased dramatically and the related experts cannot efficiently manage and extract knowledge from literatures so that much useful information would be lost. In order to construct an intelligent biomedical knowledge management system, researchers have proposed many Relation Extraction methods during the last several decades. However, before applying those methods the system has to recognize the name entities in literature and map the entity to the relative concept. Due to the less of conscientious and careful writing style, there are many problems, e.g. term variation and term ambiguity, in the mapping process and they cause error correlation between name entity and concept by the directly mapping method. Thus, the purpose of this study is to automatically and exactly identify the relative concepts mentioned in literatures. In this study, the influence network weighting strategy is applied to weight the similarity score between the entity and the concept as well as to solve the term variation. The proposed de-ambiguity strategy is used to increase the confidence of concept in literature. Different from previous studies, this study makes a good use of the information in entity and concept to increase the precision of system and makes the identified entity even more meaningful. Results of the experiment, the system using those proposed strategies outperforms the simple strategies and previously proposed methods in biomedical entity normalization. Generally, this study proposes to help the next step of text mining researches, e.g. PPI and Co-occurrence, by normalizing the name entity. Hung-Yu Kao 高宏宇 2008 學位論文 ; thesis 45 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 96 === In recent years the number of biomedical literatures is increased dramatically and the related experts cannot efficiently manage and extract knowledge from literatures so that much useful information would be lost. In order to construct an intelligent biomedical knowledge management system, researchers have proposed many Relation Extraction methods during the last several decades. However, before applying those methods the system has to recognize the name entities in literature and map the entity to the relative concept. Due to the less of conscientious and careful writing style, there are many problems, e.g. term variation and term ambiguity, in the mapping process and they cause error correlation between name entity and concept by the directly mapping method. Thus, the purpose of this study is to automatically and exactly identify the relative concepts mentioned in literatures.
In this study, the influence network weighting strategy is applied to weight the similarity score between the entity and the concept as well as to solve the term variation. The proposed de-ambiguity strategy is used to increase the confidence of concept in literature. Different from previous studies, this study makes a good use of the information in entity and concept to increase the precision of system and makes the identified entity even more meaningful. Results of the experiment, the system using those proposed strategies outperforms the simple strategies and previously proposed methods in biomedical entity normalization. Generally, this study proposes to help the next step of text mining researches, e.g. PPI and Co-occurrence, by normalizing the name entity.
|
author2 |
Hung-Yu Kao |
author_facet |
Hung-Yu Kao I-Chin Huang 黃奕欽 |
author |
I-Chin Huang 黃奕欽 |
spellingShingle |
I-Chin Huang 黃奕欽 Normalizing Biomedical Name Entities by Inference Network Weighting and De-ambiguity Matching |
author_sort |
I-Chin Huang |
title |
Normalizing Biomedical Name Entities by Inference Network Weighting and De-ambiguity Matching |
title_short |
Normalizing Biomedical Name Entities by Inference Network Weighting and De-ambiguity Matching |
title_full |
Normalizing Biomedical Name Entities by Inference Network Weighting and De-ambiguity Matching |
title_fullStr |
Normalizing Biomedical Name Entities by Inference Network Weighting and De-ambiguity Matching |
title_full_unstemmed |
Normalizing Biomedical Name Entities by Inference Network Weighting and De-ambiguity Matching |
title_sort |
normalizing biomedical name entities by inference network weighting and de-ambiguity matching |
publishDate |
2008 |
url |
http://ndltd.ncl.edu.tw/handle/61339962853464456004 |
work_keys_str_mv |
AT ichinhuang normalizingbiomedicalnameentitiesbyinferencenetworkweightinganddeambiguitymatching AT huángyìqīn normalizingbiomedicalnameentitiesbyinferencenetworkweightinganddeambiguitymatching AT ichinhuang lìyòngtuīlùnwǎnglùjiāquányǔfǎnmóhúbǐduìzhīshēngwùyīxuéwénxiàncíhuìzhèngguīhuà AT huángyìqīn lìyòngtuīlùnwǎnglùjiāquányǔfǎnmóhúbǐduìzhīshēngwùyīxuéwénxiàncíhuìzhèngguīhuà |
_version_ |
1718133970225332224 |