Unsupervised Discovery of Named Entities with Fine-Grained Category on the Web
Master's === National Tsing Hua University === Institute of Information Systems and Applications === 93 === We introduce a method for finding named entities (NEs) with the same category as a given set of seed named entities on the Web. In our approach, passages containing the given seed NEs are retrieved from the Web and subsequently used to construct linguistic mod...
Main Authors: | Cheng-Han Chiang, 江政韓 |
---|---|
Other Authors: | Jason S. Chang |
Format: | Others |
Language: | en_US |
Online Access: | http://ndltd.ncl.edu.tw/handle/59000480779565169837 |
id |
ndltd-TW-093NTHU5394011 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-093NTHU5394011 2016-06-06T04:11:21Z http://ndltd.ncl.edu.tw/handle/59000480779565169837 Unsupervised Discovery of Named Entities with Fine-Grained Category on the Web 非督導式細分類之網路具名實體自動發掘系統 Cheng-Han Chiang 江政韓 Master's National Tsing Hua University Institute of Information Systems and Applications 93 We introduce a method for finding named entities (NEs) of the same category as a given set of seed NEs on the Web. In our approach, passages containing the seed NEs are retrieved from the Web and used to construct a linguistic model for discovering additional NEs of the same category. The method involves generating a table of key terms with word classes from Web page summaries containing the seed NEs, and learning surface patterns containing the seed NEs from these passages. At runtime, we use the salient key terms and word classes in the model to find new Web summaries, filter out unlikely passages, and extract new NEs from the remaining passages using the surface patterns. We present a prototype system, Name Finder, which applies the proposed method to discover additional NEs for a given set of seed NEs. We evaluate Name Finder against Google Sets. The experimental results show that our system produces more NEs, with an average precision comparable to that of Google Sets. Our methodology supports automatic knowledge discovery and ontology extension. Jason S. Chang 張俊盛 學位論文 ; thesis 54 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others |
sources |
NDLTD |
description |
Master's === National Tsing Hua University === Institute of Information Systems and Applications === 93 === We introduce a method for finding named entities (NEs) of the same category as a given set of seed NEs on the Web. In our approach, passages containing the seed NEs are retrieved from the Web and used to construct a linguistic model for discovering additional NEs of the same category.
The method involves generating a table of key terms with word classes from Web page summaries containing the seed NEs, and learning surface patterns containing the seed NEs from these passages. At runtime, we use the salient key terms and word classes in the model to find new Web summaries, filter out unlikely passages, and extract new NEs from the remaining passages using the surface patterns.
We present a prototype system, Name Finder, which applies the proposed method to discover additional NEs for a given set of seed NEs. We evaluate Name Finder against Google Sets. The experimental results show that our system produces more NEs, with an average precision comparable to that of Google Sets. Our methodology supports automatic knowledge discovery and ontology extension.
|
author2 |
Jason S. Chang |
author_facet |
Jason S. Chang Cheng-Han Chiang 江政韓 |
author |
Cheng-Han Chiang 江政韓 |
spellingShingle |
Cheng-Han Chiang 江政韓 Unsupervised Discovery of Named Entities with Fine-Grained Category on the Web |
author_sort |
Cheng-Han Chiang |
title |
Unsupervised Discovery of Named Entities with Fine-Grained Category on the Web |
title_sort |
unsupervised discovery of named entities with fine-grained category on the web |
url |
http://ndltd.ncl.edu.tw/handle/59000480779565169837 |
_version_ |
1718296381672652800 |
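
The pipeline outlined in the description (key terms and surface patterns learned from seed passages, then filtering and extraction at runtime) can be illustrated with a small sketch. This is not the thesis's Name Finder implementation. It assumes in-memory passages instead of Web search results, omits word classes, uses a single word of left and right context as a "surface pattern", and treats any run of capitalized words as a candidate NE. All function names, thresholds, and example sentences below are hypothetical.

```python
import re
from collections import Counter

# Hypothetical stopword list, just large enough for this toy example.
STOPWORDS = {"a", "an", "the", "in", "of", "to", "are", "is", "and"}


def learn_key_terms(passages, seeds, min_passages=2):
    """Return words that co-occur with seed NEs in at least `min_passages` passages."""
    seed_tokens = {tok for s in seeds for tok in s.lower().split()}
    counts = Counter()
    for p in passages:
        if any(s in p for s in seeds):
            tokens = set(re.findall(r"[a-z]+", p.lower()))
            counts.update(tokens - STOPWORDS - seed_tokens)
    return {term for term, n in counts.items() if n >= min_passages}


def learn_patterns(passages, seeds):
    """Collect (left word, right word) contexts around each seed occurrence."""
    patterns = set()
    for p in passages:
        for s in seeds:
            for m in re.finditer(re.escape(s), p):
                left = p[:m.start()].split()[-1:]
                right = p[m.end():].split()[:1]
                if left and right:
                    patterns.add((left[0], right[0]))
    return patterns


def extract_candidates(passages, key_terms, patterns, seeds):
    """Drop passages without any key term, then apply the patterns to the rest."""
    found = set()
    for p in passages:
        tokens = set(re.findall(r"[a-z]+", p.lower()))
        if not tokens & key_terms:
            continue  # unlikely passage: shares no key term with the seed passages
        for left, right in patterns:
            regex = (r"\b" + re.escape(left)
                     + r"\s+([A-Z][\w-]*(?:\s+[A-Z][\w-]*)*)\s+"
                     + re.escape(right) + r"\b")
            for m in re.finditer(regex, p):
                if m.group(1) not in seeds:
                    found.add(m.group(1))
    return sorted(found)


seeds = {"Paris", "Berlin"}
seed_passages = [
    "Hotels in the city of Paris are expensive",
    "Flights to the city of Berlin are cheap",
]
key_terms = learn_key_terms(seed_passages, seeds)
patterns = learn_patterns(seed_passages, seeds)

new_passages = [
    "Museums in the city of Madrid are free",   # has a key term and matches the pattern
    "Fans of Madonna are loud",                 # matches the pattern but has no key term
    "Trains to the city of Berlin are late",    # only re-finds a seed
]
print(extract_candidates(new_passages, key_terms, patterns, seeds))  # prints ['Madrid']
```

In this toy run the only learned key term is "city" and the only pattern is ("of", "are"), so the second passage is rejected by the key-term filter even though the pattern alone would have extracted "Madonna". That is the role the description assigns to filtering out unlikely passages before pattern-based extraction.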