Measuring Semantic Relatedness by Wikipedia Revision Information
碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 98 === Nowadays, Wikipedia is an accurate, fashion, huge and wiki-based encyclopedia on the WWW. Simultaneously, Wikipedia is invaluable resource for research work because Wikipedia have many useful properties to enrich. Those properties contain high dense links, liv...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/25508215374684132603 |
id |
ndltd-TW-098NCKU5392105 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098NCKU53921052016-04-22T04:22:57Z http://ndltd.ncl.edu.tw/handle/25508215374684132603 Measuring Semantic Relatedness by Wikipedia Revision Information 一個基於維基百科歷史修訂資訊之字詞語意關聯度評量方法 Wen-TengYang 楊文籐 碩士 國立成功大學 資訊工程學系碩博士班 98 Nowadays, Wikipedia is an accurate, fashion, huge and wiki-based encyclopedia on the WWW. Simultaneously, Wikipedia is invaluable resource for research work because Wikipedia have many useful properties to enrich. Those properties contain high dense links, live update, URL identification for concepts and complete revision history, etc. In this paper, we deal with the articles which the words represent in Wikipedia. Moreover, each Wikipedia article represents an individual concept and simultaneously contains other concepts which are hyperlinks of other articles in its content. Namely, the semantic relatedness between two articles is also the semantic relatedness between two words. Therefore, we propose an Editor-Contribution-based Rank (ECR) algorithm for ranking the concepts in the content of all revisions of article and take the ranked concepts as a vector to represent the article. ECR ranks those concepts depending on the relationship between concepts and the editors. We classify four types of relationship which behavior of addition and deletion maps to appropriate and inappropriate concepts. The experiment shows our method is better with the development of Wikipedia and gets 4.4% improvement over previous methods which calculate the relatedness between two articles. Hung-Yu Kao 高宏宇 2010 學位論文 ; thesis 54 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 98 === Nowadays, Wikipedia is an accurate, fashion, huge and wiki-based encyclopedia on the WWW. Simultaneously, Wikipedia is invaluable resource for research work because Wikipedia have many useful properties to enrich. Those properties contain high dense links, live update, URL identification for concepts and complete revision history, etc. In this paper, we deal with the articles which the words represent in Wikipedia. Moreover, each Wikipedia article represents an individual concept and simultaneously contains other concepts which are hyperlinks of other articles in its content. Namely, the semantic relatedness between two articles is also the semantic relatedness between two words. Therefore, we propose an Editor-Contribution-based Rank (ECR) algorithm for ranking the concepts in the content of all revisions of article and take the ranked concepts as a vector to represent the article. ECR ranks those concepts depending on the relationship between concepts and the editors. We classify four types of relationship which behavior of addition and deletion maps to appropriate and inappropriate concepts. The experiment shows our method is better with the development of Wikipedia and gets 4.4% improvement over previous methods which calculate the relatedness between two articles.
|
author2 |
Hung-Yu Kao |
author_facet |
Hung-Yu Kao Wen-TengYang 楊文籐 |
author |
Wen-TengYang 楊文籐 |
spellingShingle |
Wen-TengYang 楊文籐 Measuring Semantic Relatedness by Wikipedia Revision Information |
author_sort |
Wen-TengYang |
title |
Measuring Semantic Relatedness by Wikipedia Revision Information |
title_short |
Measuring Semantic Relatedness by Wikipedia Revision Information |
title_full |
Measuring Semantic Relatedness by Wikipedia Revision Information |
title_fullStr |
Measuring Semantic Relatedness by Wikipedia Revision Information |
title_full_unstemmed |
Measuring Semantic Relatedness by Wikipedia Revision Information |
title_sort |
measuring semantic relatedness by wikipedia revision information |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/25508215374684132603 |
work_keys_str_mv |
AT wentengyang measuringsemanticrelatednessbywikipediarevisioninformation AT yángwénténg measuringsemanticrelatednessbywikipediarevisioninformation AT wentengyang yīgèjīyúwéijībǎikēlìshǐxiūdìngzīxùnzhīzìcíyǔyìguānliándùpíngliàngfāngfǎ AT yángwénténg yīgèjīyúwéijībǎikēlìshǐxiūdìngzīxùnzhīzìcíyǔyìguānliándùpíngliàngfāngfǎ |
_version_ |
1718229803078778880 |