On Ontology Learning from Learning Resources of HTML Pages
碩士 === 銘傳大學 === 資訊管理學系碩士班 === 92 === Due to the abundancy and versatileness of resources in the highly developed Internet, it is not easy to find the right resources in the Internet. One important reason is the keywords used in most search engines show inconsistency among different domains and con...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2004
|
Online Access: | http://ndltd.ncl.edu.tw/handle/wgp7ah |
id |
ndltd-TW-092MCU00396018 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-092MCU003960182018-04-27T04:28:41Z http://ndltd.ncl.edu.tw/handle/wgp7ah On Ontology Learning from Learning Resources of HTML Pages 從HTML網頁萃取學習資源Ontology之研究 Wan-Shu Chao 趙婉舒 碩士 銘傳大學 資訊管理學系碩士班 92 Due to the abundancy and versatileness of resources in the highly developed Internet, it is not easy to find the right resources in the Internet. One important reason is the keywords used in most search engines show inconsistency among different domains and contexts. Fortunately, Ontology could eliminate this negative aspect by providing more meaningful semantics. However, building Ontology is a time-consuming and expensive task. Therefore, this thesis proposes a semi-automatic approach to building Ontology by designing a learning mechanism from resources of HTML pages. We use CKIP system to tag the Chinese part-of-speech and improve the TFIDF method by considering the weights of different HTML tag in retrieving important domain terms. Furthermore, we propose a two-level term-finding algorithm that discovers important terms (concepts and relations) not only from the single domain but also form the across-domain. A set of Chinese heuristic grammar rules are developed to extract the “is-a” relation between concepts to establish the concept hierarchy and other concept relations. A relation clustering method is also proposed to establish the relation hierarchy. Finally, the preliminary experiment showed that the built Ontology is satisfactory according to the evaluation of three human experts. 作者未提供 王豐緒 2004 學位論文 ; thesis 73 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 銘傳大學 === 資訊管理學系碩士班 === 92 === Due to the abundancy and versatileness of resources in the highly developed Internet, it is not easy to find the right resources in the Internet. One important reason is the keywords used in most search engines show inconsistency among different domains and contexts. Fortunately, Ontology could eliminate this negative aspect by providing more meaningful semantics. However, building Ontology is a time-consuming and expensive task. Therefore, this thesis proposes a semi-automatic approach to building Ontology by designing a learning mechanism from resources of HTML pages. We use CKIP system to tag the Chinese part-of-speech and improve the TFIDF method by considering the weights of different HTML tag in retrieving important domain terms. Furthermore, we propose a two-level term-finding algorithm that discovers important terms (concepts and relations) not only from the single domain but also form the across-domain. A set of Chinese heuristic grammar rules are developed to extract the “is-a” relation between concepts to establish the concept hierarchy and other concept relations. A relation clustering method is also proposed to establish the relation hierarchy. Finally, the preliminary experiment showed that the built Ontology is satisfactory according to the evaluation of three human experts.
|
author2 |
作者未提供 |
author_facet |
作者未提供 Wan-Shu Chao 趙婉舒 |
author |
Wan-Shu Chao 趙婉舒 |
spellingShingle |
Wan-Shu Chao 趙婉舒 On Ontology Learning from Learning Resources of HTML Pages |
author_sort |
Wan-Shu Chao |
title |
On Ontology Learning from Learning Resources of HTML Pages |
title_short |
On Ontology Learning from Learning Resources of HTML Pages |
title_full |
On Ontology Learning from Learning Resources of HTML Pages |
title_fullStr |
On Ontology Learning from Learning Resources of HTML Pages |
title_full_unstemmed |
On Ontology Learning from Learning Resources of HTML Pages |
title_sort |
on ontology learning from learning resources of html pages |
publishDate |
2004 |
url |
http://ndltd.ncl.edu.tw/handle/wgp7ah |
work_keys_str_mv |
AT wanshuchao onontologylearningfromlearningresourcesofhtmlpages AT zhàowǎnshū onontologylearningfromlearningresourcesofhtmlpages AT wanshuchao cónghtmlwǎngyècuìqǔxuéxízīyuánontologyzhīyánjiū AT zhàowǎnshū cónghtmlwǎngyècuìqǔxuéxízīyuánontologyzhīyánjiū |
_version_ |
1718632821662154752 |