On Ontology Learning from Learning Resources of HTML Pages

碩士 === 銘傳大學 === 資訊管理學系碩士班 === 92 ===   Due to the abundancy and versatileness of resources in the highly developed Internet, it is not easy to find the right resources in the Internet. One important reason is the keywords used in most search engines show inconsistency among different domains and con...

Full description

Bibliographic Details
Main Authors: Wan-Shu Chao, 趙婉舒
Other Authors: 作者未提供
Format: Others
Language:zh-TW
Published: 2004
Online Access:http://ndltd.ncl.edu.tw/handle/wgp7ah
id ndltd-TW-092MCU00396018
record_format oai_dc
spelling ndltd-TW-092MCU003960182018-04-27T04:28:41Z http://ndltd.ncl.edu.tw/handle/wgp7ah On Ontology Learning from Learning Resources of HTML Pages 從HTML網頁萃取學習資源Ontology之研究 Wan-Shu Chao 趙婉舒 碩士 銘傳大學 資訊管理學系碩士班 92   Due to the abundancy and versatileness of resources in the highly developed Internet, it is not easy to find the right resources in the Internet. One important reason is the keywords used in most search engines show inconsistency among different domains and contexts. Fortunately, Ontology could eliminate this negative aspect by providing more meaningful semantics. However, building Ontology is a time-consuming and expensive task. Therefore, this thesis proposes a semi-automatic approach to building Ontology by designing a learning mechanism from resources of HTML pages. We use CKIP system to tag the Chinese part-of-speech and improve the TFIDF method by considering the weights of different HTML tag in retrieving important domain terms. Furthermore, we propose a two-level term-finding algorithm that discovers important terms (concepts and relations) not only from the single domain but also form the across-domain. A set of Chinese heuristic grammar rules are developed to extract the “is-a” relation between concepts to establish the concept hierarchy and other concept relations. A relation clustering method is also proposed to establish the relation hierarchy. Finally, the preliminary experiment showed that the built Ontology is satisfactory according to the evaluation of three human experts. 作者未提供 王豐緒 2004 學位論文 ; thesis 73 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 銘傳大學 === 資訊管理學系碩士班 === 92 ===   Due to the abundancy and versatileness of resources in the highly developed Internet, it is not easy to find the right resources in the Internet. One important reason is the keywords used in most search engines show inconsistency among different domains and contexts. Fortunately, Ontology could eliminate this negative aspect by providing more meaningful semantics. However, building Ontology is a time-consuming and expensive task. Therefore, this thesis proposes a semi-automatic approach to building Ontology by designing a learning mechanism from resources of HTML pages. We use CKIP system to tag the Chinese part-of-speech and improve the TFIDF method by considering the weights of different HTML tag in retrieving important domain terms. Furthermore, we propose a two-level term-finding algorithm that discovers important terms (concepts and relations) not only from the single domain but also form the across-domain. A set of Chinese heuristic grammar rules are developed to extract the “is-a” relation between concepts to establish the concept hierarchy and other concept relations. A relation clustering method is also proposed to establish the relation hierarchy. Finally, the preliminary experiment showed that the built Ontology is satisfactory according to the evaluation of three human experts.
author2 作者未提供
author_facet 作者未提供
Wan-Shu Chao
趙婉舒
author Wan-Shu Chao
趙婉舒
spellingShingle Wan-Shu Chao
趙婉舒
On Ontology Learning from Learning Resources of HTML Pages
author_sort Wan-Shu Chao
title On Ontology Learning from Learning Resources of HTML Pages
title_short On Ontology Learning from Learning Resources of HTML Pages
title_full On Ontology Learning from Learning Resources of HTML Pages
title_fullStr On Ontology Learning from Learning Resources of HTML Pages
title_full_unstemmed On Ontology Learning from Learning Resources of HTML Pages
title_sort on ontology learning from learning resources of html pages
publishDate 2004
url http://ndltd.ncl.edu.tw/handle/wgp7ah
work_keys_str_mv AT wanshuchao onontologylearningfromlearningresourcesofhtmlpages
AT zhàowǎnshū onontologylearningfromlearningresourcesofhtmlpages
AT wanshuchao cónghtmlwǎngyècuìqǔxuéxízīyuánontologyzhīyánjiū
AT zhàowǎnshū cónghtmlwǎngyècuìqǔxuéxízīyuánontologyzhīyánjiū
_version_ 1718632821662154752