On Ontology Learning from Learning Resources of HTML Pages

碩士 === 銘傳大學 === 資訊管理學系碩士班 === 92 === 　　Due to the abundancy and versatileness of resources in the highly developed Internet, it is not easy to find the right resources in the Internet. One important reason is the keywords used in most search engines show inconsistency among different domains and con...

Full description

Bibliographic Details
Main Authors:	Wan-Shu Chao, 趙婉舒
Other Authors:	作者未提供
Format:	Others
Language:	zh-TW
Published:	2004
Online Access:	http://ndltd.ncl.edu.tw/handle/wgp7ah

id	ndltd-TW-092MCU00396018
record_format	oai_dc
spelling	ndltd-TW-092MCU003960182018-04-27T04:28:41Z http://ndltd.ncl.edu.tw/handle/wgp7ah On Ontology Learning from Learning Resources of HTML Pages 從HTML網頁萃取學習資源Ontology之研究 Wan-Shu Chao 趙婉舒碩士銘傳大學資訊管理學系碩士班 92 　　Due to the abundancy and versatileness of resources in the highly developed Internet, it is not easy to find the right resources in the Internet. One important reason is the keywords used in most search engines show inconsistency among different domains and contexts. Fortunately, Ontology could eliminate this negative aspect by providing more meaningful semantics. However, building Ontology is a time-consuming and expensive task. Therefore, this thesis proposes a semi-automatic approach to building Ontology by designing a learning mechanism from resources of HTML pages. We use CKIP system to tag the Chinese part-of-speech and improve the TFIDF method by considering the weights of different HTML tag in retrieving important domain terms. Furthermore, we propose a two-level term-finding algorithm that discovers important terms (concepts and relations) not only from the single domain but also form the across-domain. A set of Chinese heuristic grammar rules are developed to extract the “is-a” relation between concepts to establish the concept hierarchy and other concept relations. A relation clustering method is also proposed to establish the relation hierarchy. Finally, the preliminary experiment showed that the built Ontology is satisfactory according to the evaluation of three human experts. 作者未提供王豐緒 2004 學位論文 ; thesis 73 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 銘傳大學 === 資訊管理學系碩士班 === 92 === 　　Due to the abundancy and versatileness of resources in the highly developed Internet, it is not easy to find the right resources in the Internet. One important reason is the keywords used in most search engines show inconsistency among different domains and contexts. Fortunately, Ontology could eliminate this negative aspect by providing more meaningful semantics. However, building Ontology is a time-consuming and expensive task. Therefore, this thesis proposes a semi-automatic approach to building Ontology by designing a learning mechanism from resources of HTML pages. We use CKIP system to tag the Chinese part-of-speech and improve the TFIDF method by considering the weights of different HTML tag in retrieving important domain terms. Furthermore, we propose a two-level term-finding algorithm that discovers important terms (concepts and relations) not only from the single domain but also form the across-domain. A set of Chinese heuristic grammar rules are developed to extract the “is-a” relation between concepts to establish the concept hierarchy and other concept relations. A relation clustering method is also proposed to establish the relation hierarchy. Finally, the preliminary experiment showed that the built Ontology is satisfactory according to the evaluation of three human experts.
author2	作者未提供
author_facet	作者未提供 Wan-Shu Chao 趙婉舒
author	Wan-Shu Chao 趙婉舒
spellingShingle	Wan-Shu Chao 趙婉舒 On Ontology Learning from Learning Resources of HTML Pages
author_sort	Wan-Shu Chao
title	On Ontology Learning from Learning Resources of HTML Pages
title_short	On Ontology Learning from Learning Resources of HTML Pages
title_full	On Ontology Learning from Learning Resources of HTML Pages
title_fullStr	On Ontology Learning from Learning Resources of HTML Pages
title_full_unstemmed	On Ontology Learning from Learning Resources of HTML Pages
title_sort	on ontology learning from learning resources of html pages
publishDate	2004
url	http://ndltd.ncl.edu.tw/handle/wgp7ah
work_keys_str_mv	AT wanshuchao onontologylearningfromlearningresourcesofhtmlpages AT zhàowǎnshū onontologylearningfromlearningresourcesofhtmlpages AT wanshuchao cónghtmlwǎngyècuìqǔxuéxízīyuánontologyzhīyánjiū AT zhàowǎnshū cónghtmlwǎngyècuìqǔxuéxízīyuánontologyzhīyánjiū
_version_	1718632821662154752

On Ontology Learning from Learning Resources of HTML Pages

Similar Items