Design and Evaluation of Algorithms for Topic Hierarchy Integration

碩士 === 國立中正大學 === 資訊工程研究所 === 90 === In this thesis, we study the problem of integrating documents from different sources into a comprehensive topic hierarchy. Our objective is to develop efficient techniques that improve the accuracy of traditional categorization methods by in...

Full description

Bibliographic Details
Main Authors: Chi-Feng Chang, 張啟峰
Other Authors: Jyh-Jong Tsay
Format: Others
Language:zh-TW
Published: 2002
Online Access:http://ndltd.ncl.edu.tw/handle/02471981001283162993
id ndltd-TW-090CCU00392031
record_format oai_dc
spelling ndltd-TW-090CCU003920312015-10-13T17:34:57Z http://ndltd.ncl.edu.tw/handle/02471981001283162993 Design and Evaluation of Algorithms for Topic Hierarchy Integration 整合階層式分類目錄的演算法設計及評估 Chi-Feng Chang 張啟峰 碩士 國立中正大學 資訊工程研究所 90 In this thesis, we study the problem of integrating documents from different sources into a comprehensive topic hierarchy. Our objective is to develop efficient techniques that improve the accuracy of traditional categorization methods by incorporating categorization information provided by data sources into categorization process. Notice that in the World-Wide Web, categorization information is often available from information sources. For example, news from newspapers, books from publishers, items from electronic commercial sites, or even web pages archived by web information portals are categorized. Observe that many of the topic hierarchies adopted by current information sources are highly related. We believe that categorization information can be used to improve classification accuracy. We present several techniques that explore relations between topic hierarchies and incorporate categorization information from source hierarchies into traditional classification methods such as Baysian methods and support vector machines. Experiment on collections from Openfind and Yam, and Google and Yahoo, well-known popular web sites in Taiwan and USA, respectively, shows that incorporating categorization information from source hierarchies can significantly improve the classification accuracy. Jyh-Jong Tsay 蔡志忠 2002 學位論文 ; thesis 78 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立中正大學 === 資訊工程研究所 === 90 === In this thesis, we study the problem of integrating documents from different sources into a comprehensive topic hierarchy. Our objective is to develop efficient techniques that improve the accuracy of traditional categorization methods by incorporating categorization information provided by data sources into categorization process. Notice that in the World-Wide Web, categorization information is often available from information sources. For example, news from newspapers, books from publishers, items from electronic commercial sites, or even web pages archived by web information portals are categorized. Observe that many of the topic hierarchies adopted by current information sources are highly related. We believe that categorization information can be used to improve classification accuracy. We present several techniques that explore relations between topic hierarchies and incorporate categorization information from source hierarchies into traditional classification methods such as Baysian methods and support vector machines. Experiment on collections from Openfind and Yam, and Google and Yahoo, well-known popular web sites in Taiwan and USA, respectively, shows that incorporating categorization information from source hierarchies can significantly improve the classification accuracy.
author2 Jyh-Jong Tsay
author_facet Jyh-Jong Tsay
Chi-Feng Chang
張啟峰
author Chi-Feng Chang
張啟峰
spellingShingle Chi-Feng Chang
張啟峰
Design and Evaluation of Algorithms for Topic Hierarchy Integration
author_sort Chi-Feng Chang
title Design and Evaluation of Algorithms for Topic Hierarchy Integration
title_short Design and Evaluation of Algorithms for Topic Hierarchy Integration
title_full Design and Evaluation of Algorithms for Topic Hierarchy Integration
title_fullStr Design and Evaluation of Algorithms for Topic Hierarchy Integration
title_full_unstemmed Design and Evaluation of Algorithms for Topic Hierarchy Integration
title_sort design and evaluation of algorithms for topic hierarchy integration
publishDate 2002
url http://ndltd.ncl.edu.tw/handle/02471981001283162993
work_keys_str_mv AT chifengchang designandevaluationofalgorithmsfortopichierarchyintegration
AT zhāngqǐfēng designandevaluationofalgorithmsfortopichierarchyintegration
AT chifengchang zhěnghéjiēcéngshìfēnlèimùlùdeyǎnsuànfǎshèjìjípínggū
AT zhāngqǐfēng zhěnghéjiēcéngshìfēnlèimùlùdeyǎnsuànfǎshèjìjípínggū
_version_ 1717782077312598016