Design and Evaluation of Algorithms for Topic Hierarchy Integration
碩士 === 國立中正大學 === 資訊工程研究所 === 90 === In this thesis, we study the problem of integrating documents from different sources into a comprehensive topic hierarchy. Our objective is to develop efficient techniques that improve the accuracy of traditional categorization methods by in...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2002
|
Online Access: | http://ndltd.ncl.edu.tw/handle/02471981001283162993 |
id |
ndltd-TW-090CCU00392031 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-090CCU003920312015-10-13T17:34:57Z http://ndltd.ncl.edu.tw/handle/02471981001283162993 Design and Evaluation of Algorithms for Topic Hierarchy Integration 整合階層式分類目錄的演算法設計及評估 Chi-Feng Chang 張啟峰 碩士 國立中正大學 資訊工程研究所 90 In this thesis, we study the problem of integrating documents from different sources into a comprehensive topic hierarchy. Our objective is to develop efficient techniques that improve the accuracy of traditional categorization methods by incorporating categorization information provided by data sources into categorization process. Notice that in the World-Wide Web, categorization information is often available from information sources. For example, news from newspapers, books from publishers, items from electronic commercial sites, or even web pages archived by web information portals are categorized. Observe that many of the topic hierarchies adopted by current information sources are highly related. We believe that categorization information can be used to improve classification accuracy. We present several techniques that explore relations between topic hierarchies and incorporate categorization information from source hierarchies into traditional classification methods such as Baysian methods and support vector machines. Experiment on collections from Openfind and Yam, and Google and Yahoo, well-known popular web sites in Taiwan and USA, respectively, shows that incorporating categorization information from source hierarchies can significantly improve the classification accuracy. Jyh-Jong Tsay 蔡志忠 2002 學位論文 ; thesis 78 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中正大學 === 資訊工程研究所 === 90 === In this thesis, we study the problem of integrating documents from different sources into a comprehensive topic hierarchy. Our
objective is to develop efficient techniques that improve the
accuracy of traditional categorization methods by incorporating
categorization information provided by data sources into
categorization process. Notice that in the World-Wide Web,
categorization information is often available from information
sources. For example, news from newspapers, books from publishers, items from electronic commercial sites, or even web pages archived by web information portals are categorized. Observe that many of the topic hierarchies adopted by current information sources are highly related. We believe that categorization information can be used to improve classification accuracy. We present several techniques that explore relations between topic hierarchies and incorporate categorization information from source hierarchies into traditional classification methods such as Baysian methods
and support vector machines. Experiment on collections from
Openfind and Yam, and Google and Yahoo, well-known popular web
sites in Taiwan and USA, respectively, shows that incorporating
categorization information from source hierarchies can
significantly improve the classification accuracy.
|
author2 |
Jyh-Jong Tsay |
author_facet |
Jyh-Jong Tsay Chi-Feng Chang 張啟峰 |
author |
Chi-Feng Chang 張啟峰 |
spellingShingle |
Chi-Feng Chang 張啟峰 Design and Evaluation of Algorithms for Topic Hierarchy Integration |
author_sort |
Chi-Feng Chang |
title |
Design and Evaluation of Algorithms for Topic Hierarchy Integration |
title_short |
Design and Evaluation of Algorithms for Topic Hierarchy Integration |
title_full |
Design and Evaluation of Algorithms for Topic Hierarchy Integration |
title_fullStr |
Design and Evaluation of Algorithms for Topic Hierarchy Integration |
title_full_unstemmed |
Design and Evaluation of Algorithms for Topic Hierarchy Integration |
title_sort |
design and evaluation of algorithms for topic hierarchy integration |
publishDate |
2002 |
url |
http://ndltd.ncl.edu.tw/handle/02471981001283162993 |
work_keys_str_mv |
AT chifengchang designandevaluationofalgorithmsfortopichierarchyintegration AT zhāngqǐfēng designandevaluationofalgorithmsfortopichierarchyintegration AT chifengchang zhěnghéjiēcéngshìfēnlèimùlùdeyǎnsuànfǎshèjìjípínggū AT zhāngqǐfēng zhěnghéjiēcéngshìfēnlèimùlùdeyǎnsuànfǎshèjìjípínggū |
_version_ |
1717782077312598016 |