Theme Classification for Web Forum Content

碩士 === 元智大學 === 資訊管理學系 === 98 === Web forum has enormous topical discussion or questions. After automatic theme classification, users can easier acquire the information for each theme, and also make use of the valuable information. The expectation of research method, adaptive weighting classificatio...

Full description

Bibliographic Details
Main Authors: Jia-Luen Tsai, 蔡佳倫
Other Authors: Chao-Chang Chiou
Format: Others
Language:zh-TW
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/74029617879366880025
id ndltd-TW-098YZU05396093
record_format oai_dc
spelling ndltd-TW-098YZU053960932015-10-13T18:20:57Z http://ndltd.ncl.edu.tw/handle/74029617879366880025 Theme Classification for Web Forum Content 自動議題分類機制於網路論壇之應用 Jia-Luen Tsai 蔡佳倫 碩士 元智大學 資訊管理學系 98 Web forum has enormous topical discussion or questions. After automatic theme classification, users can easier acquire the information for each theme, and also make use of the valuable information. The expectation of research method, adaptive weighting classification, is flexible and simple computation to theme classification. First, our research use word segmentation and selecting specific part of speech in order to obtain theme-related words. Second, we applied MI to extract keywords, and realize these keywords belong to which theme. Meanwhile, grouping keywords as keyword set (KS).Third, we calculate the importance of each KS in a document. Forth, applying Genetic Algorithm (GA) to optimize weights, we calculate the weighted score in each theme. And we take the theme which has highest score as the theme of the document. In experiment, we classified 11 phone-related themes. And the classification result shows the F-Measure of each theme are about 30% to 45%. Chao-Chang Chiou 邱昭彰 2010 學位論文 ; thesis 44 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 元智大學 === 資訊管理學系 === 98 === Web forum has enormous topical discussion or questions. After automatic theme classification, users can easier acquire the information for each theme, and also make use of the valuable information. The expectation of research method, adaptive weighting classification, is flexible and simple computation to theme classification. First, our research use word segmentation and selecting specific part of speech in order to obtain theme-related words. Second, we applied MI to extract keywords, and realize these keywords belong to which theme. Meanwhile, grouping keywords as keyword set (KS).Third, we calculate the importance of each KS in a document. Forth, applying Genetic Algorithm (GA) to optimize weights, we calculate the weighted score in each theme. And we take the theme which has highest score as the theme of the document. In experiment, we classified 11 phone-related themes. And the classification result shows the F-Measure of each theme are about 30% to 45%.
author2 Chao-Chang Chiou
author_facet Chao-Chang Chiou
Jia-Luen Tsai
蔡佳倫
author Jia-Luen Tsai
蔡佳倫
spellingShingle Jia-Luen Tsai
蔡佳倫
Theme Classification for Web Forum Content
author_sort Jia-Luen Tsai
title Theme Classification for Web Forum Content
title_short Theme Classification for Web Forum Content
title_full Theme Classification for Web Forum Content
title_fullStr Theme Classification for Web Forum Content
title_full_unstemmed Theme Classification for Web Forum Content
title_sort theme classification for web forum content
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/74029617879366880025
work_keys_str_mv AT jialuentsai themeclassificationforwebforumcontent
AT càijiālún themeclassificationforwebforumcontent
AT jialuentsai zìdòngyìtífēnlèijīzhìyúwǎnglùlùntánzhīyīngyòng
AT càijiālún zìdòngyìtífēnlèijīzhìyúwǎnglùlùntánzhīyīngyòng
_version_ 1718030569064890368