Theme Classification for Web Forum Content
碩士 === 元智大學 === 資訊管理學系 === 98 === Web forum has enormous topical discussion or questions. After automatic theme classification, users can easier acquire the information for each theme, and also make use of the valuable information. The expectation of research method, adaptive weighting classificatio...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/74029617879366880025 |
id |
ndltd-TW-098YZU05396093 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098YZU053960932015-10-13T18:20:57Z http://ndltd.ncl.edu.tw/handle/74029617879366880025 Theme Classification for Web Forum Content 自動議題分類機制於網路論壇之應用 Jia-Luen Tsai 蔡佳倫 碩士 元智大學 資訊管理學系 98 Web forum has enormous topical discussion or questions. After automatic theme classification, users can easier acquire the information for each theme, and also make use of the valuable information. The expectation of research method, adaptive weighting classification, is flexible and simple computation to theme classification. First, our research use word segmentation and selecting specific part of speech in order to obtain theme-related words. Second, we applied MI to extract keywords, and realize these keywords belong to which theme. Meanwhile, grouping keywords as keyword set (KS).Third, we calculate the importance of each KS in a document. Forth, applying Genetic Algorithm (GA) to optimize weights, we calculate the weighted score in each theme. And we take the theme which has highest score as the theme of the document. In experiment, we classified 11 phone-related themes. And the classification result shows the F-Measure of each theme are about 30% to 45%. Chao-Chang Chiou 邱昭彰 2010 學位論文 ; thesis 44 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 元智大學 === 資訊管理學系 === 98 === Web forum has enormous topical discussion or questions. After automatic theme classification, users can easier acquire the information for each theme, and also make use of the valuable information. The expectation of research method, adaptive weighting classification, is flexible and simple computation to theme classification.
First, our research use word segmentation and selecting specific part of speech in order to obtain theme-related words. Second, we applied MI to extract keywords, and realize these keywords belong to which theme. Meanwhile, grouping keywords as keyword set (KS).Third, we calculate the importance of each KS in a document. Forth, applying Genetic Algorithm (GA) to optimize weights, we calculate the weighted score in each theme. And we take the theme which has highest score as the theme of the document.
In experiment, we classified 11 phone-related themes. And the classification result shows the F-Measure of each theme are about 30% to 45%.
|
author2 |
Chao-Chang Chiou |
author_facet |
Chao-Chang Chiou Jia-Luen Tsai 蔡佳倫 |
author |
Jia-Luen Tsai 蔡佳倫 |
spellingShingle |
Jia-Luen Tsai 蔡佳倫 Theme Classification for Web Forum Content |
author_sort |
Jia-Luen Tsai |
title |
Theme Classification for Web Forum Content |
title_short |
Theme Classification for Web Forum Content |
title_full |
Theme Classification for Web Forum Content |
title_fullStr |
Theme Classification for Web Forum Content |
title_full_unstemmed |
Theme Classification for Web Forum Content |
title_sort |
theme classification for web forum content |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/74029617879366880025 |
work_keys_str_mv |
AT jialuentsai themeclassificationforwebforumcontent AT càijiālún themeclassificationforwebforumcontent AT jialuentsai zìdòngyìtífēnlèijīzhìyúwǎnglùlùntánzhīyīngyòng AT càijiālún zìdòngyìtífēnlèijīzhìyúwǎnglùlùntánzhīyīngyòng |
_version_ |
1718030569064890368 |