A Microblog Content Fusion System Based on User Participation Degree and Enhanced NIL Model
Master's thesis === National Cheng Kung University === Department of Computer Science and Information Engineering (Master's and Doctoral Program) === 100 === Because content length is limited, microblog users publish their opinions as condensed text accompanied by some non-textual content. Moreover, user-generated content often includes chaotic messages, useless information, or information unrelated to th...
Main Authors: | Wo-ChenLiu, 柳沃辰 |
---|---|
Other Authors: | Yau-Hwang Kuo, 郭耀煌 |
Format: | Others |
Language: | en_US |
Published: | 2012 |
Online Access: | http://ndltd.ncl.edu.tw/handle/87957026524946796743 |
id |
ndltd-TW-100NCKU5392091 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-100NCKU5392091 2015-10-13T21:38:04Z http://ndltd.ncl.edu.tw/handle/87957026524946796743 A Microblog Content Fusion System Based on User Participation Degree and Enhanced NIL Model 基於討論參與度與非正規網路語言增強模型之微網誌內容融合系統 Wo-ChenLiu 柳沃辰 Master's thesis National Cheng Kung University Department of Computer Science and Information Engineering (Master's and Doctoral Program) 100 Because content length is limited, microblog users publish their opinions as condensed text accompanied by some non-textual content. Moreover, user-generated content often includes chaotic messages, useless information, or information unrelated to the theme of the original post. Microblog posts and responses also contain Network Informal Language (NIL), such as abbreviations, misspelled words, and phonetic words. In this paper, a novel approach, Maximum Discussion Group Detection (MDGD), is proposed to detect maximum discussion groups (MDGs) from each post and its responses. Briefly, the MDGs with a higher user participation degree are selected, and significant terms are extracted from the unconventional expressions in microblog posts by modified NIL and Lexical Chain models. To enrich the fusion results, related content is retrieved from multiple microblog platforms according to the previously extracted terms. In the experiments, we use a test data set, collected from the microblog platforms Plurk and Facebook, that covers the terms “林書豪” (Jeremy Lin), “馬英九” (Ma Ying-jeou), and “蔡英文” (Tsai Ing-wen). Then, the NIL dictionary is constructed for the ENIL model. Compared with CKIP, the segmentation results indicate that the precision of ENIL improves significantly, by 7.4% to 17.5%. Finally, the NDCG metric is used to evaluate user satisfaction with the fusion results. The user satisfaction results show that our system is capable of providing qualified fused results. Yau-Hwang Kuo 郭耀煌 2012 學位論文 ; thesis 46 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others |
sources |
NDLTD |
description |
Master's thesis === National Cheng Kung University === Department of Computer Science and Information Engineering (Master's and Doctoral Program) === 100 === Because content length is limited, microblog users publish their opinions as condensed text accompanied by some non-textual content. Moreover, user-generated content often includes chaotic messages, useless information, or information unrelated to the theme of the original post. Microblog posts and responses also contain Network Informal Language (NIL), such as abbreviations, misspelled words, and phonetic words. In this paper, a novel approach, Maximum Discussion Group Detection (MDGD), is proposed to detect maximum discussion groups (MDGs) from each post and its responses. Briefly, the MDGs with a higher user participation degree are selected, and significant terms are extracted from the unconventional expressions in microblog posts by modified NIL and Lexical Chain models. To enrich the fusion results, related content is retrieved from multiple microblog platforms according to the previously extracted terms.
In the experiments, we use a test data set, collected from the microblog platforms Plurk and Facebook, that covers the terms “林書豪” (Jeremy Lin), “馬英九” (Ma Ying-jeou), and “蔡英文” (Tsai Ing-wen). Then, the NIL dictionary is constructed for the ENIL model. Compared with CKIP, the segmentation results indicate that the precision of ENIL improves significantly, by 7.4% to 17.5%. Finally, the NDCG metric is used to evaluate user satisfaction with the fusion results. The user satisfaction results show that our system is capable of providing qualified fused results.
|
author2 |
Yau-Hwang Kuo |
author_facet |
Yau-Hwang Kuo Wo-ChenLiu 柳沃辰 |
author |
Wo-ChenLiu 柳沃辰 |
spellingShingle |
Wo-ChenLiu 柳沃辰 A Microblog Content Fusion System Based on User Participation Degree and Enhanced NIL Model |
author_sort |
Wo-ChenLiu |
title |
A Microblog Content Fusion System Based on User Participation Degree and Enhanced NIL Model |
title_short |
A Microblog Content Fusion System Based on User Participation Degree and Enhanced NIL Model |
title_full |
A Microblog Content Fusion System Based on User Participation Degree and Enhanced NIL Model |
title_fullStr |
A Microblog Content Fusion System Based on User Participation Degree and Enhanced NIL Model |
title_full_unstemmed |
A Microblog Content Fusion System Based on User Participation Degree and Enhanced NIL Model |
title_sort |
microblog content fusion system based on user participation degree and enhanced nil model |
publishDate |
2012 |
url |
http://ndltd.ncl.edu.tw/handle/87957026524946796743 |
work_keys_str_mv |
AT wochenliu amicroblogcontentfusionsystembasedonuserparticipationdegreeandenhancednilmodel AT liǔwòchén amicroblogcontentfusionsystembasedonuserparticipationdegreeandenhancednilmodel AT wochenliu jīyútǎolùncānyǔdùyǔfēizhèngguīwǎnglùyǔyánzēngqiángmóxíngzhīwēiwǎngzhìnèiróngrónghéxìtǒng AT liǔwòchén jīyútǎolùncānyǔdùyǔfēizhèngguīwǎnglùyǔyánzēngqiángmóxíngzhīwēiwǎngzhìnèiróngrónghéxìtǒng AT wochenliu microblogcontentfusionsystembasedonuserparticipationdegreeandenhancednilmodel AT liǔwòchén microblogcontentfusionsystembasedonuserparticipationdegreeandenhancednilmodel |
_version_ |
1718067621390188544 |