Burst Events Detection in Text Streams by Using Keyphrases

碩士 === 國立中正大學 === 資訊管理學系暨研究所 === 99 === Mining text streams for hot topics and events has attracted extensive attention in the world because of its broad applications. Since keyphrases have more expressive power than single term and keyphrases can be utilized to represent documents more semantically...

Full description

Bibliographic Details
Main Authors: Chen, ShengHsiang, 陳聖翔
Other Authors: Wu, RoungShiunn
Format: Others
Language:en_US
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/44508835581874558353
id ndltd-TW-099CCU00396034
record_format oai_dc
spelling ndltd-TW-099CCU003960342016-04-13T04:16:57Z http://ndltd.ncl.edu.tw/handle/44508835581874558353 Burst Events Detection in Text Streams by Using Keyphrases 使用關鍵片語及詞組從文字資料流找出突發事件 Chen, ShengHsiang 陳聖翔 碩士 國立中正大學 資訊管理學系暨研究所 99 Mining text streams for hot topics and events has attracted extensive attention in the world because of its broad applications. Since keyphrases have more expressive power than single term and keyphrases can be utilized to represent documents more semantically. In this research, we try to detecting burst events by using keyphrases. We give a formal definition to the above problem and present the frameworks with five steps to solve the problem: (1) use KP-Miner to extract keyphrases from text streams as features set; (2) cluster keyphrases with synonymy or hypernymy into groups; (3) calculate occurrence frequencies of the groups in sliding windows; (4) evaluate burst groups; (5) burst event as burst groups. We also find the problem about loosing potential burst groups in fixed time window. In order to alleviate this problem, the original time window and the shift time window are good ways to settle the problem. We evaluate the proposed framework on real Google news stream which is suitable for our research. Experimental results show that our framework can detect more descriptive burst events than external events. Wu, RoungShiunn 吳榮訓 2011 學位論文 ; thesis 51 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立中正大學 === 資訊管理學系暨研究所 === 99 === Mining text streams for hot topics and events has attracted extensive attention in the world because of its broad applications. Since keyphrases have more expressive power than single term and keyphrases can be utilized to represent documents more semantically. In this research, we try to detecting burst events by using keyphrases. We give a formal definition to the above problem and present the frameworks with five steps to solve the problem: (1) use KP-Miner to extract keyphrases from text streams as features set; (2) cluster keyphrases with synonymy or hypernymy into groups; (3) calculate occurrence frequencies of the groups in sliding windows; (4) evaluate burst groups; (5) burst event as burst groups. We also find the problem about loosing potential burst groups in fixed time window. In order to alleviate this problem, the original time window and the shift time window are good ways to settle the problem. We evaluate the proposed framework on real Google news stream which is suitable for our research. Experimental results show that our framework can detect more descriptive burst events than external events.
author2 Wu, RoungShiunn
author_facet Wu, RoungShiunn
Chen, ShengHsiang
陳聖翔
author Chen, ShengHsiang
陳聖翔
spellingShingle Chen, ShengHsiang
陳聖翔
Burst Events Detection in Text Streams by Using Keyphrases
author_sort Chen, ShengHsiang
title Burst Events Detection in Text Streams by Using Keyphrases
title_short Burst Events Detection in Text Streams by Using Keyphrases
title_full Burst Events Detection in Text Streams by Using Keyphrases
title_fullStr Burst Events Detection in Text Streams by Using Keyphrases
title_full_unstemmed Burst Events Detection in Text Streams by Using Keyphrases
title_sort burst events detection in text streams by using keyphrases
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/44508835581874558353
work_keys_str_mv AT chenshenghsiang bursteventsdetectionintextstreamsbyusingkeyphrases
AT chénshèngxiáng bursteventsdetectionintextstreamsbyusingkeyphrases
AT chenshenghsiang shǐyòngguānjiànpiànyǔjícízǔcóngwénzìzīliàoliúzhǎochūtūfāshìjiàn
AT chénshèngxiáng shǐyòngguānjiànpiànyǔjícízǔcóngwénzìzīliàoliúzhǎochūtūfāshìjiàn
_version_ 1718222254453555200