Burst Events Detection in Text Streams by Using Keyphrases
碩士 === 國立中正大學 === 資訊管理學系暨研究所 === 99 === Mining text streams for hot topics and events has attracted extensive attention in the world because of its broad applications. Since keyphrases have more expressive power than single term and keyphrases can be utilized to represent documents more semantically...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/44508835581874558353 |
id |
ndltd-TW-099CCU00396034 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099CCU003960342016-04-13T04:16:57Z http://ndltd.ncl.edu.tw/handle/44508835581874558353 Burst Events Detection in Text Streams by Using Keyphrases 使用關鍵片語及詞組從文字資料流找出突發事件 Chen, ShengHsiang 陳聖翔 碩士 國立中正大學 資訊管理學系暨研究所 99 Mining text streams for hot topics and events has attracted extensive attention in the world because of its broad applications. Since keyphrases have more expressive power than single term and keyphrases can be utilized to represent documents more semantically. In this research, we try to detecting burst events by using keyphrases. We give a formal definition to the above problem and present the frameworks with five steps to solve the problem: (1) use KP-Miner to extract keyphrases from text streams as features set; (2) cluster keyphrases with synonymy or hypernymy into groups; (3) calculate occurrence frequencies of the groups in sliding windows; (4) evaluate burst groups; (5) burst event as burst groups. We also find the problem about loosing potential burst groups in fixed time window. In order to alleviate this problem, the original time window and the shift time window are good ways to settle the problem. We evaluate the proposed framework on real Google news stream which is suitable for our research. Experimental results show that our framework can detect more descriptive burst events than external events. Wu, RoungShiunn 吳榮訓 2011 學位論文 ; thesis 51 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中正大學 === 資訊管理學系暨研究所 === 99 === Mining text streams for hot topics and events has attracted extensive attention in the world because of its broad applications. Since keyphrases have more expressive power than single term and keyphrases can be utilized to represent documents more semantically. In this research, we try to detecting burst events by using keyphrases.
We give a formal definition to the above problem and present the frameworks with five steps to solve the problem: (1) use KP-Miner to extract keyphrases from text streams as features set; (2) cluster keyphrases with synonymy or hypernymy into groups; (3) calculate occurrence frequencies of the groups in sliding windows; (4) evaluate burst groups; (5) burst event as burst groups. We also find the problem about loosing potential burst groups in fixed time window. In order to alleviate this problem, the original time window and the shift time window are good ways to settle the problem. We evaluate the proposed framework on real Google news stream which is suitable for our research. Experimental results show that our framework can detect more descriptive burst events than external events.
|
author2 |
Wu, RoungShiunn |
author_facet |
Wu, RoungShiunn Chen, ShengHsiang 陳聖翔 |
author |
Chen, ShengHsiang 陳聖翔 |
spellingShingle |
Chen, ShengHsiang 陳聖翔 Burst Events Detection in Text Streams by Using Keyphrases |
author_sort |
Chen, ShengHsiang |
title |
Burst Events Detection in Text Streams by Using Keyphrases |
title_short |
Burst Events Detection in Text Streams by Using Keyphrases |
title_full |
Burst Events Detection in Text Streams by Using Keyphrases |
title_fullStr |
Burst Events Detection in Text Streams by Using Keyphrases |
title_full_unstemmed |
Burst Events Detection in Text Streams by Using Keyphrases |
title_sort |
burst events detection in text streams by using keyphrases |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/44508835581874558353 |
work_keys_str_mv |
AT chenshenghsiang bursteventsdetectionintextstreamsbyusingkeyphrases AT chénshèngxiáng bursteventsdetectionintextstreamsbyusingkeyphrases AT chenshenghsiang shǐyòngguānjiànpiànyǔjícízǔcóngwénzìzīliàoliúzhǎochūtūfāshìjiàn AT chénshèngxiáng shǐyòngguānjiànpiànyǔjícízǔcóngwénzìzīliàoliúzhǎochūtūfāshìjiàn |
_version_ |
1718222254453555200 |