Incremental Significant URL Mining and Comments Summarization in Real-Time Social Media

博士 === 國立臺灣大學 === 電機工程學研究所 === 105 === Social media platforms have emerged as a powerful and real-time means of communication recently. People are using social media to share and exchange information about any events, ranging from breaking news stories to natural disasters and information about loca...

Full description

Bibliographic Details
Main Authors: Cheng-Ying Liu, 劉承穎
Other Authors: Ming-Syan Chen
Format: Others
Language:en_US
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/w3942j
id ndltd-TW-105NTU05442069
record_format oai_dc
spelling ndltd-TW-105NTU054420692019-05-15T23:39:39Z http://ndltd.ncl.edu.tw/handle/w3942j Incremental Significant URL Mining and Comments Summarization in Real-Time Social Media 於即時社群媒體之增益式超連結探勘與訊息串流彙總 Cheng-Ying Liu 劉承穎 博士 國立臺灣大學 電機工程學研究所 105 Social media platforms have emerged as a powerful and real-time means of communication recently. People are using social media to share and exchange information about any events, ranging from breaking news stories to natural disasters and information about local festivals. With the help of rapid development of mobile technologies, messages posted in social media can typically reflect these events as they happen. However, since the dramatic growth of the social media data, it becomes infeasible for users to read all posts or comments. Therefore, mining and summarizing rich user generated content in social media can present great opportunities for developing many potential applications (e.g., breaking news discovery, traffic monitoring, and natural disaster monitoring.) On the other hand, the celebrities, corporations, and organizations also set up social pages to interact with their fans and the public. Although it is important for them to understand how their fans and customers reacting to certain topics and content, the volume and the rapidly increment nature of social media make it time-consuming to get the overview of a comment stream. Therefore, in this dissertation, we first propose a significant URL mining approach (named SURLMINE) to rank the URL on social media based on various features. Note that URL is a global language without language dependency. It is also worthy to know that only 35\% of tweets on Twitter are posted in English. In other words, mining social media content through URL is able to involve more data from different languages. Most of all, it is efficient and there is no lost in translation. On the other hand, to summarize the comment stream, we propose a real-time incremental short text summarization on comment streams (abbreviated as IncreSTS) to provide an at-a-glance presentation that users can easily and rapidly understand the main points of similar comments. Our experiments conducted on real datasets show that the SURLMINE can reach up to 92\% of precision based on YouTube datasets and the increSTS possesses the advantages of high efficiency, high scalability, and better handling outliers on the target problem. Ming-Syan Chen 陳銘憲 2017 學位論文 ; thesis 69 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 博士 === 國立臺灣大學 === 電機工程學研究所 === 105 === Social media platforms have emerged as a powerful and real-time means of communication recently. People are using social media to share and exchange information about any events, ranging from breaking news stories to natural disasters and information about local festivals. With the help of rapid development of mobile technologies, messages posted in social media can typically reflect these events as they happen. However, since the dramatic growth of the social media data, it becomes infeasible for users to read all posts or comments. Therefore, mining and summarizing rich user generated content in social media can present great opportunities for developing many potential applications (e.g., breaking news discovery, traffic monitoring, and natural disaster monitoring.) On the other hand, the celebrities, corporations, and organizations also set up social pages to interact with their fans and the public. Although it is important for them to understand how their fans and customers reacting to certain topics and content, the volume and the rapidly increment nature of social media make it time-consuming to get the overview of a comment stream. Therefore, in this dissertation, we first propose a significant URL mining approach (named SURLMINE) to rank the URL on social media based on various features. Note that URL is a global language without language dependency. It is also worthy to know that only 35\% of tweets on Twitter are posted in English. In other words, mining social media content through URL is able to involve more data from different languages. Most of all, it is efficient and there is no lost in translation. On the other hand, to summarize the comment stream, we propose a real-time incremental short text summarization on comment streams (abbreviated as IncreSTS) to provide an at-a-glance presentation that users can easily and rapidly understand the main points of similar comments. Our experiments conducted on real datasets show that the SURLMINE can reach up to 92\% of precision based on YouTube datasets and the increSTS possesses the advantages of high efficiency, high scalability, and better handling outliers on the target problem.
author2 Ming-Syan Chen
author_facet Ming-Syan Chen
Cheng-Ying Liu
劉承穎
author Cheng-Ying Liu
劉承穎
spellingShingle Cheng-Ying Liu
劉承穎
Incremental Significant URL Mining and Comments Summarization in Real-Time Social Media
author_sort Cheng-Ying Liu
title Incremental Significant URL Mining and Comments Summarization in Real-Time Social Media
title_short Incremental Significant URL Mining and Comments Summarization in Real-Time Social Media
title_full Incremental Significant URL Mining and Comments Summarization in Real-Time Social Media
title_fullStr Incremental Significant URL Mining and Comments Summarization in Real-Time Social Media
title_full_unstemmed Incremental Significant URL Mining and Comments Summarization in Real-Time Social Media
title_sort incremental significant url mining and comments summarization in real-time social media
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/w3942j
work_keys_str_mv AT chengyingliu incrementalsignificanturlminingandcommentssummarizationinrealtimesocialmedia
AT liúchéngyǐng incrementalsignificanturlminingandcommentssummarizationinrealtimesocialmedia
AT chengyingliu yújíshíshèqúnméitǐzhīzēngyìshìchāoliánjiétànkānyǔxùnxīchuànliúhuìzǒng
AT liúchéngyǐng yújíshíshèqúnméitǐzhīzēngyìshìchāoliánjiétànkānyǔxùnxīchuànliúhuìzǒng
_version_ 1719151886713487360