Detecting Microblog Spam using User Behavior and Content Analysis
Master's === National Taiwan University of Science and Technology === Department of Computer Science and Information Engineering === 98 === The Internet has grown rapidly in recent years. The microblog is a new form of blog. A microblog differs from a traditional blog in that its content is typically much smaller, in both actual size and aggregate file size. A microblog post can contain up to 140 characters and appears on the aut...
Main Authors: | Shih-liang Chang, 張世良 |
---|---|
Other Authors: | Shi-Jinn Horng |
Format: | Others |
Language: | zh-TW |
Published: | 2010 |
Online Access: | http://ndltd.ncl.edu.tw/handle/83217783551264850485 |
id |
ndltd-TW-098NTUS5392074 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098NTUS5392074 2016-04-22T04:23:47Z http://ndltd.ncl.edu.tw/handle/83217783551264850485 Detecting Microblog Spam using User Behavior and Content Analysis 依使用者行為和內文分析偵測垃圾微網誌 Shih-liang Chang 張世良 Master's National Taiwan University of Science and Technology Department of Computer Science and Information Engineering 98 The Internet has grown rapidly in recent years, and the microblog has emerged as a new form of blog. A microblog differs from a traditional blog in that its content is typically much smaller, in both actual size and aggregate file size. A microblog post can contain up to 140 characters and appears on the author's profile page. Because microblogs are an easy way to contact other people, spammers can use them to spread malicious links, sex ads, and meaningless content that bothers users. This thesis proposes a method that combines content-based features and user-behavior features to identify whether a microblog is spam. The former are used to detect relationships among the posted contents, and the latter are used to characterize the user's behavior. The experimental data were all collected from Twitter and cover 2,100 users. Experimental results show that the proposed microblog spammer detector achieves a detection rate of up to 90%. Shi-Jinn Horng 洪西進 2010 Degree thesis ; thesis 50 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === National Taiwan University of Science and Technology === Department of Computer Science and Information Engineering === 98 === The Internet has grown rapidly in recent years, and the microblog has emerged as a new form of blog. A microblog differs from a traditional blog in that its content is typically much smaller, in both actual size and aggregate file size. A microblog post can contain up to 140 characters and appears on the author's profile page. Because microblogs are an easy way to contact other people, spammers can use them to spread malicious links, sex ads, and meaningless content that bothers users.
This thesis proposes a method that combines content-based features and user-behavior features to identify whether a microblog is spam. The former are used to detect relationships among the posted contents, and the latter are used to characterize the user's behavior. The experimental data were all collected from Twitter and cover 2,100 users. Experimental results show that the proposed microblog spammer detector achieves a detection rate of up to 90%.
|
author2 |
Shi-Jinn Horng |
author_facet |
Shi-Jinn Horng Shih-liang Chang 張世良 |
author |
Shih-liang Chang 張世良 |
author_sort |
Shih-liang Chang |
title |
Detecting Microblog Spam using User Behavior and Content Analysis |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/83217783551264850485 |