Detecting Microblog Spam using User Behavior and Content Analysis

碩士 === 國立臺灣科技大學 === 資訊工程系 === 98 === In these years, Internet grows up quickly. Microblog is a new form of blog. A microblog differs from a traditional blog in that its content is typically much smaller, in both actual size and aggregate file size. Microbolog can post up to 140 characters on the aut...

Full description

Bibliographic Details
Main Authors: Shih-liang Chang, 張世良
Other Authors: Shi-Jinn Horng
Format: Others
Language:zh-TW
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/83217783551264850485
id ndltd-TW-098NTUS5392074
record_format oai_dc
spelling ndltd-TW-098NTUS53920742016-04-22T04:23:47Z http://ndltd.ncl.edu.tw/handle/83217783551264850485 Detecting Microblog Spam using User Behavior and Content Analysis 依使用者行為和內文分析偵測垃圾微網誌 Shih-liang Chang 張世良 碩士 國立臺灣科技大學 資訊工程系 98 In these years, Internet grows up quickly. Microblog is a new form of blog. A microblog differs from a traditional blog in that its content is typically much smaller, in both actual size and aggregate file size. Microbolog can post up to 140 characters on the author's profile page. Because microblog is an easy way to contact with other people, spammer could use it to spread malicious links, sex ad and meaningless content to bother users. This paper propose a method that combines Content-based features and User-behavior features to identify if a mircoblog is a spam. The former is used to detect the relationship of the posted contents and the latter is used to detect the user’s behavior. The data in the experimental database all were collected from Twitter's users and there are 2100 users. Experimental results show that the detection rate of the proposed microblog spammer detector is up to 90%. Shi-Jinn Horng 洪西進 2010 學位論文 ; thesis 50 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 資訊工程系 === 98 === In these years, Internet grows up quickly. Microblog is a new form of blog. A microblog differs from a traditional blog in that its content is typically much smaller, in both actual size and aggregate file size. Microbolog can post up to 140 characters on the author's profile page. Because microblog is an easy way to contact with other people, spammer could use it to spread malicious links, sex ad and meaningless content to bother users. This paper propose a method that combines Content-based features and User-behavior features to identify if a mircoblog is a spam. The former is used to detect the relationship of the posted contents and the latter is used to detect the user’s behavior. The data in the experimental database all were collected from Twitter's users and there are 2100 users. Experimental results show that the detection rate of the proposed microblog spammer detector is up to 90%.
author2 Shi-Jinn Horng
author_facet Shi-Jinn Horng
Shih-liang Chang
張世良
author Shih-liang Chang
張世良
spellingShingle Shih-liang Chang
張世良
Detecting Microblog Spam using User Behavior and Content Analysis
author_sort Shih-liang Chang
title Detecting Microblog Spam using User Behavior and Content Analysis
title_short Detecting Microblog Spam using User Behavior and Content Analysis
title_full Detecting Microblog Spam using User Behavior and Content Analysis
title_fullStr Detecting Microblog Spam using User Behavior and Content Analysis
title_full_unstemmed Detecting Microblog Spam using User Behavior and Content Analysis
title_sort detecting microblog spam using user behavior and content analysis
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/83217783551264850485
work_keys_str_mv AT shihliangchang detectingmicroblogspamusinguserbehaviorandcontentanalysis
AT zhāngshìliáng detectingmicroblogspamusinguserbehaviorandcontentanalysis
AT shihliangchang yīshǐyòngzhěxíngwèihénèiwénfēnxīzhēncèlājīwēiwǎngzhì
AT zhāngshìliáng yīshǐyòngzhěxíngwèihénèiwénfēnxīzhēncèlājīwēiwǎngzhì
_version_ 1718231243933351936