CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing

Email spam consumes a lot of network resources and threatens many systems because of its unwanted or malicious content. Most existing spam filters only target complete-spam but ignore semispam. This paper proposes a novel and comprehensive CPSFS scheme: Credible Personalized Spam Filtering Scheme, w...

Full description

Bibliographic Details
Main Authors: Xin Liu, Pingjun Zou, Weishan Zhang, Jiehan Zhou, Changying Dai, Feng Wang, Xiaomiao Zhang
Format: Article
Language:English
Published: Hindawi-Wiley 2017-01-01
Series:Wireless Communications and Mobile Computing
Online Access:http://dx.doi.org/10.1155/2017/1457870
id doaj-45e764d9d7ed44079fd3cf1ecce9db46
record_format Article
spelling doaj-45e764d9d7ed44079fd3cf1ecce9db462020-11-25T01:34:29ZengHindawi-WileyWireless Communications and Mobile Computing1530-86691530-86772017-01-01201710.1155/2017/14578701457870CPSFS: A Credible Personalized Spam Filtering Scheme by CrowdsourcingXin Liu0Pingjun Zou1Weishan Zhang2Jiehan Zhou3Changying Dai4Feng Wang5Xiaomiao Zhang6College of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaCollege of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaCollege of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaUniversity of Oulu, Oulu, FinlandCollege of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaCollege of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaCollege of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaEmail spam consumes a lot of network resources and threatens many systems because of its unwanted or malicious content. Most existing spam filters only target complete-spam but ignore semispam. This paper proposes a novel and comprehensive CPSFS scheme: Credible Personalized Spam Filtering Scheme, which classifies spam into two categories: complete-spam and semispam, and targets filtering both kinds of spam. Complete-spam is always spam for all users; semispam is an email identified as spam by some users and as regular email by other users. Most existing spam filters target complete-spam but ignore semispam. In CPSFS, Bayesian filtering is deployed at email servers to identify complete-spam, while semispam is identified at client side by crowdsourcing. An email user client can distinguish junk from legitimate emails according to spam reports from credible contacts with the similar interests. Social trust and interest similarity between users and their contacts are calculated so that spam reports are more accurately targeted to similar users. The experimental results show that the proposed CPSFS can improve the accuracy rate of distinguishing spam from legitimate emails compared with that of Bayesian filter alone.http://dx.doi.org/10.1155/2017/1457870
collection DOAJ
language English
format Article
sources DOAJ
author Xin Liu
Pingjun Zou
Weishan Zhang
Jiehan Zhou
Changying Dai
Feng Wang
Xiaomiao Zhang
spellingShingle Xin Liu
Pingjun Zou
Weishan Zhang
Jiehan Zhou
Changying Dai
Feng Wang
Xiaomiao Zhang
CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing
Wireless Communications and Mobile Computing
author_facet Xin Liu
Pingjun Zou
Weishan Zhang
Jiehan Zhou
Changying Dai
Feng Wang
Xiaomiao Zhang
author_sort Xin Liu
title CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing
title_short CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing
title_full CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing
title_fullStr CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing
title_full_unstemmed CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing
title_sort cpsfs: a credible personalized spam filtering scheme by crowdsourcing
publisher Hindawi-Wiley
series Wireless Communications and Mobile Computing
issn 1530-8669
1530-8677
publishDate 2017-01-01
description Email spam consumes a lot of network resources and threatens many systems because of its unwanted or malicious content. Most existing spam filters only target complete-spam but ignore semispam. This paper proposes a novel and comprehensive CPSFS scheme: Credible Personalized Spam Filtering Scheme, which classifies spam into two categories: complete-spam and semispam, and targets filtering both kinds of spam. Complete-spam is always spam for all users; semispam is an email identified as spam by some users and as regular email by other users. Most existing spam filters target complete-spam but ignore semispam. In CPSFS, Bayesian filtering is deployed at email servers to identify complete-spam, while semispam is identified at client side by crowdsourcing. An email user client can distinguish junk from legitimate emails according to spam reports from credible contacts with the similar interests. Social trust and interest similarity between users and their contacts are calculated so that spam reports are more accurately targeted to similar users. The experimental results show that the proposed CPSFS can improve the accuracy rate of distinguishing spam from legitimate emails compared with that of Bayesian filter alone.
url http://dx.doi.org/10.1155/2017/1457870
work_keys_str_mv AT xinliu cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing
AT pingjunzou cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing
AT weishanzhang cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing
AT jiehanzhou cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing
AT changyingdai cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing
AT fengwang cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing
AT xiaomiaozhang cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing
_version_ 1725071766551789568