CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing
Email spam consumes a lot of network resources and threatens many systems because of its unwanted or malicious content. Most existing spam filters only target complete-spam but ignore semispam. This paper proposes a novel and comprehensive CPSFS scheme: Credible Personalized Spam Filtering Scheme, w...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi-Wiley
2017-01-01
|
Series: | Wireless Communications and Mobile Computing |
Online Access: | http://dx.doi.org/10.1155/2017/1457870 |
id |
doaj-45e764d9d7ed44079fd3cf1ecce9db46 |
---|---|
record_format |
Article |
spelling |
doaj-45e764d9d7ed44079fd3cf1ecce9db462020-11-25T01:34:29ZengHindawi-WileyWireless Communications and Mobile Computing1530-86691530-86772017-01-01201710.1155/2017/14578701457870CPSFS: A Credible Personalized Spam Filtering Scheme by CrowdsourcingXin Liu0Pingjun Zou1Weishan Zhang2Jiehan Zhou3Changying Dai4Feng Wang5Xiaomiao Zhang6College of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaCollege of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaCollege of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaUniversity of Oulu, Oulu, FinlandCollege of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaCollege of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaCollege of Computer & Communication Engineering China University of Petroleum (East China), Qingdao, ChinaEmail spam consumes a lot of network resources and threatens many systems because of its unwanted or malicious content. Most existing spam filters only target complete-spam but ignore semispam. This paper proposes a novel and comprehensive CPSFS scheme: Credible Personalized Spam Filtering Scheme, which classifies spam into two categories: complete-spam and semispam, and targets filtering both kinds of spam. Complete-spam is always spam for all users; semispam is an email identified as spam by some users and as regular email by other users. Most existing spam filters target complete-spam but ignore semispam. In CPSFS, Bayesian filtering is deployed at email servers to identify complete-spam, while semispam is identified at client side by crowdsourcing. An email user client can distinguish junk from legitimate emails according to spam reports from credible contacts with the similar interests. Social trust and interest similarity between users and their contacts are calculated so that spam reports are more accurately targeted to similar users. The experimental results show that the proposed CPSFS can improve the accuracy rate of distinguishing spam from legitimate emails compared with that of Bayesian filter alone.http://dx.doi.org/10.1155/2017/1457870 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xin Liu Pingjun Zou Weishan Zhang Jiehan Zhou Changying Dai Feng Wang Xiaomiao Zhang |
spellingShingle |
Xin Liu Pingjun Zou Weishan Zhang Jiehan Zhou Changying Dai Feng Wang Xiaomiao Zhang CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing Wireless Communications and Mobile Computing |
author_facet |
Xin Liu Pingjun Zou Weishan Zhang Jiehan Zhou Changying Dai Feng Wang Xiaomiao Zhang |
author_sort |
Xin Liu |
title |
CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing |
title_short |
CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing |
title_full |
CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing |
title_fullStr |
CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing |
title_full_unstemmed |
CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing |
title_sort |
cpsfs: a credible personalized spam filtering scheme by crowdsourcing |
publisher |
Hindawi-Wiley |
series |
Wireless Communications and Mobile Computing |
issn |
1530-8669 1530-8677 |
publishDate |
2017-01-01 |
description |
Email spam consumes a lot of network resources and threatens many systems because of its unwanted or malicious content. Most existing spam filters only target complete-spam but ignore semispam. This paper proposes a novel and comprehensive CPSFS scheme: Credible Personalized Spam Filtering Scheme, which classifies spam into two categories: complete-spam and semispam, and targets filtering both kinds of spam. Complete-spam is always spam for all users; semispam is an email identified as spam by some users and as regular email by other users. Most existing spam filters target complete-spam but ignore semispam. In CPSFS, Bayesian filtering is deployed at email servers to identify complete-spam, while semispam is identified at client side by crowdsourcing. An email user client can distinguish junk from legitimate emails according to spam reports from credible contacts with the similar interests. Social trust and interest similarity between users and their contacts are calculated so that spam reports are more accurately targeted to similar users. The experimental results show that the proposed CPSFS can improve the accuracy rate of distinguishing spam from legitimate emails compared with that of Bayesian filter alone. |
url |
http://dx.doi.org/10.1155/2017/1457870 |
work_keys_str_mv |
AT xinliu cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing AT pingjunzou cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing AT weishanzhang cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing AT jiehanzhou cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing AT changyingdai cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing AT fengwang cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing AT xiaomiaozhang cpsfsacrediblepersonalizedspamfilteringschemebycrowdsourcing |
_version_ |
1725071766551789568 |