Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training

Fake reviews may mislead consumers. A large number of fake reviews will even cause huge property losses and public opinion crises. Therefore, it is necessary to detect and filter fake reviews. However, most existing methods have lower accuracy in detecting fake reviews due to they just use single fe...

Full description

Bibliographic Details
Main Authors:	Jingdong Wang, Haitao Kan, Fanqi Meng, Qizi Mu, Genhua Shi, Xixi Xiao
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Fake review detection machine learning multiple feature fusion feature extraction rolling collaborative training
Online Access:	https://ieeexplore.ieee.org/document/9212374/

id	doaj-7ee943acb3d640dcb5caab8246cdcb5d
record_format	Article
spelling	doaj-7ee943acb3d640dcb5caab8246cdcb5d2021-03-30T04:23:14ZengIEEEIEEE Access2169-35362020-01-01818262518263910.1109/ACCESS.2020.30285889212374Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative TrainingJingdong Wang0https://orcid.org/0000-0001-7037-951XHaitao Kan1https://orcid.org/0000-0002-6493-4668Fanqi Meng2Qizi Mu3https://orcid.org/0000-0001-7079-4182Genhua Shi4Xixi Xiao5https://orcid.org/0000-0002-1317-7820School of Computer Science, Northeast Electric Power University, Jilin City, ChinaSchool of Computer Science, Northeast Electric Power University, Jilin City, ChinaSchool of Computer Science, Northeast Electric Power University, Jilin City, ChinaSchool of Computer Science, Northeast Electric Power University, Jilin City, ChinaSchool of Computer Science, Northeast Electric Power University, Jilin City, ChinaSchool of Computer Science, Northeast Electric Power University, Jilin City, ChinaFake reviews may mislead consumers. A large number of fake reviews will even cause huge property losses and public opinion crises. Therefore, it is necessary to detect and filter fake reviews. However, most existing methods have lower accuracy in detecting fake reviews due to they just use single features and lack of labeled experimental data. To solve this problem, we propose a novelty method to detect fake reviews based on multiple feature fusion and rolling collaborative training. First, the method requires an initial index system with multiple features such as text features, sentiment features of reviews and behavior features of reviewers. Second, the method needs an initial training sample set. Thus, we designed related algorithms to extract all the features of a review. Then the classification of the review is labeled manually. Finally, the method uses the initial sample set to train 7 classifiers, and the most accurate classifier will be selected to classify new reviews. The novelty of the method lies in that the features and the classification labels of the new reviews will be added into the initial sample set as new samples. So the size of the sample set will increase automatically. The experimental results in the reviews of yelp shopping website show that the accuracy of the proposed method for detecting fake reviews is 84.45%, which is 3.5% higher than the baseline methods. And compared with the latest deep learning model, its baseline precision has increased by 5.3%. According to the Friedman test, the support vector machine (SVM) classifier and random forest (RF) classifier has been proven to be the best one by statistical means. It means our method which uses multiple features has higher accuracy than the baseline models. Meanwhile, it also resolves the problem of lacking labeled training samples in fake reviews detection.https://ieeexplore.ieee.org/document/9212374/Fake review detectionmachine learningmultiple feature fusionfeature extractionrolling collaborative training
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Jingdong Wang Haitao Kan Fanqi Meng Qizi Mu Genhua Shi Xixi Xiao
spellingShingle	Jingdong Wang Haitao Kan Fanqi Meng Qizi Mu Genhua Shi Xixi Xiao Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training IEEE Access Fake review detection machine learning multiple feature fusion feature extraction rolling collaborative training
author_facet	Jingdong Wang Haitao Kan Fanqi Meng Qizi Mu Genhua Shi Xixi Xiao
author_sort	Jingdong Wang
title	Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
title_short	Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
title_full	Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
title_fullStr	Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
title_full_unstemmed	Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
title_sort	fake review detection based on multiple feature fusion and rolling collaborative training
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	Fake reviews may mislead consumers. A large number of fake reviews will even cause huge property losses and public opinion crises. Therefore, it is necessary to detect and filter fake reviews. However, most existing methods have lower accuracy in detecting fake reviews due to they just use single features and lack of labeled experimental data. To solve this problem, we propose a novelty method to detect fake reviews based on multiple feature fusion and rolling collaborative training. First, the method requires an initial index system with multiple features such as text features, sentiment features of reviews and behavior features of reviewers. Second, the method needs an initial training sample set. Thus, we designed related algorithms to extract all the features of a review. Then the classification of the review is labeled manually. Finally, the method uses the initial sample set to train 7 classifiers, and the most accurate classifier will be selected to classify new reviews. The novelty of the method lies in that the features and the classification labels of the new reviews will be added into the initial sample set as new samples. So the size of the sample set will increase automatically. The experimental results in the reviews of yelp shopping website show that the accuracy of the proposed method for detecting fake reviews is 84.45%, which is 3.5% higher than the baseline methods. And compared with the latest deep learning model, its baseline precision has increased by 5.3%. According to the Friedman test, the support vector machine (SVM) classifier and random forest (RF) classifier has been proven to be the best one by statistical means. It means our method which uses multiple features has higher accuracy than the baseline models. Meanwhile, it also resolves the problem of lacking labeled training samples in fake reviews detection.
topic	Fake review detection machine learning multiple feature fusion feature extraction rolling collaborative training
url	https://ieeexplore.ieee.org/document/9212374/
work_keys_str_mv	AT jingdongwang fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining AT haitaokan fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining AT fanqimeng fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining AT qizimu fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining AT genhuashi fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining AT xixixiao fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining
_version_	1724181889071185920

Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training

Similar Items