Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training

Fake reviews may mislead consumers. A large number of fake reviews will even cause huge property losses and public opinion crises. Therefore, it is necessary to detect and filter fake reviews. However, most existing methods have lower accuracy in detecting fake reviews due to they just use single fe...

Full description

Bibliographic Details
Main Authors: Jingdong Wang, Haitao Kan, Fanqi Meng, Qizi Mu, Genhua Shi, Xixi Xiao
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9212374/
id doaj-7ee943acb3d640dcb5caab8246cdcb5d
record_format Article
spelling doaj-7ee943acb3d640dcb5caab8246cdcb5d2021-03-30T04:23:14ZengIEEEIEEE Access2169-35362020-01-01818262518263910.1109/ACCESS.2020.30285889212374Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative TrainingJingdong Wang0https://orcid.org/0000-0001-7037-951XHaitao Kan1https://orcid.org/0000-0002-6493-4668Fanqi Meng2Qizi Mu3https://orcid.org/0000-0001-7079-4182Genhua Shi4Xixi Xiao5https://orcid.org/0000-0002-1317-7820School of Computer Science, Northeast Electric Power University, Jilin City, ChinaSchool of Computer Science, Northeast Electric Power University, Jilin City, ChinaSchool of Computer Science, Northeast Electric Power University, Jilin City, ChinaSchool of Computer Science, Northeast Electric Power University, Jilin City, ChinaSchool of Computer Science, Northeast Electric Power University, Jilin City, ChinaSchool of Computer Science, Northeast Electric Power University, Jilin City, ChinaFake reviews may mislead consumers. A large number of fake reviews will even cause huge property losses and public opinion crises. Therefore, it is necessary to detect and filter fake reviews. However, most existing methods have lower accuracy in detecting fake reviews due to they just use single features and lack of labeled experimental data. To solve this problem, we propose a novelty method to detect fake reviews based on multiple feature fusion and rolling collaborative training. First, the method requires an initial index system with multiple features such as text features, sentiment features of reviews and behavior features of reviewers. Second, the method needs an initial training sample set. Thus, we designed related algorithms to extract all the features of a review. Then the classification of the review is labeled manually. Finally, the method uses the initial sample set to train 7 classifiers, and the most accurate classifier will be selected to classify new reviews. The novelty of the method lies in that the features and the classification labels of the new reviews will be added into the initial sample set as new samples. So the size of the sample set will increase automatically. The experimental results in the reviews of yelp shopping website show that the accuracy of the proposed method for detecting fake reviews is 84.45%, which is 3.5% higher than the baseline methods. And compared with the latest deep learning model, its baseline precision has increased by 5.3%. According to the Friedman test, the support vector machine (SVM) classifier and random forest (RF) classifier has been proven to be the best one by statistical means. It means our method which uses multiple features has higher accuracy than the baseline models. Meanwhile, it also resolves the problem of lacking labeled training samples in fake reviews detection.https://ieeexplore.ieee.org/document/9212374/Fake review detectionmachine learningmultiple feature fusionfeature extractionrolling collaborative training
collection DOAJ
language English
format Article
sources DOAJ
author Jingdong Wang
Haitao Kan
Fanqi Meng
Qizi Mu
Genhua Shi
Xixi Xiao
spellingShingle Jingdong Wang
Haitao Kan
Fanqi Meng
Qizi Mu
Genhua Shi
Xixi Xiao
Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
IEEE Access
Fake review detection
machine learning
multiple feature fusion
feature extraction
rolling collaborative training
author_facet Jingdong Wang
Haitao Kan
Fanqi Meng
Qizi Mu
Genhua Shi
Xixi Xiao
author_sort Jingdong Wang
title Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
title_short Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
title_full Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
title_fullStr Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
title_full_unstemmed Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
title_sort fake review detection based on multiple feature fusion and rolling collaborative training
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Fake reviews may mislead consumers. A large number of fake reviews will even cause huge property losses and public opinion crises. Therefore, it is necessary to detect and filter fake reviews. However, most existing methods have lower accuracy in detecting fake reviews due to they just use single features and lack of labeled experimental data. To solve this problem, we propose a novelty method to detect fake reviews based on multiple feature fusion and rolling collaborative training. First, the method requires an initial index system with multiple features such as text features, sentiment features of reviews and behavior features of reviewers. Second, the method needs an initial training sample set. Thus, we designed related algorithms to extract all the features of a review. Then the classification of the review is labeled manually. Finally, the method uses the initial sample set to train 7 classifiers, and the most accurate classifier will be selected to classify new reviews. The novelty of the method lies in that the features and the classification labels of the new reviews will be added into the initial sample set as new samples. So the size of the sample set will increase automatically. The experimental results in the reviews of yelp shopping website show that the accuracy of the proposed method for detecting fake reviews is 84.45%, which is 3.5% higher than the baseline methods. And compared with the latest deep learning model, its baseline precision has increased by 5.3%. According to the Friedman test, the support vector machine (SVM) classifier and random forest (RF) classifier has been proven to be the best one by statistical means. It means our method which uses multiple features has higher accuracy than the baseline models. Meanwhile, it also resolves the problem of lacking labeled training samples in fake reviews detection.
topic Fake review detection
machine learning
multiple feature fusion
feature extraction
rolling collaborative training
url https://ieeexplore.ieee.org/document/9212374/
work_keys_str_mv AT jingdongwang fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining
AT haitaokan fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining
AT fanqimeng fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining
AT qizimu fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining
AT genhuashi fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining
AT xixixiao fakereviewdetectionbasedonmultiplefeaturefusionandrollingcollaborativetraining
_version_ 1724181889071185920