Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec

The rapid development of information technologies like Internet of Things, Big Data, Artificial Intelligence, Blockchain, etc., has profoundly affected people’s consumption behaviors and changed the development model of the financial industry. The financial services on Internet and IoT wi...

Full description

Bibliographic Details
Main Authors: Hangjun Zhou, Guang Sun, Sha Fu, Linli Wang, Juan Hu, Ying Gao
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9363921/
id doaj-a0838d9c0f6640fbb079d2fe02c4f7e7
record_format Article
spelling doaj-a0838d9c0f6640fbb079d2fe02c4f7e72021-03-30T15:10:56ZengIEEEIEEE Access2169-35362021-01-019433784338610.1109/ACCESS.2021.30624679363921Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vecHangjun Zhou0https://orcid.org/0000-0001-5860-8404Guang Sun1Sha Fu2Linli Wang3Juan Hu4Ying Gao5Department of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaDepartment of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaDepartment of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaDepartment of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaDepartment of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaDepartment of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaThe rapid development of information technologies like Internet of Things, Big Data, Artificial Intelligence, Blockchain, etc., has profoundly affected people’s consumption behaviors and changed the development model of the financial industry. The financial services on Internet and IoT with new technologies has provided convenience and efficiency for consumers, but new hidden fraud risks are generated also. Fraud, arbitrage, vicious collection, etc., have caused bad effects and huge losses to the development of finance on Internet and IoT. However, as the scale of financial data continues to increase dramatically, it is more and more difficult for existing rule-based expert systems and traditional machine learning model systems to detect financial frauds from large-scale historical data. In the meantime, as the degree of specialization of financial fraud continues to increase, fraudsters can evade fraud detection by frequently changing their fraud methods. In this article, an intelligent and distributed Big Data approach for Internet financial fraud detections is proposed to implement graph embedding algorithm Node2Vec to learn and represent the topological features in the financial network graph into low-dimensional dense vectors, so as to intelligently and efficiently classify and predict the data samples of the large-scale dataset with the deep neural network. The approach is distributedly performed on the clusters of Apache Spark GraphX and Hadoop to process the large dataset in parallel. The groups of experimental results demonstrate that the proposed approach can improve the efficiency of Internet financial fraud detections with better precision rate, recall rate, F1-Score and F2-Score.https://ieeexplore.ieee.org/document/9363921/Internet of Things (IoT)Internet financefraud detectiongraph embedding algorithmNode2Vec
collection DOAJ
language English
format Article
sources DOAJ
author Hangjun Zhou
Guang Sun
Sha Fu
Linli Wang
Juan Hu
Ying Gao
spellingShingle Hangjun Zhou
Guang Sun
Sha Fu
Linli Wang
Juan Hu
Ying Gao
Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec
IEEE Access
Internet of Things (IoT)
Internet finance
fraud detection
graph embedding algorithm
Node2Vec
author_facet Hangjun Zhou
Guang Sun
Sha Fu
Linli Wang
Juan Hu
Ying Gao
author_sort Hangjun Zhou
title Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec
title_short Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec
title_full Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec
title_fullStr Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec
title_full_unstemmed Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec
title_sort internet financial fraud detection based on a distributed big data approach with node2vec
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description The rapid development of information technologies like Internet of Things, Big Data, Artificial Intelligence, Blockchain, etc., has profoundly affected people’s consumption behaviors and changed the development model of the financial industry. The financial services on Internet and IoT with new technologies has provided convenience and efficiency for consumers, but new hidden fraud risks are generated also. Fraud, arbitrage, vicious collection, etc., have caused bad effects and huge losses to the development of finance on Internet and IoT. However, as the scale of financial data continues to increase dramatically, it is more and more difficult for existing rule-based expert systems and traditional machine learning model systems to detect financial frauds from large-scale historical data. In the meantime, as the degree of specialization of financial fraud continues to increase, fraudsters can evade fraud detection by frequently changing their fraud methods. In this article, an intelligent and distributed Big Data approach for Internet financial fraud detections is proposed to implement graph embedding algorithm Node2Vec to learn and represent the topological features in the financial network graph into low-dimensional dense vectors, so as to intelligently and efficiently classify and predict the data samples of the large-scale dataset with the deep neural network. The approach is distributedly performed on the clusters of Apache Spark GraphX and Hadoop to process the large dataset in parallel. The groups of experimental results demonstrate that the proposed approach can improve the efficiency of Internet financial fraud detections with better precision rate, recall rate, F1-Score and F2-Score.
topic Internet of Things (IoT)
Internet finance
fraud detection
graph embedding algorithm
Node2Vec
url https://ieeexplore.ieee.org/document/9363921/
work_keys_str_mv AT hangjunzhou internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec
AT guangsun internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec
AT shafu internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec
AT linliwang internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec
AT juanhu internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec
AT yinggao internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec
_version_ 1724179898137837568