Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec
The rapid development of information technologies like Internet of Things, Big Data, Artificial Intelligence, Blockchain, etc., has profoundly affected people’s consumption behaviors and changed the development model of the financial industry. The financial services on Internet and IoT wi...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9363921/ |
id |
doaj-a0838d9c0f6640fbb079d2fe02c4f7e7 |
---|---|
record_format |
Article |
spelling |
doaj-a0838d9c0f6640fbb079d2fe02c4f7e72021-03-30T15:10:56ZengIEEEIEEE Access2169-35362021-01-019433784338610.1109/ACCESS.2021.30624679363921Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vecHangjun Zhou0https://orcid.org/0000-0001-5860-8404Guang Sun1Sha Fu2Linli Wang3Juan Hu4Ying Gao5Department of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaDepartment of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaDepartment of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaDepartment of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaDepartment of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaDepartment of Information Technology and Management, Hunan University of Finance and Economics, Changsha, ChinaThe rapid development of information technologies like Internet of Things, Big Data, Artificial Intelligence, Blockchain, etc., has profoundly affected people’s consumption behaviors and changed the development model of the financial industry. The financial services on Internet and IoT with new technologies has provided convenience and efficiency for consumers, but new hidden fraud risks are generated also. Fraud, arbitrage, vicious collection, etc., have caused bad effects and huge losses to the development of finance on Internet and IoT. However, as the scale of financial data continues to increase dramatically, it is more and more difficult for existing rule-based expert systems and traditional machine learning model systems to detect financial frauds from large-scale historical data. In the meantime, as the degree of specialization of financial fraud continues to increase, fraudsters can evade fraud detection by frequently changing their fraud methods. In this article, an intelligent and distributed Big Data approach for Internet financial fraud detections is proposed to implement graph embedding algorithm Node2Vec to learn and represent the topological features in the financial network graph into low-dimensional dense vectors, so as to intelligently and efficiently classify and predict the data samples of the large-scale dataset with the deep neural network. The approach is distributedly performed on the clusters of Apache Spark GraphX and Hadoop to process the large dataset in parallel. The groups of experimental results demonstrate that the proposed approach can improve the efficiency of Internet financial fraud detections with better precision rate, recall rate, F1-Score and F2-Score.https://ieeexplore.ieee.org/document/9363921/Internet of Things (IoT)Internet financefraud detectiongraph embedding algorithmNode2Vec |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hangjun Zhou Guang Sun Sha Fu Linli Wang Juan Hu Ying Gao |
spellingShingle |
Hangjun Zhou Guang Sun Sha Fu Linli Wang Juan Hu Ying Gao Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec IEEE Access Internet of Things (IoT) Internet finance fraud detection graph embedding algorithm Node2Vec |
author_facet |
Hangjun Zhou Guang Sun Sha Fu Linli Wang Juan Hu Ying Gao |
author_sort |
Hangjun Zhou |
title |
Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec |
title_short |
Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec |
title_full |
Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec |
title_fullStr |
Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec |
title_full_unstemmed |
Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec |
title_sort |
internet financial fraud detection based on a distributed big data approach with node2vec |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
The rapid development of information technologies like Internet of Things, Big Data, Artificial Intelligence, Blockchain, etc., has profoundly affected people’s consumption behaviors and changed the development model of the financial industry. The financial services on Internet and IoT with new technologies has provided convenience and efficiency for consumers, but new hidden fraud risks are generated also. Fraud, arbitrage, vicious collection, etc., have caused bad effects and huge losses to the development of finance on Internet and IoT. However, as the scale of financial data continues to increase dramatically, it is more and more difficult for existing rule-based expert systems and traditional machine learning model systems to detect financial frauds from large-scale historical data. In the meantime, as the degree of specialization of financial fraud continues to increase, fraudsters can evade fraud detection by frequently changing their fraud methods. In this article, an intelligent and distributed Big Data approach for Internet financial fraud detections is proposed to implement graph embedding algorithm Node2Vec to learn and represent the topological features in the financial network graph into low-dimensional dense vectors, so as to intelligently and efficiently classify and predict the data samples of the large-scale dataset with the deep neural network. The approach is distributedly performed on the clusters of Apache Spark GraphX and Hadoop to process the large dataset in parallel. The groups of experimental results demonstrate that the proposed approach can improve the efficiency of Internet financial fraud detections with better precision rate, recall rate, F1-Score and F2-Score. |
topic |
Internet of Things (IoT) Internet finance fraud detection graph embedding algorithm Node2Vec |
url |
https://ieeexplore.ieee.org/document/9363921/ |
work_keys_str_mv |
AT hangjunzhou internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec AT guangsun internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec AT shafu internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec AT linliwang internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec AT juanhu internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec AT yinggao internetfinancialfrauddetectionbasedonadistributedbigdataapproachwithnode2vec |
_version_ |
1724179898137837568 |