A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification

Bitcoin is one of the most popular decentralized cryptocurrencies to date. However, it has been widely reported that it can be used for investment scams, which are referred to as high yield investment programs (HYIP). Although from the security forensic point of view it is very important to identify...

Full description

Bibliographic Details
Main Authors: Kentaroh Toyoda, P. Takis Mathiopoulos, Tomoaki Ohtsuki
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8731919/
id doaj-010cc123c53844fd897570feba20c13f
record_format Article
spelling doaj-010cc123c53844fd897570feba20c13f2021-03-29T23:44:04ZengIEEEIEEE Access2169-35362019-01-017748357484810.1109/ACCESS.2019.29210878731919A Novel Methodology for HYIP Operators’ Bitcoin Addresses IdentificationKentaroh Toyoda0https://orcid.org/0000-0002-6233-3121P. Takis Mathiopoulos1Tomoaki Ohtsuki2Agency for Science, Technology, and Research (A*STAR), Singapore Institute of Manufacturing Technology, SingaporeDepartment of Informatics and Telecommunications, National and Kapodistrian University of Athens, GreeceDepartment of Information and Computer Science, Keio University, Kanagawa, JapanBitcoin is one of the most popular decentralized cryptocurrencies to date. However, it has been widely reported that it can be used for investment scams, which are referred to as high yield investment programs (HYIP). Although from the security forensic point of view it is very important to identify the HYIP operators' Bitcoin addresses, so far in the open technical literature no systematic method which reliably collects and identifies such Bitcoin addresses has been proposed. In this paper, a novel methodology is introduced, which efficiently collects a large number of the HYIP operators' Bitcoin addresses and identifies them based upon a novel analysis of their transactions history. In particular, a scraping-based method is first proposed which is able to collect more than 2,000 HYIP operators' Bitcoin addresses from the Internet thus providing a large number of the HYIPs' samples. Second, a supervised machine learning technique, which classifies, whether or not, specific Bitcoin addresses belong to the HYIP operators, is introduced and its performance is evaluated. The proposed classification method is based upon two novel approaches, namely the rate conversion technique that mitigates the effect of Bitcoin price volatility and the sampling technique that reduces the computational amount without sacrificing the classification performance. By employing close to 30,000 real Bitcoin addresses, extensive performance evaluation results obtained by means of computer simulation experiments have shown that the proposed methodology achieves excellent performance, i.e., 95% of the HYIP addresses can be correctly classified, while maintaining a false positive rate less than 4.9%. In order to further validate the proposed classifier's ability to detect the HYIP operators' Bitcoin addresses, our designed classifier has been tested against a recently published list of the HYIP addresses maintaining its excellent detection accuracy by achieving a 93.75% success rate.https://ieeexplore.ieee.org/document/8731919/Bitcoinblockchain analysisforensicsdata miningHYIP (high yield investment programs)
collection DOAJ
language English
format Article
sources DOAJ
author Kentaroh Toyoda
P. Takis Mathiopoulos
Tomoaki Ohtsuki
spellingShingle Kentaroh Toyoda
P. Takis Mathiopoulos
Tomoaki Ohtsuki
A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification
IEEE Access
Bitcoin
blockchain analysis
forensics
data mining
HYIP (high yield investment programs)
author_facet Kentaroh Toyoda
P. Takis Mathiopoulos
Tomoaki Ohtsuki
author_sort Kentaroh Toyoda
title A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification
title_short A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification
title_full A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification
title_fullStr A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification
title_full_unstemmed A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification
title_sort novel methodology for hyip operators’ bitcoin addresses identification
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Bitcoin is one of the most popular decentralized cryptocurrencies to date. However, it has been widely reported that it can be used for investment scams, which are referred to as high yield investment programs (HYIP). Although from the security forensic point of view it is very important to identify the HYIP operators' Bitcoin addresses, so far in the open technical literature no systematic method which reliably collects and identifies such Bitcoin addresses has been proposed. In this paper, a novel methodology is introduced, which efficiently collects a large number of the HYIP operators' Bitcoin addresses and identifies them based upon a novel analysis of their transactions history. In particular, a scraping-based method is first proposed which is able to collect more than 2,000 HYIP operators' Bitcoin addresses from the Internet thus providing a large number of the HYIPs' samples. Second, a supervised machine learning technique, which classifies, whether or not, specific Bitcoin addresses belong to the HYIP operators, is introduced and its performance is evaluated. The proposed classification method is based upon two novel approaches, namely the rate conversion technique that mitigates the effect of Bitcoin price volatility and the sampling technique that reduces the computational amount without sacrificing the classification performance. By employing close to 30,000 real Bitcoin addresses, extensive performance evaluation results obtained by means of computer simulation experiments have shown that the proposed methodology achieves excellent performance, i.e., 95% of the HYIP addresses can be correctly classified, while maintaining a false positive rate less than 4.9%. In order to further validate the proposed classifier's ability to detect the HYIP operators' Bitcoin addresses, our designed classifier has been tested against a recently published list of the HYIP addresses maintaining its excellent detection accuracy by achieving a 93.75% success rate.
topic Bitcoin
blockchain analysis
forensics
data mining
HYIP (high yield investment programs)
url https://ieeexplore.ieee.org/document/8731919/
work_keys_str_mv AT kentarohtoyoda anovelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification
AT ptakismathiopoulos anovelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification
AT tomoakiohtsuki anovelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification
AT kentarohtoyoda novelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification
AT ptakismathiopoulos novelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification
AT tomoakiohtsuki novelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification
_version_ 1724188942446624768