A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification
Bitcoin is one of the most popular decentralized cryptocurrencies to date. However, it has been widely reported that it can be used for investment scams, which are referred to as high yield investment programs (HYIP). Although from the security forensic point of view it is very important to identify...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8731919/ |
id |
doaj-010cc123c53844fd897570feba20c13f |
---|---|
record_format |
Article |
spelling |
doaj-010cc123c53844fd897570feba20c13f2021-03-29T23:44:04ZengIEEEIEEE Access2169-35362019-01-017748357484810.1109/ACCESS.2019.29210878731919A Novel Methodology for HYIP Operators’ Bitcoin Addresses IdentificationKentaroh Toyoda0https://orcid.org/0000-0002-6233-3121P. Takis Mathiopoulos1Tomoaki Ohtsuki2Agency for Science, Technology, and Research (A*STAR), Singapore Institute of Manufacturing Technology, SingaporeDepartment of Informatics and Telecommunications, National and Kapodistrian University of Athens, GreeceDepartment of Information and Computer Science, Keio University, Kanagawa, JapanBitcoin is one of the most popular decentralized cryptocurrencies to date. However, it has been widely reported that it can be used for investment scams, which are referred to as high yield investment programs (HYIP). Although from the security forensic point of view it is very important to identify the HYIP operators' Bitcoin addresses, so far in the open technical literature no systematic method which reliably collects and identifies such Bitcoin addresses has been proposed. In this paper, a novel methodology is introduced, which efficiently collects a large number of the HYIP operators' Bitcoin addresses and identifies them based upon a novel analysis of their transactions history. In particular, a scraping-based method is first proposed which is able to collect more than 2,000 HYIP operators' Bitcoin addresses from the Internet thus providing a large number of the HYIPs' samples. Second, a supervised machine learning technique, which classifies, whether or not, specific Bitcoin addresses belong to the HYIP operators, is introduced and its performance is evaluated. The proposed classification method is based upon two novel approaches, namely the rate conversion technique that mitigates the effect of Bitcoin price volatility and the sampling technique that reduces the computational amount without sacrificing the classification performance. By employing close to 30,000 real Bitcoin addresses, extensive performance evaluation results obtained by means of computer simulation experiments have shown that the proposed methodology achieves excellent performance, i.e., 95% of the HYIP addresses can be correctly classified, while maintaining a false positive rate less than 4.9%. In order to further validate the proposed classifier's ability to detect the HYIP operators' Bitcoin addresses, our designed classifier has been tested against a recently published list of the HYIP addresses maintaining its excellent detection accuracy by achieving a 93.75% success rate.https://ieeexplore.ieee.org/document/8731919/Bitcoinblockchain analysisforensicsdata miningHYIP (high yield investment programs) |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Kentaroh Toyoda P. Takis Mathiopoulos Tomoaki Ohtsuki |
spellingShingle |
Kentaroh Toyoda P. Takis Mathiopoulos Tomoaki Ohtsuki A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification IEEE Access Bitcoin blockchain analysis forensics data mining HYIP (high yield investment programs) |
author_facet |
Kentaroh Toyoda P. Takis Mathiopoulos Tomoaki Ohtsuki |
author_sort |
Kentaroh Toyoda |
title |
A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification |
title_short |
A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification |
title_full |
A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification |
title_fullStr |
A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification |
title_full_unstemmed |
A Novel Methodology for HYIP Operators’ Bitcoin Addresses Identification |
title_sort |
novel methodology for hyip operators’ bitcoin addresses identification |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
Bitcoin is one of the most popular decentralized cryptocurrencies to date. However, it has been widely reported that it can be used for investment scams, which are referred to as high yield investment programs (HYIP). Although from the security forensic point of view it is very important to identify the HYIP operators' Bitcoin addresses, so far in the open technical literature no systematic method which reliably collects and identifies such Bitcoin addresses has been proposed. In this paper, a novel methodology is introduced, which efficiently collects a large number of the HYIP operators' Bitcoin addresses and identifies them based upon a novel analysis of their transactions history. In particular, a scraping-based method is first proposed which is able to collect more than 2,000 HYIP operators' Bitcoin addresses from the Internet thus providing a large number of the HYIPs' samples. Second, a supervised machine learning technique, which classifies, whether or not, specific Bitcoin addresses belong to the HYIP operators, is introduced and its performance is evaluated. The proposed classification method is based upon two novel approaches, namely the rate conversion technique that mitigates the effect of Bitcoin price volatility and the sampling technique that reduces the computational amount without sacrificing the classification performance. By employing close to 30,000 real Bitcoin addresses, extensive performance evaluation results obtained by means of computer simulation experiments have shown that the proposed methodology achieves excellent performance, i.e., 95% of the HYIP addresses can be correctly classified, while maintaining a false positive rate less than 4.9%. In order to further validate the proposed classifier's ability to detect the HYIP operators' Bitcoin addresses, our designed classifier has been tested against a recently published list of the HYIP addresses maintaining its excellent detection accuracy by achieving a 93.75% success rate. |
topic |
Bitcoin blockchain analysis forensics data mining HYIP (high yield investment programs) |
url |
https://ieeexplore.ieee.org/document/8731919/ |
work_keys_str_mv |
AT kentarohtoyoda anovelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification AT ptakismathiopoulos anovelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification AT tomoakiohtsuki anovelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification AT kentarohtoyoda novelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification AT ptakismathiopoulos novelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification AT tomoakiohtsuki novelmethodologyforhyipoperatorsx2019bitcoinaddressesidentification |
_version_ |
1724188942446624768 |