Transaction Analysis On Ethereum Network Using Machine Learning A Proposed System
Transaction Analysis On Ethereum Network Using Machine Learning A Proposed System
ISSN No:-2456-2165
Abstract:- Ethereum, a well-known blockchain’s most Ethereum, breeding ground for various cybercrimes due to its
famous implementation builds and deploys a decentralized advantages of anonymity and low transfer costs, as well as its
application where users can transact with cryptocurrency huge economic value has been increased [4]. An initial-coin-
using smart contracts. The latest technological offering (ICO) provides funding technique for block-chain
developments in cryptocurrencies and the benefits applications where investors receive tokens in exchange of their
associated with them have been hidden by a number of monetary investment. However, it is currently reported that
illegal activities on the network. Like bribery, phishing more than 10% of ICOs issued on Ethereum are victims of
scam, money laundering and fraud etc. The ‘Pseudo- various types of fraud, including phishing scams; Ponzi based
anonymous’ nature of the participants in Ethereum crimes, etc. [5]. Detailed report by analysis, source
blockchain network leads to cause difficulty in detecting investigative & safety management software for digital cash or
illicit behavior of the users. Anomalies must be detected and virtual-currencies, in 2021, crypto-based crime hit a new high
resolved quickly to ensure participant trust in the largest with $14 billion in illegal addresses during the year, which is
blockchain platforms. There is a lot of work on detailed over $7.8 billion. In 2020 [6], this indicates that financial
analysis of Ethereum transactions in terms of how well they security needs to be assured as it become an important concern
work, but this research is the first to the best of my to get over block-chain environment. However, between the
knowledge to detect anomalies in Ethereum transaction many risk concerns associated with block-chain-based digital
records. To achieve our goal, we have extracted more cryptocurrencies; Fraudulent transactions attributed for over
comprehensive feature of Ethereum transaction data to get 50% of all Ethereum cybercrimes in 2017, and this type of fraud
rid of the shortcomings of existing work. We have used poses a major security threat to Ethereum trading [7]-[8]. When
SMOTE technique to deal with highly imbalanced dataset using Ethereum transaction records to detect fraudster we may
and implemented five supervised-machine-learning model, encounter following problems which may cause difficulties in
Logistic-Regression, KNN, Decision-tree, Random-Forest the detection of fraud.
and SVC classifier to access and compare the best
performer among them. Random Forest Outperforms with While dealing with real time Ethereum transaction data
accuracy of 98%. We evaluated the accuracy, precision, and detection of malicious users is very difficult, as they are very
F-value of each method and backed them up with few in such a huge transaction records, finding defaulter are like
experimental results. searching a needle in a haystack. We are experiencing the same
problem due to excessive data imbalance nature of the dataset.
Keywords:- Blockchain, Ethereum, Cryptocurrency, Smart
Contracts, Anomaly Detection. The diversity of the Ethereum trading system reflects the
diversity of addresses, such as wallets, stock exchanges, and
I. INTRODUCTION well-known ICOs [9], but the trading of typical loss accounts is
relatively small. Using up-to-date information to distinguish
Blockchain is a growing collection of blocks of digital between malicious and non-malicious email addresses can be
records of currency transactions, conceived by Satoshi very difficult in many networks.
Nakamoto [1] to create a decentralized payment system.
Recently, blockchain are in trend in computer technology, and The Ethereum fraud detection concept is a classification
blockchain technology as a whole will lead to major changes in concept in machine learning. The success of this process is
economy of nation [2]. Cryptocurrencies or digital assets are the related to the policy chosen and the accessibility of the
most vital and famous blockchain applications in terms of information provided. Feature extraction that accurately
economy. The Bitcoin project is the latest successful huge differentiates between malicious and non-malicious
implementation for the block-chain technology. Ethereum is transactions can be effectively used for classification models
considered to be the second largest blockchain application [10]-[11].The main contribution of this article is we can detect
supporting smart-contracts, cryptocurrency (Ether), are the the malicious nodes in the Ethereum network with a
second huge crypto-currency [3]. While, due to rapid growth of comparatively high probability. The Proposed method does not
𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 𝑋 𝑅𝑒𝑐𝑎𝑙𝑙
F-score= 2 𝑋 𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛+ 𝑅𝑒𝑐𝑎𝑙𝑙
A. Classification-Performance
In this Section, we first present the classification report of
each method followed by their confusion matrix, which gives
the accuracy measures of each classifier. Next we will provide
a comparison report of accuracy and f-score of selected
methods applied for classification.
Table 1. Classification-Report-of-Logistic-Regression
Precision Recall F1- Support
(%) (%) score(%)
0 97 88 92 1547
1 68 88 77 422 Table 3. Classification Report of Random Forest
Accuracy - - 83 1969 Precision Recall F1- Support
Macro Avg 82 88 85 1969 score
Weighted 90 88 89 1969 0 0.99 0.98 0.98 1547
Avg 1 0.93 0.95 0.94 422
Accuracy - - 0.98 1969
Fig.1. Confusion Matrix of Logistic Regression Macro Avg 0.96 0.97 0.96 1969
Weighted 0.97 0.97 0.89 1969
Avg