Prediction of IPL Match Outcome Using Machine Lear
ABSTRACT
Cricket is India's most popular sport and is played across the nation in formats such as T20, ODI, and Test.
The Indian Premier League (IPL) is a national cricket league whose players are drawn from India's regional teams,
the national team, and international teams. Live streaming, radio, and TV broadcasts have made the league
popular among cricket fans. Predicting the outcome of IPL matches is of considerable interest to online traders
and sponsors. The outcome of a match between two teams can be predicted from factors such as team composition,
the batting and bowling averages of each player in the team, and the team's success in its previous matches, in
addition to traditional factors such as the toss, the venue, whether the match is day-night, and the probability
of winning by batting first at a given venue against a specific team. In this paper, we propose a model for
predicting the outcome of IPL matches using the machine learning algorithms SVM, Random Forest Classifier (RFC),
Logistic Regression, and K-Nearest Neighbor. Experimental results show that the Random Forest algorithm
outperforms the other algorithms, with an accuracy of 88.10%.
Keywords: Cricket, Indian Premier League, Logistic Regression, Machine Learning, Prediction of match
outcome, Random Forest Classifier.
Atlantis Highlights in Computer Sciences, volume 4
3.2.1.1. Teamwise home and away dataset

The Teamwise Home and Away dataset contains six columns: home_wins, away_wins, home_matches, away_matches,
home_win_percentage and away_win_percentage. It describes each team's performance in home and away
conditions, together with the corresponding win percentages. Table 1 shows the dataset columns and their
descriptions [22].

Table 1. Teamwise Home and Away dataset description

Column name            Column description
home_wins              Number of matches won by a team at its home ground.
away_wins              Number of matches won by a team at grounds other than its home ground.
home_matches           Number of matches played by a team at its home ground.
away_matches           Number of matches played by a team at grounds other than its home ground.
home_win_percentage    Percentage of matches won when played at the home ground.
away_win_percentage    Percentage of matches won when played at grounds other than the home ground.

3.2.1.2. Matches dataset

The Matches dataset contains 16 columns: season, city, date, team1, team2, toss_winner, toss_decision,
result, dl_applied, winner, win_by_runs, win_by_wickets, player_of_the_match, venue, umpire1 and umpire2.
This dataset describes the matches played between two teams and records the winner of each match. It
also records the toss decision taken in the match. Table 2 shows the dataset columns and their
descriptions [23].

Table 2. Matches dataset description

Column name        Column description
Season             The season in which the match was played.
City               The city in which the match was played.
Date               The date on which the match was played.
Team 1             Name of the first team that played the match.
Team 2             Name of the second team that played the match.
Toss winner        Team that won the toss.
Toss decision      Decision to bat or field after winning the toss.
Result             Outcome of the match, whether normal or tie.
Dl_applied         Whether the DL method was applied or not.
Winner             Team that won the match.
Win by runs        The number of runs by which the team won.
Win by wickets     The number of wickets by which the team won.
Player of match    Name of the player who was awarded player of the match.
Venue              Stadium in which the match took place.
Umpire 1           On-field umpire name.
Umpire 2           On-field umpire name.

3.2.1.3. Player's dataset

The Player's dataset contains five columns: Player_Name, DOB, Batting_Hand, Bowling_Skill and Country.
This dataset describes each player and his bowling and batting style. Table 3 shows the dataset
columns and their descriptions.

Table 3. Player's dataset description

Column name      Column description
Player Name      Name of the player.
DOB              Date of birth of the player.
Batting Hand     Whether the player is left-handed or right-handed.
Bowling skill    Bowling style of the player.
Country          Name of the country to which the player belongs.
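
As an illustration of how the columns described in Tables 1 and 2 relate to one another, the following minimal sketch computes a win percentage from a wins count and a matches count, and the share of decided matches in which the toss winner also won. It is not the authors' code. The helper names, the sample rows, the 0 to 100 percentage scale, and the use of an empty winner value for a match without a result are all assumptions made for this example.

```python
def win_percentage(wins, matches):
    """Return wins as a percentage of matches (0.0 if no matches were played)."""
    if matches == 0:
        return 0.0
    return 100.0 * wins / matches


def toss_winner_win_rate(matches):
    """Fraction of decided matches in which the toss winner also won the match."""
    # Assumption: an empty 'winner' value marks a match with no result.
    decided = [m for m in matches if m["winner"]]
    if not decided:
        return 0.0
    same = sum(1 for m in decided if m["toss_winner"] == m["winner"])
    return same / len(decided)


# Hypothetical rows that reuse the column names from Tables 1 and 2.
team_row = {"home_wins": 3, "home_matches": 4, "away_wins": 1, "away_matches": 4}
sample_matches = [
    {"team1": "A", "team2": "B", "toss_winner": "A", "winner": "A"},
    {"team1": "A", "team2": "C", "toss_winner": "C", "winner": "A"},
    {"team1": "B", "team2": "C", "toss_winner": "B", "winner": "B"},
    {"team1": "B", "team2": "A", "toss_winner": "A", "winner": ""},  # no result
]

home_pct = win_percentage(team_row["home_wins"], team_row["home_matches"])
away_pct = win_percentage(team_row["away_wins"], team_row["away_matches"])
toss_rate = toss_winner_win_rate(sample_matches)
```

Quantities of this kind could serve as input features for the classifiers, alongside the toss and venue columns.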
3.2.1.4. Teams dataset

The Teams dataset contains a single column, team1, which lists the IPL teams. Table 4
shows the dataset column and its description.

player_dismissed    If the player was given out or not
dismissal_kind      What kind of dismissal it was
Fielder             Player who caused dismissal
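
The dismissal columns listed above (player_dismissed, dismissal_kind and fielder) lend themselves to simple aggregation. The sketch below counts dismissals by kind. The function name and sample rows are hypothetical, and it assumes that an empty player_dismissed value means no wicket fell on that delivery, which the source does not state.

```python
from collections import Counter


def dismissal_counts(deliveries):
    """Count dismissals by kind, skipping deliveries on which no player was out."""
    # Assumption: an empty 'player_dismissed' value means no dismissal occurred.
    return Counter(
        d["dismissal_kind"] for d in deliveries if d.get("player_dismissed")
    )


# Hypothetical ball-by-ball rows; names and values are for illustration only.
sample_deliveries = [
    {"player_dismissed": "", "dismissal_kind": "", "fielder": ""},
    {"player_dismissed": "Batter X", "dismissal_kind": "caught", "fielder": "Fielder Y"},
    {"player_dismissed": "Batter Z", "dismissal_kind": "bowled", "fielder": ""},
    {"player_dismissed": "Batter W", "dismissal_kind": "caught", "fielder": "Fielder V"},
]

counts = dismissal_counts(sample_deliveries)
```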
4. EXPERIMENTAL RESULTS
Selection of Optimal sports Team based on the Players Performance, Proceedings of the Fifth
International Conference on Communication and Electronics Systems (ICCES 2020), IEEE
Conference Record, ISBN: 978-1-7281-5371-1, 2020.
[14] Balasundaram A, Ashokkkumar S, Jayashree D, Magesh Kumar S, Data Mining based
Classification of Players in Game of Cricket, Proceedings of the International Conference on
Smart Electronics and Communication (ICOSEC 2020), IEEE Xplore Part Number:
CFP20V90-ART; ISBN: 978-1-7281-5461-9.
[15] Jalaz Kumar, Rajeev Kumar, Pushpender Kumar, Outcome Prediction of ODI Cricket
Matches Using Decision Trees and MLP Networks, 2018 First International Conference
on Secure Cyber Computing and Communication (ICSCCC).
[16] Prabu, S., Balamurugan Velan, F. V. Jayasudha, P. Visu, and K. Janarthanan. "Mobile
technologies for contact tracing and prevention of COVID-19 positive cases: a cross-sectional
study." International Journal of Pervasive Computing and Communications (2020).
[17] Subramani, Prabu, K. Srinivas, R. Sujatha, and B. D. Parameshachari. "Prediction of muscular
paralysis disease based on hybrid feature extraction with machine learning technique for
COVID-19 and post-COVID-19 patients." Personal and Ubiquitous Computing (2021): 1-14.
[18] Do, Dinh-Thuan, Tu Anh Le, Tu N. Nguyen, Xingwang Li, and Khaled M. Rabie. "Joint
impacts of imperfect CSI and imperfect SIC in cognitive radio-assisted NOMA-V2X
communications." IEEE Access 8 (2020): 128629-128645.
[19] Le, Ngoc Tuyen, Jing-Wein Wang, Duc Huy Le, Chih-Chiang Wang, and Tu N. Nguyen.
"Fingerprint enhancement based on tensor of wavelet subbands for classification." IEEE
Access 8 (2020): 6602-6615.
[20] Nguyen, Tu N., Bing-Hong Liu, Nam P. Nguyen, and Jung-Te Chou. "Cyber security of smart grid:
attacks and defenses." In ICC 2020-2020 IEEE International Conference on Communications
(ICC), pp. 1-6. IEEE, 2020.
[21] Kumar, M. Keerthi, B. D. Parameshachari, S. Prabu, and Silvia liberata Ullo. "Comparative
Analysis to Identify Efficient Technique for Interfacing BCI System." In IOP Conference
Series: Materials Science and Engineering, vol. 925, no. 1, p. 012062. IOP Publishing, 2020.
[22] Rajendran, Ganesh B., Uma M. Kumarasamy, Chiara Zarro, Parameshachari B. Divakarachari,
and Silvia L. Ullo. "Land-use and land-cover classification using a human group-based particle
swarm optimization algorithm with an LSTM Classifier on hybrid pre-processing remote-sensing
images." Remote Sensing 12, no. 24 (2020): 4135.
[23] Puttamadappa, C., and B. D. Parameshachari. "Demand side management of small scale loads
in a smart grid using glow-worm swarm optimization technique." Microprocessors and
Microsystems 71 (2019): 102886.
[24] K. Yu, L. Tan, X. Shang, J. Huang, G. Srivastava and P. Chatterjee, "Efficient and
Privacy-Preserving Medical Research Support Platform Against COVID-19: A Blockchain-Based
Approach", IEEE Consumer Electronics Magazine, doi: 10.1109/MCE.2020.3035520.
[25] K. Yu, M. Arifuzzaman, Z. Wen, D. Zhang and T. Sato, "A Key Management Scheme for Secure
Communications of Information Centric Advanced Metering Infrastructure in Smart
Grid," IEEE Transactions on Instrumentation and Measurement, vol. 64, no. 8, pp. 2072-2085,
August 2015. https://ieeexplore.ieee.org/document/7138617
[26] K. Yu, L. Lin, M. Alazab, L. Tan, B. Gu, "Deep Learning-Based Traffic Safety Solution for a
Mixture of Autonomous and Manual Vehicles in a 5G-Enabled Intelligent Transportation System",
IEEE Transactions on Intelligent Transportation Systems, doi: 10.1109/TITS.2020.3042504.
[27] K. Yu, L. Tan, M. Aloqaily, H. Yang, and Y. Jararweh, "Blockchain-Enhanced Data Sharing
with Traceable and Direct Revocation in IIoT", IEEE Transactions on Industrial Informatics,
doi: 10.1109/TII.2021.3049141.
[28] Z. Guo, K. Yu, Y. Li, G. Srivastava, and J. C.-W. Lin, "Deep Learning-Embedded Social
Internet of Things for Ambiguity-Aware Social Recommendations", IEEE Transactions on
Network Science and Engineering, doi: 10.1109/TNSE.2021.3049262.
[29] Z. Guo, A. K. Bashir, K. Yu, J. C. Lin, Y. Shen, "Graph Embedding-based Intelligent Industrial
Decision for Complex Sewage Treatment Processes", International Journal of Intelligent
Systems, 2021, doi: 10.1002/int.22540.