0% found this document useful (0 votes)
40 views7 pages

Super Predictor of Indian Premier League (IPL) Using Various ML Techniques With Help of IBM Cloud

This document discusses using machine learning techniques to predict matches in the Indian Premier League (IPL) cricket tournament. It describes analyzing IPL match data from 2008-2019 to build predictive models and visualize performance data of teams and players. Different machine learning algorithms are tested for their ability to accurately predict IPL match outcomes.

Uploaded by

soyff7spxf
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
40 views7 pages

Super Predictor of Indian Premier League (IPL) Using Various ML Techniques With Help of IBM Cloud

This document discusses using machine learning techniques to predict matches in the Indian Premier League (IPL) cricket tournament. It describes analyzing IPL match data from 2008-2019 to build predictive models and visualize performance data of teams and players. Different machine learning algorithms are tested for their ability to accurately predict IPL match outcomes.

Uploaded by

soyff7spxf
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

Vol-8 Issue-2 2022 IJARIIE-ISSN(O)-2395-4396

Super Predictor of Indian Premier League


(IPL) using Various ML techniques with help
of IBM Cloud
[1]
K RUSHIKESH REDDY, [2] CHANDRAKANTH V,
[3]
A. PREM SAI, [4] SHIVA SUMANTH REDDY
[1][2][3]
BE Students, Department of Computer Science and Engineering
[4]
Asst. Professor, Department of Computer Science and Engineering
[1][2][3][4]
Dayananda Sagar Academy of Technology and Management, Bangalore, Karnataka, India

Abstract

Indian Premier League matches are one in every of the foremost important events in India. it's professional
cricket league in India contested by eight teams representing the various cities in India. The paper focuses on
the performance analysis of the eight contesting IPL teams supported the runs of the team, wickets, decisions
usurping winning the toss and Duckworth Lewis rule analysis. The IPL data from 2008 to 2019 is used for the
player analysis. The team performances are visualized graphically using data analytics to render the
interpretation in an exceedingly good manner. The performance data using visual analytics helps in selecting
players for future matches and provides additional information on player yet as team profiles. Almost every IPL
team’s Management use Analytics for better games. Not only team’s owners, there are several betting and
fantasy cricket platform, which are highly rely upon analytics for his or her success. Analytics can help all of
them for his or her success.

The research paper tries to predict the IPL matches using machine learning models with variables like match id,
inning, batting team, bowling team, over, ball, batsman, non-striker, bowler is super over, wide runs, bye runs,
leg by runs, bowling runs, penalty runs, batsman runs, extra runs, total runs, player dismissed, dismissal kind
fielder. to appear out the result it uses different machine learning models like statistical method model, Random
Forest. The results of the study shows that for IPL game, Teams, Venue, Winning Toss, Venue of the Match and
Decision after winning the toss are important influencers to win a match. Different Machine Learning helps to
predict outcome of a match. Right selection of Machine Learning Model helps to extend Accuracy of Prediction.
From Different Classification Models, Support Vector Machine, Decision Tree and Random Forest are best to
predict outcome of an IPL games. All of the subsequent gives almost 88% accuracy Level. The study has been
conducted from data of Kaggle. Secondary data has been used for the analysis.

I. INTRODUCTION
A. Overview of our project:
Creating a legendary milestone in the world of a sport like cricket, the BCCI introduced Indian Premier
League (IPL). As the volume of cricket enthusiasts across the globe increases, the curiosity on prediction of the
future matches led to creation of many websites like Dream11, IPL fantasy league etc. IPL predictions can be
right or wrong. Sports analytics is not new in cricket. Cricket being an attractive and profitable sports, there are
so many stakeholders, involved in these sports. That’s why decision-making process is very critical in cricket.
Analytics is a supporting pillar for every decision maker in this game. The IPL predictor visualizes useful
insights [4] and predicts outputs for instances provided by the user. The product caters to the needs of sports
analysts, broadcasters, sports enthusiasts, business corporates [3], team management and so on [3]. The
objective of this research is to observe impact of different Machine learning models in Prediction of an IPL
match. Another objective of this study is to explore information, pattern related to Matches, Player etc. using
descriptive analysis so as to increase the decision-making effectiveness. The main objective of our project is to
analyse the IPL statistics of various matches and teams. Dashboards [4] are created for individual teams to
provide an insight of their performance from the beginning of IPL. Future scores pertaining to a match are
predicted using our web application. The project, developed as a web application, will satisfy the needs of a
cricket expert as well as a beginner.

16293 ijariie.com 1156


Vol-8 Issue-2 2022 IJARIIE-ISSN(O)-2395-4396

II. SCOPE OF THE PROJECT


IPL predictor provides useful insights obtained by performing exploratory and predictive analysis. The
product serves both commercial and entertainment purpose. The product is user friendly where the insights are
presented in comprehensible manner and the user can navigate freely through the pages, improving the
reliability of the product. Features present in the product will give the user a better knowledge about IPL and
features incorporated are:
A. Analysis about overview of IPL:
Set of eleven questions were framed in such a way that it covers all the aspects of the game. The answers for
the questions are found by performing exploratory analysis [1] on the dataset containing attributes like (toss
winner, batting team, bowling toss, match winner and so on) with the help of SQL queries and python. The
insights obtained are visualized in the form of graph using Cognos feature provided by IBM. .[1][2][3][9]
B. Team wise Analysis:
Team wise analysis provides insights about the strengths and weaknesses of the team. The analysis includes
happenings in different phases of game, highest run scorer and highest wicket taker and so on. The happenings
in different phases of game (powerplay, middle over and death over) can help the team management and
opponents to strategize their game plan, and the users an idea about the proceedings of the game. The analysis is
done in IBM Watson and insights obtained are visualized using IBM Cognos.[2][3][9]

C. Teams’ performance in particular season:


The feature gives an idea to the user about the proceedings of the game in a particular year. Data mining is
performed to fetch results relevant to the particular team in the year prescribed by the future.[2][9]

D. Predicting Model
An important type of analysis in data analytics is predictive analysis. It provides insights about what is going
to happen in the near future. This is predicted by a model trained using machine learning algorithms. Various
ML algorithms like Random Forest were applied and linear regression...etc. [1][5][7][8]

E. Chatbot:
Chatbot is created to answer minimal queries asked by the users. The chatbot is created using Watson
assistant by adding relevant dialog and entities that can handle irrelevant questions and answer relevant
questions. The chatbot can solve queries related to the structure of IPL, format of the game and so on. The above
are features of the product that add value to the product and the user-friendly interface improves the reliability
of the product [6].

III. DATASET DESCRIPTION


The section describes the IPL dataset providing information on team performance from 2008 to 2019. The
section provides details on the attributes, instances, missing values in the dataset.
Number of attributes: 18
Number of instances: 756
ID –The attributes contain the information about the unique id for a match.
SEASON –The attribute contains the information about the year when the match has been conducted.
CITY - The attribute holds the information about the city where the match took place.
DATE – The attribute holds the information about the date when the match has been held.
TEAM 1 – The attribute describes that which team is going to bat first.
TEAM 2 – The attribute describes that which team is going to bat second.
TOSS_WINNER – The attribute holds the information about who wins the toss in that match.
TOSS_DECISION – The attribute contains the information about the decision (bat/field) taken by the toss
winner. RESULT – The attribute contains information about the result (normal/tie) of the players.
DL_APPLIED – The attribute describes whether the. Duckworth Lewis (DL) rule is applied.
WINNER – The attribute holds the information about the winner of the match.
WIN_BY_RUNS – The attribute describes that which team had win by runs.
WIN_BY_WICKETS – The attribute describes that which team had win by wickets.
PLAYER_OF_MATCH – The attribute contains information about the man of the match.
VENUE – The attribute contains information about in which place the match has been played.
UMPIRE 1 – The attribute contains information about the names of the umpire 1.
UMPIRE 2 – The attribute contains information about the names of the umpire 2.
UMPIRE 3 – N/A.

16293 ijariie.com 1157


Vol-8 Issue-2 2022 IJARIIE-ISSN(O)-2395-4396

IV. LITERATURE SURVEY

S.no Name of the Author(s) Year Variable Used Result


Paper
Predicting Rabindra 2018 home team, away team, toss winner, The Multilayer
Outcome of Lamsal, venue, umpires, home team score, perception classifier
Indian Ayesha away team score, power play score, outperformed other
1 Premier Choudhary playing 11 players, Number of classifiers with correctly
League wickets taken, Number of dot balls predicting 43 out of 60,
(IPL) given, Number of fours, Number of 2018 Indian Premier
Matches sixes, Number of catches, Number of League matches. The
Using stampings Twenty20 format of
Machine cricket carries a lot of
Learning randomness, because a
single over can
completely change the
ongoing pace of the
game. Indian Premier
League is still at
infantry stage, it is just a
decade old league and
has way a smaller
number of matches
compared to test and
one-day international
formats.
2 Increased Kalpdrum 2018 No. of Innings, Batting Average, Random Forest builds
prediction Passi, Nirav Strike Rate, Highest Score, Overs, the most accurate
accuracy in Kumar Bowling Average, Bowling Strike prediction models for
the game of Pandey Rate, Four/Five Wicket Haul, Venue, both batting and
cricket Centuries,Fifties,Batting,Match bowling in all the cases.
using Time,Hand,Match Type,Batting Also, the accuracy of
machine Position, Bowling Hand the models increases as
learning increase the size of the
training dataset for all
algorithms except in
case of Naïve Bayes for
batting where the
accuracy decreases as
we increase the size of
the training set.
Selection of the right
players for each match
plays a significant role
in a team’s victory. An
accurate prediction of
how many runs a
batsman is likely to
score and how many
wickets a bowler is
likely to take in a match
will help the team
management select best
players for each match.
3 Cricket Mr. Suyash 2019 City, Venue, Toss Result, Home From the previous data,
Analytics Mahajan, Ms. Team, Away Team it is beneficial to the
and Gunjan owner to get the details
Predictor Kandhari, of the IPL match played
16293 Ms. Salma ijariie.com and the users who 1158
Shaikh, Ms. predict the winning
Rutuja Pawar, percentage of the team
Vol-8 Issue-2 2022 IJARIIE-ISSN(O)-2395-4396

Mr. Jash and get the statistics of


Vora, Ms. A. the player.
R. Deshpande
4 Prediction Pramila M. 2018 Pitch, Toss, Team strength, Home a predictive model, a
of Live Chawan Ground Advantage user makes a prediction
Cricket on every game, and ends
Score and up watching that game
Winning to check if his prediction
is going right Thus the
project will not only
improve the existing
system of Fantasy
Cricket, but will also
augment the reach of
Cricket in India

In this there are many past research paper which has been discussed. Predicting the Indian Premier League
using Rabindra Lamsal machine reading, Ayesha Choudhary 2018 where there are variables used as home team,
missing team, tos winner, location, referee, home team points, outside team points, power game. points, playing
11 players, Number of wickets taken, Number of balls with dots awarded, Number Four, Number Six, Hold
number, stamp number. The Multilayer Vision segment has done much better than other class designers by
accurately predicting 43 of the 60, 2018, Indian Premier League games. The Twenty 20 cricket format has a lot
of unexpected features because one over can change the continuous pace of the game. Increased prediction
accuracy in the game of cricket using the Kalpdrum Passi learning machine, Nirav Kumar Pandey in 2018
where the variables were the number of Innings, Batting Average, Strike Rate, Highest Score, Overs, Bowling
Average, Bowling Strike Rate, Four / Five Wicket Haul, Location, Hundreds, Fifty, Hitting, Game Time, Hand,
Game Type, Hitting, Hand Bowling where the analysis was a random jungle that created the most accurate
guessing models in both hitting and throwing in all directions. Also, the accuracy of the models increases as it
increases the size of the training data for all increases the size of the training data set in all algorithms except in
the case of the Naïve Bayes bit where accuracy decreases as we increase the size of the training. set. The
selection of the right players for the entire game played a big part in the team's victory. Accurate predictions
about how much the batsman can gain and how many sticks a player may need in a game will help the team
select players for a particular game. Live Cricket Score Prediction with Pramila M. Chawan Winner of 2018
where the variables used were Pitch, Toss, team strength, Home Ground Advantage where the result was a
predictable model, the user predicts every game, and ends up watching the game checking that prediction it goes
well. The project will therefore not only improve the popular cricket system but will also increase access to
cricket in India. Cricket Analytics and Mr. Predictor Suyash Mahajan, Mr. Gunjan Kandhari, Mr. Salma Shaikh,
Mr. Rutuja Pawar, Mr. Jash Vora, Ms. AR Deshpande in 2019 where other uses were City, Venue, Toss Result,
Home Team, Away Team where previous data, it is beneficial for the owner to get details of the IPL game
played by users who predict the winning percentage of the team and get player stats.

V. RESEARCH METHODOLOGY
A. Research Objective:

The objective of this research is to observe impact of different Machine learning models in Prediction of an
IPL match. Another objective of this study is to explore information, pattern related to Matches, Player etc.
using descriptive analysis so as to increase the decision-making effectiveness.

B. Purpose of Study:

Betting is an illegal activity for some region like India, but most of the country, is profitable business
because Betting is not the only skill gambling, it is also a game of mathematical skills. No bets only, there are
many arenas of the dream game like Dream11, my cricket circle where millions of users invest to get a good
value profit where mathematical skills and data are present it is important.
This study primarily aims to find out different statistical measures from IPL historical data and predict outcome
of a match based on important factors to help users of betting sites and fantasy cricket league with scientific
proof to support in their decision-making process.

16293 ijariie.com 1159


Vol-8 Issue-2 2022 IJARIIE-ISSN(O)-2395-4396

VI. METHODOLOGY

FLOW CHART

A. Sample Design:
Secondary data is what is collected by someone. Some Common sources of secondary data include
government public services department’s Repository, libraries, internet searches and censuses. For this project, I
have used Secondary data source to collect data.
B. Data Source:
For this project work, data has been taken from Kaggle.com. Kaggle is subsidiary of Google LLC. It is an
online community of data scientists and machine learning aspirant. It is also a repository of open-source data.
Analytical Methodology:

This Project work focus on following two Analytical Methods-

I)Descriptive Analytics
II)Predictive Analytics

16293 ijariie.com 1160


Vol-8 Issue-2 2022 IJARIIE-ISSN(O)-2395-4396

A. Descriptive Analytics: Descriptive Analytics is a Method use in primary stages of any Analytics project to
create a summary of historical data to mine useful knowledge, based on which further analysis can be done. In
simple language, Descriptive analytics answered question like “what happened?”. In our Project work,
Descriptive model focus on two aspects:
I. Describe the data statistically.
II. Describe important factors.

C. Predictive Analytics: Predictive Analytics is a Method use in Advance stages of Analytics Projects to
Predict Unknown future events based on different factors. Predictive Analytics use different Algorithms to
build predictive models. Some of popular Algorithms, used in Predictive Modelling, are –Linear Regression,
Random Forest, etc. For our project work, we will use predictive analytics to predict result of any matches.

VII. DATA FLOW DIAGRAM (DFD):

VIII. TOOLS USED


A. IBM Cognos Tool:
Cognos is a web-based Business Intelligence platform created by IBM. Provides a set of tools for analysing,
reporting, and monitoring various metrics. IBM Cognos contains various components that are used to meet the
needs of different information in any company. IBM Cognos has components such as IBM Cognos Framework
Manager, IBM Cognos Cube Designer, IBM Cognos Transformer to help analyse data easily. Cognos will use
this project as a descriptive mathematical tool.

B. Python:
Python is an interpreted, high-level, general-purpose language. It’s language uses an object-oriented
approach that helps programmers write clear and logical code for any type of project. Thanks to its extensive
library, Large Community, memory management, python is very popular among the Machine Learning
community. In this Project Report, Python will be used for both descriptive and predictive analysis.
C. IBM Watson Studio (Watson Assistant):
IBM Watson Studio is software that simplifies developing, training, managing models, and deploying AI
applications and used purpose, business and communication. It develops with many new features to build
Artificial Intelligence applications. IBM Watson Assistant is a cloud service that allows for the development of
a visual assistant in the software they are developing and branding the assistant as its own. Watson's assistant
will help develop a visual assistant for this project.

16293 ijariie.com 1161


Vol-8 Issue-2 2022 IJARIIE-ISSN(O)-2395-4396

IX. RESULT AND ANALYSIS


1. From the analysis most IPL matches tool place in Eden Garden and Wankhade stadium and it is also
important factor for winning the match.
2. Mumbai Indians are the most successful groups who has won 109 games out of 11 times (2008-2020).
Chennai Super kings is the second most successful group in the 100 wins, followed by the Kolkata
Night Riders with 92 wins. Mumbai Indians won the most cups.
3. Most of the time the man of the match of IPL game goes to Chris Gayle who owns the top number
followed by AB de Villers and David Warner and Indians MS Dhoni is top on the list.
4. In this it is clear that toss win or loss has least effect to win a match. After winning the toss, team wins
the match is higher than when team win the toss but did not manage to win the match but however the
decision to balling or batting first play a major role.
5. In a different models of machine learning, we find that Linear Regression, Random Forest has a very
accurate model at a rate of 88% accuracy.

X. BUSINESS / SOCIAL IMPACT


As I am forecasting the predictive analysis of IPL, it will help the Sponsors to know whom they can sponsor
and which players they can buy in auction.

XI. CONCLUSION
Analytics can be used for Cricket match Prediction and its analysis in very easy way. For IPL game,
Teams, Venue, Winning Toss, Venue of the Match and Decision after winning the toss are important influencers
to win a match. Different Machine Learning helps to predict outcome of a match. Right selection of Machine
Learning Model helps to increase Accuracy of Prediction. From Different machine learning Models, Linear
Regression and Random Forest are best to predict outcome of an IPL games. Both of the following gives almost
88% accuracy Level. With this we can predict the IPL match through machine learning models.

XII. SCOPE OF FUTURE STUDY


I have taken only few factors as a predictor. but in cricket there are many factors which could have impact on a
Match result. Player is one of those influencers. Every year IPL teams use to change their players; lots of new
cricketer gets chances to play for the teams. So, in future studies Player can be used as a Predictor. Another
important factor is pitch, in future pitch can be use as predictor. In this study I used only simple popular
machine learning models, more complex models can be used to increase accuracy in future. Thus, using
modelling effective decision-making can be accomplished through using tools and techniques in sports
analytics.

REFERENCES
Extensive study about the topic was performed and various methodologies used in this domain were found.
Predominantly there were two types of analysis: exploratory and predictive. Exploratory analysis visualizes
events that have occurred in the past and provides meaningful insights that can be used for decision making
[1] Application of Machine Learning for Predictive Analytics: Indian Premier League (IPL) T-20 Cricket
Matches 02 Subhashish Mahata, Neetu Kamra, Naina Kumari Agarwal
[2] INSIGHTS ON IPL TEAM PERFORMANCE USING VISUAL ANALYTICS S. Sharuka, R. Vani
[IJESRT][Sharuka, et al., 8(11): November, 2019]
[3] T. H. Davenport, “What businesses can learn from sports analytics,” MIT Sloan Management Review, vol.
55, no. 4, p. 10, 2014.
[4] Get started with Dashboards and Stories
[5] Score Prediction and Player Classification Model in the Game of Cricket Using Machine Learning Sonu
Kumar, Sneha Roy
[6] Watson Assistant: Intelligent virtual agent
[7] Predicting Outcome of Indian Premier League (IPL) Matches Using Machine Learning Rabindra Lamsal
and Ayesha Choudhary [arXiv:1809.09813 [stat. AP]]
[8] PREDICTION ON IPL DATA USING MACHINE LEARNING TECHNIQUES IN R PACKAGE G.
Sudhamathy and G. Raja Meenakshi (OCTOBER 2020, VOLUME: 11, ISSUE: 01)
[9] Analysis and Predicting Results of IPL T20 Matches Praveen Banasode, Minal Patil, Supriya-Verma
[10] C.Deep Prakash, C.Patvardhan and Sushobhit Singh,” A new Category based Deep Performance Index
using Machine Learning for ranking IPL Cricketers”, Int. Jl. of Electronics, Electrical and Computational
System IJEECS ISSN 2348-117X Volume 5, Issue 2 February 2016

16293 ijariie.com 1162

You might also like