Super Predictor of Indian Premier League (IPL) Using Various ML Techniques With Help of IBM Cloud
Super Predictor of Indian Premier League (IPL) Using Various ML Techniques With Help of IBM Cloud
Abstract
Indian Premier League matches are one in every of the foremost important events in India. it's professional
cricket league in India contested by eight teams representing the various cities in India. The paper focuses on
the performance analysis of the eight contesting IPL teams supported the runs of the team, wickets, decisions
usurping winning the toss and Duckworth Lewis rule analysis. The IPL data from 2008 to 2019 is used for the
player analysis. The team performances are visualized graphically using data analytics to render the
interpretation in an exceedingly good manner. The performance data using visual analytics helps in selecting
players for future matches and provides additional information on player yet as team profiles. Almost every IPL
team’s Management use Analytics for better games. Not only team’s owners, there are several betting and
fantasy cricket platform, which are highly rely upon analytics for his or her success. Analytics can help all of
them for his or her success.
The research paper tries to predict the IPL matches using machine learning models with variables like match id,
inning, batting team, bowling team, over, ball, batsman, non-striker, bowler is super over, wide runs, bye runs,
leg by runs, bowling runs, penalty runs, batsman runs, extra runs, total runs, player dismissed, dismissal kind
fielder. to appear out the result it uses different machine learning models like statistical method model, Random
Forest. The results of the study shows that for IPL game, Teams, Venue, Winning Toss, Venue of the Match and
Decision after winning the toss are important influencers to win a match. Different Machine Learning helps to
predict outcome of a match. Right selection of Machine Learning Model helps to extend Accuracy of Prediction.
From Different Classification Models, Support Vector Machine, Decision Tree and Random Forest are best to
predict outcome of an IPL games. All of the subsequent gives almost 88% accuracy Level. The study has been
conducted from data of Kaggle. Secondary data has been used for the analysis.
I. INTRODUCTION
A. Overview of our project:
Creating a legendary milestone in the world of a sport like cricket, the BCCI introduced Indian Premier
League (IPL). As the volume of cricket enthusiasts across the globe increases, the curiosity on prediction of the
future matches led to creation of many websites like Dream11, IPL fantasy league etc. IPL predictions can be
right or wrong. Sports analytics is not new in cricket. Cricket being an attractive and profitable sports, there are
so many stakeholders, involved in these sports. That’s why decision-making process is very critical in cricket.
Analytics is a supporting pillar for every decision maker in this game. The IPL predictor visualizes useful
insights [4] and predicts outputs for instances provided by the user. The product caters to the needs of sports
analysts, broadcasters, sports enthusiasts, business corporates [3], team management and so on [3]. The
objective of this research is to observe impact of different Machine learning models in Prediction of an IPL
match. Another objective of this study is to explore information, pattern related to Matches, Player etc. using
descriptive analysis so as to increase the decision-making effectiveness. The main objective of our project is to
analyse the IPL statistics of various matches and teams. Dashboards [4] are created for individual teams to
provide an insight of their performance from the beginning of IPL. Future scores pertaining to a match are
predicted using our web application. The project, developed as a web application, will satisfy the needs of a
cricket expert as well as a beginner.
D. Predicting Model
An important type of analysis in data analytics is predictive analysis. It provides insights about what is going
to happen in the near future. This is predicted by a model trained using machine learning algorithms. Various
ML algorithms like Random Forest were applied and linear regression...etc. [1][5][7][8]
E. Chatbot:
Chatbot is created to answer minimal queries asked by the users. The chatbot is created using Watson
assistant by adding relevant dialog and entities that can handle irrelevant questions and answer relevant
questions. The chatbot can solve queries related to the structure of IPL, format of the game and so on. The above
are features of the product that add value to the product and the user-friendly interface improves the reliability
of the product [6].
In this there are many past research paper which has been discussed. Predicting the Indian Premier League
using Rabindra Lamsal machine reading, Ayesha Choudhary 2018 where there are variables used as home team,
missing team, tos winner, location, referee, home team points, outside team points, power game. points, playing
11 players, Number of wickets taken, Number of balls with dots awarded, Number Four, Number Six, Hold
number, stamp number. The Multilayer Vision segment has done much better than other class designers by
accurately predicting 43 of the 60, 2018, Indian Premier League games. The Twenty 20 cricket format has a lot
of unexpected features because one over can change the continuous pace of the game. Increased prediction
accuracy in the game of cricket using the Kalpdrum Passi learning machine, Nirav Kumar Pandey in 2018
where the variables were the number of Innings, Batting Average, Strike Rate, Highest Score, Overs, Bowling
Average, Bowling Strike Rate, Four / Five Wicket Haul, Location, Hundreds, Fifty, Hitting, Game Time, Hand,
Game Type, Hitting, Hand Bowling where the analysis was a random jungle that created the most accurate
guessing models in both hitting and throwing in all directions. Also, the accuracy of the models increases as it
increases the size of the training data for all increases the size of the training data set in all algorithms except in
the case of the Naïve Bayes bit where accuracy decreases as we increase the size of the training. set. The
selection of the right players for the entire game played a big part in the team's victory. Accurate predictions
about how much the batsman can gain and how many sticks a player may need in a game will help the team
select players for a particular game. Live Cricket Score Prediction with Pramila M. Chawan Winner of 2018
where the variables used were Pitch, Toss, team strength, Home Ground Advantage where the result was a
predictable model, the user predicts every game, and ends up watching the game checking that prediction it goes
well. The project will therefore not only improve the popular cricket system but will also increase access to
cricket in India. Cricket Analytics and Mr. Predictor Suyash Mahajan, Mr. Gunjan Kandhari, Mr. Salma Shaikh,
Mr. Rutuja Pawar, Mr. Jash Vora, Ms. AR Deshpande in 2019 where other uses were City, Venue, Toss Result,
Home Team, Away Team where previous data, it is beneficial for the owner to get details of the IPL game
played by users who predict the winning percentage of the team and get player stats.
V. RESEARCH METHODOLOGY
A. Research Objective:
The objective of this research is to observe impact of different Machine learning models in Prediction of an
IPL match. Another objective of this study is to explore information, pattern related to Matches, Player etc.
using descriptive analysis so as to increase the decision-making effectiveness.
B. Purpose of Study:
Betting is an illegal activity for some region like India, but most of the country, is profitable business
because Betting is not the only skill gambling, it is also a game of mathematical skills. No bets only, there are
many arenas of the dream game like Dream11, my cricket circle where millions of users invest to get a good
value profit where mathematical skills and data are present it is important.
This study primarily aims to find out different statistical measures from IPL historical data and predict outcome
of a match based on important factors to help users of betting sites and fantasy cricket league with scientific
proof to support in their decision-making process.
VI. METHODOLOGY
FLOW CHART
A. Sample Design:
Secondary data is what is collected by someone. Some Common sources of secondary data include
government public services department’s Repository, libraries, internet searches and censuses. For this project, I
have used Secondary data source to collect data.
B. Data Source:
For this project work, data has been taken from Kaggle.com. Kaggle is subsidiary of Google LLC. It is an
online community of data scientists and machine learning aspirant. It is also a repository of open-source data.
Analytical Methodology:
I)Descriptive Analytics
II)Predictive Analytics
A. Descriptive Analytics: Descriptive Analytics is a Method use in primary stages of any Analytics project to
create a summary of historical data to mine useful knowledge, based on which further analysis can be done. In
simple language, Descriptive analytics answered question like “what happened?”. In our Project work,
Descriptive model focus on two aspects:
I. Describe the data statistically.
II. Describe important factors.
C. Predictive Analytics: Predictive Analytics is a Method use in Advance stages of Analytics Projects to
Predict Unknown future events based on different factors. Predictive Analytics use different Algorithms to
build predictive models. Some of popular Algorithms, used in Predictive Modelling, are –Linear Regression,
Random Forest, etc. For our project work, we will use predictive analytics to predict result of any matches.
B. Python:
Python is an interpreted, high-level, general-purpose language. It’s language uses an object-oriented
approach that helps programmers write clear and logical code for any type of project. Thanks to its extensive
library, Large Community, memory management, python is very popular among the Machine Learning
community. In this Project Report, Python will be used for both descriptive and predictive analysis.
C. IBM Watson Studio (Watson Assistant):
IBM Watson Studio is software that simplifies developing, training, managing models, and deploying AI
applications and used purpose, business and communication. It develops with many new features to build
Artificial Intelligence applications. IBM Watson Assistant is a cloud service that allows for the development of
a visual assistant in the software they are developing and branding the assistant as its own. Watson's assistant
will help develop a visual assistant for this project.
XI. CONCLUSION
Analytics can be used for Cricket match Prediction and its analysis in very easy way. For IPL game,
Teams, Venue, Winning Toss, Venue of the Match and Decision after winning the toss are important influencers
to win a match. Different Machine Learning helps to predict outcome of a match. Right selection of Machine
Learning Model helps to increase Accuracy of Prediction. From Different machine learning Models, Linear
Regression and Random Forest are best to predict outcome of an IPL games. Both of the following gives almost
88% accuracy Level. With this we can predict the IPL match through machine learning models.
REFERENCES
Extensive study about the topic was performed and various methodologies used in this domain were found.
Predominantly there were two types of analysis: exploratory and predictive. Exploratory analysis visualizes
events that have occurred in the past and provides meaningful insights that can be used for decision making
[1] Application of Machine Learning for Predictive Analytics: Indian Premier League (IPL) T-20 Cricket
Matches 02 Subhashish Mahata, Neetu Kamra, Naina Kumari Agarwal
[2] INSIGHTS ON IPL TEAM PERFORMANCE USING VISUAL ANALYTICS S. Sharuka, R. Vani
[IJESRT][Sharuka, et al., 8(11): November, 2019]
[3] T. H. Davenport, “What businesses can learn from sports analytics,” MIT Sloan Management Review, vol.
55, no. 4, p. 10, 2014.
[4] Get started with Dashboards and Stories
[5] Score Prediction and Player Classification Model in the Game of Cricket Using Machine Learning Sonu
Kumar, Sneha Roy
[6] Watson Assistant: Intelligent virtual agent
[7] Predicting Outcome of Indian Premier League (IPL) Matches Using Machine Learning Rabindra Lamsal
and Ayesha Choudhary [arXiv:1809.09813 [stat. AP]]
[8] PREDICTION ON IPL DATA USING MACHINE LEARNING TECHNIQUES IN R PACKAGE G.
Sudhamathy and G. Raja Meenakshi (OCTOBER 2020, VOLUME: 11, ISSUE: 01)
[9] Analysis and Predicting Results of IPL T20 Matches Praveen Banasode, Minal Patil, Supriya-Verma
[10] C.Deep Prakash, C.Patvardhan and Sushobhit Singh,” A new Category based Deep Performance Index
using Machine Learning for ranking IPL Cricketers”, Int. Jl. of Electronics, Electrical and Computational
System IJEECS ISSN 2348-117X Volume 5, Issue 2 February 2016