0% found this document useful (0 votes)

25 views22 pages

YEAR: 2024 - 2025: Ipl Data Analysis Using Mysql and Python Connectivy

class 12 cs investigatory project , ipl data analysis using my sql

Uploaded by

Raghav.S /6218

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views22 pages

YEAR: 2024 - 2025: Ipl Data Analysis Using Mysql and Python Connectivy

class 12 cs investigatory project , ipl data analysis using my sql

Uploaded by

Raghav.S /6218

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

YEAR: 2024 - 2025

IPL DATA ANALYSIS

USING MySQL AND PYTHON CONNECTIVY

PROJECT BY:
S.Raghav
INDEX

S.No Contents Page No

1 Objective 1

2 Introduction 2

3 Hardwares and Softwars used 3

4 Python Code 4

5 Table Structure 11

6 Output 13

7 Conclusion 19
8 Bibliography 20
OBJECTIVE

The main objective of this Python project on IPL Data Analysis is to explore
and analyze IPL match data, stored in a MySQL database, to gain insights into
team performance, player statistics, and match outcomes. This project
provides an interactive platform to visualize key statistics and trends,
enabling data-driven decision-making and a deeper understanding of
historical patterns in the IPL.

This project serves as an analytical tool for cricket analysts, enthusiasts, and
coaching staff, offering a Python interface to interact with IPL datasets stored
in MySQL. The purpose is to develop a system that streamlines data handling,
from retrieving and processing data in MySQL to generating reports and
visualizations. With automated data extraction and dynamic insights into
player and team performance
INTRODUCTION

The IPL Data Analysis System is a Python-based application developed to

manage and analyze data from the Indian Premier League, providing in-depth
insights into match statistics, player performance, and team strategies. This
system leverages MySQL to securely store historical match data, player
details, and season summaries, creating a reliable data source for robust
analysis. Through an intuitive Python interface, users can retrieve and
visualize data, exploring trends across seasons, evaluating player consistency,
and comparing team performances.
Analysts and cricket enthusiasts can access a variety of features, including
match outcome statistics, player performance metrics, and season-over-
season comparisons. This system also supports custom reports and
visualizations, helping users gain a comprehensive understanding of IPL
dynamics and uncovering valuable insights for strategy enhancement and
decision-making.
Python acts as the front end of this system, processing data and presenting it
through an interactive interface, while MySQL serves as the back end,
organizing and storing IPL data securely in tables. The application is designed
to be efficient, user-friendly, and suitable for users with minimal technical
knowledge, providing a powerful tool for IPL data analysis and forecasting.
HARDWARE USED

Intel i5 Core Processer

16 GB RAM
1TB Hard Disk

SOFTWARE USED

Windows 10 - 64 bit operating system.

Python 3.12.7
MySQL v8.0.0
Python Code
import math
import numpy as np
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

#PRINTING THE DEATILS OF FIRST TWO DELIVERIES IN IPL HISTORY

ipl_data=pd.read_csv("D:\\cs project\\deliveries.csv")
ipl_matches=pd.read_csv("D:\\cs project\\matches.csv")

ipl_data.head(2)

print('PRINTING DATA TYPES OF ALL FIELDS IN THE CSV FILE')

ipl_data.info()
#MODIFYING THE FILE TO BE IMPORTED AS MYSQL TABLE
ipl_data[ipl_data.duplicated(keep=False)]
ipl_data.drop_duplicates(inplace=True) #inplace=True,it removes duplicates permanantely
ipl_data.duplicated().sum()

ipl_data.isnull().sum()

import pandas as pd
import mysql.connector
#creating a connection between notebook and database.
mydb=mysql.connector.connect(host="localhost",
database="ipl",
user="root",
password="root")

cursor=mydb.cursor() # making a cursor to excute queries.

cursor.execute('''Select * from ipl_ball_by_ball''') #SQL query execution.

mydb.close()
print('To Find the top five venue where most of IPL Matches played.')
top_played_venue=ipl_matches.groupby(['venue','id']).count().droplevel(level=1).index.value_co
unts().head()
top_played_venue=top_played_venue.reset_index() #reset_index() will convert series to
DataFrame
top_played_venue.rename(columns={'count':'Total_match'},inplace=True) #Renaming the
column to appropriate field
top_played_venue

print('Which team has played highest number of matches till 2020.')

import pandas as pd
import mysql.connector
mydb=mysql.connector.connect(host="localhost",
database="ipl",
user="root",
password="root")

mycursor=mydb.cursor()
sql_statement='''
with cte_matches as (
select team1 as team from matches
UNION ALL
select team2 as team from matches)
select team,count(1) total_played from cte_matches group by team order by 2 desc'''

mycursor.execute(sql_statement)
total_match_df = pd.DataFrame(mycursor.fetchall(),columns=['Team Name','Total Played
Matches'])
total_match_df.head(3)

print('plot of the horizontal bar plot of matches played by individual teams')

import matplotlib.pyplot as plt
import seaborn as sns
plt.figure(figsize=(6,5))
sns.barplot(data=total_match_df,x='Total Played Matches',y='Team Name',orient='h',width=0.6)
plt.show()
print('how many times team has won and loss the matches')
import pandas as pd
import mysql.connector
mydb=mysql.connector.connect(host="localhost",
database="ipl",
user="root",
password="root")

mycursor=mydb.cursor()
sql_statement='''with CTE_matches as(
select team1 as team,winner from matches
union all
select team2 as team,winner from matches
)
select team, count(case when team = winner then 1 end) as total_won,
count(case when team <> winner then 1 end) as total_loss
from CTE_matches
group by team order by 2 desc'''
mycursor.execute(sql_statement)
won_loss_df = pd.DataFrame(mycursor.fetchall(),columns=['Team','Won','Lose'])
won_loss_df.head(19)

print('WINNING RATIO OF TEAMS')

won_loss_df['Winning Ratio'] = round((won_loss_df['Won']/(won_loss_df['Lose'] +
won_loss_df['Won']))*100,2)
won_loss_df.head(19)

print('Most IPL centuries by a player')

import pandas as pd
import mysql.connector
mydb=mysql.connector.connect(host="localhost",
database="ipl",
user="root",
password="root")
mycursor=mydb.cursor()
statement='''WITH CTE_Run_scored as(
select id Match_id,batter,
sum(batsman_run) as 'Run_scored'
from ipl_ball_by_ball
group by id,batter
)
select * from (
Select batter,
count(Case when run_scored>=100 THEN 1 end) as total_centurie
from CTE_Run_scored
group by batter order by 2 desc) temp
where temp.total_centurie>0'''
mycursor.execute(statement)
most_centuries_df = pd.DataFrame(mycursor.fetchall(),columns=['Player','Total Centuries'])
most_centuries_df.head()

print('TOP 5 RUN SCORERS FROM EACH TEAM')

import pandas as pd
import mysql.connector
mydb=mysql.connector.connect(host="localhost",
database="ipl",
user="root",
password="root")
cursor=mydb.cursor()
SQL_statement='''
WITH IPL_Ranking as(
select Battingteam,batter,sum(batsman_run) as batsman_runs,
dense_rank() OVER(partition by battingteam order by sum(batsman_run) desc) as
'Player_Rank'
from ipl_ball_by_ball group by 1,2
)
select * from IPL_Ranking Where Player_rank <= %s
'''

cursor.execute(SQL_statement,(5,))
df=pd.DataFrame(cursor.fetchall(),columns=['Team_name','Batsman','Total_Run','Ranking'])
mydb.close()
#Output will contain top 5 batsman from each team but we will only see first 10
df.head(10)

print('plot a bar chart over player runs')

import matplotlib.pyplot as plt
plt.figure(figsize=(6,4))
plt.bar(df['Batsman'].head(10).to_list(),df['Total_Run'].head(10).to_list(),width=0.3)
plt.xticks(rotation=90)
plt.show()
print('find the total run scored by Virat Kohli till his 25th,5oth,100th and 200th match')
import pandas as pd
import mysql.connector
mydb=mysql.connector.connect(host="localhost",
database="ipl",
user="root",
password="root")
cursor=mydb.cursor()
SQL_statement='''
WITH CTE_Run_scored as(
select concat('Match-',row_number() Over(order by id)) as Match_No,
sum(batsman_run) as 'Run_scored'
from ipl_ball_by_ball
where batter='V Kohli'
group by id)
Select * from (select Match_No,Run_scored,
sum(run_scored) over(rows between unbounded preceding and current row) as
'cumulative_run'
from CTE_Run_scored) temp
where temp.Match_No IN ("Match-25","Match-50","Match-75",
"Match-100","Match-125","Match-150")
'''
cursor.execute(SQL_statement)
cummulative_run_df=pd.DataFrame(cursor.fetchall(),columns=
["Match_No","Run_scored","cumulative_run"])
mydb.close()
cummulative_run_df

print('Adding one extra column in above dataframe to make easy x-axis.')

cummulative_run_df['Match']=cummulative_run_df['Match_No'].apply(lambda x:x[6:])
cummulative_run_df

print('plotting the graph of above data')

import matplotlib.pyplot as plt

import seaborn as sns
fig=plt.figure(figsize=(6,4))
axes=fig.add_axes([0.1,0.1,0.8,0.8])
cummulative_run_df['Match']=cummulative_run_df['Match_No'].apply(lambda x:x[6:])
axes.plot(cummulative_run_df['Match'].to_list(),cummulative_run_df['cumulative_run'].to_list(),
color='red',linestyle='--',marker='o',markerfacecolor='k')
plt.show()
print('Average of Suresh Raina till his 50th,100th,150th match')
import pandas as pd
import mysql.connector
mydb=mysql.connector.connect(host="localhost",
database="ipl",
user="root",
password="root")
cursor=mydb.cursor()

sql_statement='''
with CTE_Total_run as(
select batter,concat('Match-',row_number() over(order by id)) as Match_No,
sum(batsman_run) as Run_scored
from ipl_ball_by_ball
where batter='SK Raina'
group by id
)
Select * from(
select *,Round(avg(Run_scored) OVER(rows between unbounded preceding and current
row),2) as avg_each_match
from CTE_Total_run) temp
where temp.Match_No="Match-50"
OR temp.Match_No="Match-100"
OR temp.Match_No="Match-150"
'''
cursor.execute(sql_statement)
running_avg_raina=pd.DataFrame(cursor.fetchall(),columns=
["Batsman","Match_No","Run_scored","avg_each_match"])
mydb.close()
running_avg_raina
print('Most Dot Ball by a Bowler')
import pandas as pd
import mysql.connector
mydb=mysql.connector.connect(host='localhost',
database='ipl',
user='root',
password='root')
mycursor=mydb.cursor()
sql_statement='''
select bowler, sum(dot_ball) as total_dot_ball from(
select id,bowler,count(case when total_run=0 then 1 end) as dot_ball
from ipl_ball_by_ball group by id,bowler
)temp
group by bowler order by 2 desc
'''
mycursor.execute(sql_statement)
dot_ball_df = pd.DataFrame(mycursor.fetchall(),columns=['Bowler','Total_dot_ball'])
dot_ball_df.head()
Table Structure:
Python Output:
CONCLUSION

The IPL Data Analysis project using Python and MySQL helps analyze the
performance of players and teams in the Indian Premier League. By storing
match and player data in MySQL and using Python for data processing and
visualization, we can easily explore insights like top run-scorers, winning
ratios, and popular match venues. The project allows for a better
understanding of IPL trends and performances, making it a useful tool for
analyzing team and player statistics. This combination of database
management and data analysis provides a simple yet powerful way to
uncover key information from IPL data.
BIIBLIOGRAPHY:
https://medium.com/@keep9647smile/ipl-data-analysis-
11250e6ee603
https://www.kaggle.com/datasets/patrickb1912/ipl-
complete-dataset-20082020 (For Data)
https://chatgpt.com/
https://www.linkedin.com/pulse/python-practice-project-
ipl-2022-cricket-sports-data-analysis-mishra

IP Practical File 2024-25
100% (7)
IP Practical File 2024-25
22 pages
IP Project
100% (11)
IP Project
28 pages
Ipl Data Anlysis
No ratings yet
Ipl Data Anlysis
20 pages
IP PROJECT On Ipl Sahil Uppal
No ratings yet
IP PROJECT On Ipl Sahil Uppal
27 pages
IPL Data Analysis
100% (1)
IPL Data Analysis
26 pages
Share INFORMATICS PRACTICES KABIR
No ratings yet
Share INFORMATICS PRACTICES KABIR
37 pages
Ipl Data Analysis
No ratings yet
Ipl Data Analysis
19 pages
SREE
No ratings yet
SREE
24 pages
Ip Project
No ratings yet
Ip Project
20 pages
Class 12 CS Project On Cricket Stat Analysis
No ratings yet
Class 12 CS Project On Cricket Stat Analysis
37 pages
RAKESH
No ratings yet
RAKESH
24 pages
IPL-ExploratoryDataAnalysis - With MySQL
No ratings yet
IPL-ExploratoryDataAnalysis - With MySQL
12 pages
Final Ipl Project 1
100% (1)
Final Ipl Project 1
37 pages
Cs Ip (A)
No ratings yet
Cs Ip (A)
34 pages
Ip Project
No ratings yet
Ip Project
16 pages
Advanced IPL Match Analysis Using Python (Advanced)
No ratings yet
Advanced IPL Match Analysis Using Python (Advanced)
4 pages
Informatics Practices Project File PDF
0% (1)
Informatics Practices Project File PDF
45 pages
Ip Investigatory Project
No ratings yet
Ip Investigatory Project
28 pages
KUNJ1
No ratings yet
KUNJ1
17 pages
Code2pdf 6714bd5247d05
No ratings yet
Code2pdf 6714bd5247d05
3 pages
IP PROJECT On Ipl
No ratings yet
IP PROJECT On Ipl
27 pages
Indian Premier League Ip Project File
No ratings yet
Indian Premier League Ip Project File
42 pages
Ipl Analysis
No ratings yet
Ipl Analysis
19 pages
Modifiedip
No ratings yet
Modifiedip
27 pages
Week 5 Essay
No ratings yet
Week 5 Essay
2 pages
Ipl Data Analysis Porgram
No ratings yet
Ipl Data Analysis Porgram
6 pages
SQL Project Ipl Data Analysis
No ratings yet
SQL Project Ipl Data Analysis
23 pages
IP Project
No ratings yet
IP Project
28 pages
T 20 WC
No ratings yet
T 20 WC
4 pages
Advanced IPL Match Analysis Using Python (Basic)
No ratings yet
Advanced IPL Match Analysis Using Python (Basic)
3 pages
# IP Practical #
No ratings yet
# IP Practical #
12 pages
Write Queries For The Following Tasks
No ratings yet
Write Queries For The Following Tasks
3 pages
IPL T20 Cricket Analysis Shallshkagksgsohssgsigsgslhsagsjsgsjgsjsh
No ratings yet
IPL T20 Cricket Analysis Shallshkagksgsohssgsigsgslhsagsjsgsjgsjsh
37 pages
IPL - Prediction - Model - Training - Final - Ipynb - Colab
No ratings yet
IPL - Prediction - Model - Training - Final - Ipynb - Colab
8 pages
IP Project
No ratings yet
IP Project
28 pages
IP - PRACTICAL EXAM - Revision
No ratings yet
IP - PRACTICAL EXAM - Revision
24 pages
Ip Final Practical File
No ratings yet
Ip Final Practical File
22 pages
Ip Practical File
No ratings yet
Ip Practical File
23 pages
Assignment 3
No ratings yet
Assignment 3
2 pages
Questions and Answers On Ipl Database
No ratings yet
Questions and Answers On Ipl Database
5 pages
Business Analytics
No ratings yet
Business Analytics
25 pages
t20 Batting Analysis System (Ip Class 12) (2024-25)
No ratings yet
t20 Batting Analysis System (Ip Class 12) (2024-25)
25 pages
Matchdata - Ipynb - Colaboratory
No ratings yet
Matchdata - Ipynb - Colaboratory
3 pages
Program Dataframe
No ratings yet
Program Dataframe
8 pages
Written Practice On 17-Jan-2025
No ratings yet
Written Practice On 17-Jan-2025
3 pages
Vedant Aggarwal IP Project File
No ratings yet
Vedant Aggarwal IP Project File
27 pages
Project (IPL)
No ratings yet
Project (IPL)
1 page
Py Report
No ratings yet
Py Report
13 pages
Badri Project New 1
No ratings yet
Badri Project New 1
26 pages
Data Analytics Using Python
No ratings yet
Data Analytics Using Python
14 pages
Untitled Document
No ratings yet
Untitled Document
27 pages
24 Gourav
No ratings yet
24 Gourav
75 pages
Computer Project
No ratings yet
Computer Project
21 pages
IP Practical File 2022
No ratings yet
IP Practical File 2022
26 pages
AmanKhajuria IPproject
No ratings yet
AmanKhajuria IPproject
29 pages
Project Documents On Cricket Analysis
No ratings yet
Project Documents On Cricket Analysis
19 pages
Virat Kohil
No ratings yet
Virat Kohil
31 pages
Data Frames and Charts: 2.1 Working With Dataframes
No ratings yet
Data Frames and Charts: 2.1 Working With Dataframes
13 pages
12 Pandas
No ratings yet
12 Pandas
14 pages
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
Intro To Google Colab
No ratings yet
Intro To Google Colab
18 pages
Devops Timetable
No ratings yet
Devops Timetable
25 pages
Password Generation
No ratings yet
Password Generation
7 pages
POLS292 Group Assignment 8
No ratings yet
POLS292 Group Assignment 8
4 pages
Python Module
No ratings yet
Python Module
15 pages
Django Rest Framework Json API
No ratings yet
Django Rest Framework Json API
21 pages
How To Backup Your Tumblr Blog
No ratings yet
How To Backup Your Tumblr Blog
80 pages
Industrial Training Presentation - Monica Kaushik
50% (2)
Industrial Training Presentation - Monica Kaushik
19 pages
SONiC Unit Test and Function Test Enhancement - Edgecore
No ratings yet
SONiC Unit Test and Function Test Enhancement - Edgecore
22 pages
R & Python
No ratings yet
R & Python
22 pages
Class 12 Cs Ncert Cbse 2014-15 Python
100% (1)
Class 12 Cs Ncert Cbse 2014-15 Python
328 pages
02 - 01 - Polymorphism-Lab-Review-And-Practice-Lesson-Notes-Optional-Download - Polymorphism - Labs
No ratings yet
02 - 01 - Polymorphism-Lab-Review-And-Practice-Lesson-Notes-Optional-Download - Polymorphism - Labs
20 pages
CV BryanGislason
No ratings yet
CV BryanGislason
1 page
Skiib
No ratings yet
Skiib
12 pages
UNIT-1 Introduction To Python Programming
No ratings yet
UNIT-1 Introduction To Python Programming
10 pages
Internship Report 3
No ratings yet
Internship Report 3
109 pages
C For Python Programmers
No ratings yet
C For Python Programmers
15 pages
Python Programming Lab Manual
No ratings yet
Python Programming Lab Manual
36 pages
Ali Zaman
No ratings yet
Ali Zaman
1 page
Prediction Game
No ratings yet
Prediction Game
68 pages
Mini Project
No ratings yet
Mini Project
11 pages
Final Report
No ratings yet
Final Report
42 pages
Advanced Python
0% (1)
Advanced Python
5 pages
Python Notes
No ratings yet
Python Notes
7 pages
PDS Q
No ratings yet
PDS Q
11 pages
Python Basics1
No ratings yet
Python Basics1
6 pages
Python File Handling: Open Open
No ratings yet
Python File Handling: Open Open
4 pages
C+Python Lab Manual
No ratings yet
C+Python Lab Manual
46 pages
Georgia 1
No ratings yet
Georgia 1
1 page
Using Python and Sockets: System Power Supply Programming
No ratings yet
Using Python and Sockets: System Power Supply Programming
6 pages

YEAR: 2024 - 2025: Ipl Data Analysis Using Mysql and Python Connectivy

Uploaded by

YEAR: 2024 - 2025: Ipl Data Analysis Using Mysql and Python Connectivy

Uploaded by

YEAR: 2024 - 2025

IPL DATA ANALYSIS

S.No Contents Page No

3 Hardwares and Softwars used 3

The IPL Data Analysis System is a Python-based application developed to

Intel i5 Core Processer

Windows 10 - 64 bit operating system.

#PRINTING THE DEATILS OF FIRST TWO DELIVERIES IN IPL HISTORY

print('PRINTING DATA TYPES OF ALL FIELDS IN THE CSV FILE')

cursor=mydb.cursor() # making a cursor to excute queries.

print('Which team has played highest number of matches till 2020.')

print('plot of the horizontal bar plot of matches played by individual teams')

print('WINNING RATIO OF TEAMS')

print('Most IPL centuries by a player')

print('TOP 5 RUN SCORERS FROM EACH TEAM')

print('plot a bar chart over player runs')

print('Adding one extra column in above dataframe to make easy x-axis.')

print('plotting the graph of above data')

import matplotlib.pyplot as plt

You might also like