Project Report
Subject:
Data Exploration & Visualization using Python I
GitHub Link: https://github.com/045005/045005_IPL-23.git
2. Introduction
For all the code used in this analysis, kindly refer to the .ipynb file in my GitHub
repository.
Here, I scraped data from two different URLs:
https://www.news18.com/cricketnext/ipl-2023/orange-cap-holder.html
https://www.news18.com/cricketnext/ipl-2023/purple-cap-holder.html
The first URL contains the data for all the batsmen who scored runs in IPL 2023.
Similarly, the second URL contains the data for all the bowlers who took wickets in IPL 2023.
First, I scraped the batsmen data and saved it in a dataframe named df_orange.
Then I scraped the bowlers' data and saved it in a dataframe named df_purple.
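The scraping step can be illustrated with a minimal sketch that uses only the Python standard library. It is not the notebook's actual code: the class TableParser, the function parse_table, the column names and the sample markup are all hypothetical, and the real pages may be structured differently.

```python
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the text of every <tr> row as a list of cell strings."""

    def __init__(self):
        super().__init__()
        self.rows = []     # completed rows
        self._row = None   # row currently being built
        self._cell = None  # text fragments of the current cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def parse_table(html):
    """Return one dict per data row, keyed by the header row."""
    parser = TableParser()
    parser.feed(html)
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


# Made-up markup standing in for a downloaded page (not real IPL data).
sample_html = """
<table>
  <tr><th>Player</th><th>Team</th><th>Runs</th></tr>
  <tr><td>Player 1</td><td>Team A</td><td>100</td></tr>
  <tr><td>Player 2</td><td>Team B</td><td>80</td></tr>
</table>
"""

rows = parse_table(sample_html)
```

A list of dicts like rows can be passed directly to the pandas DataFrame constructor to obtain a dataframe such as df_orange or df_purple.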
3. Analyzing the batsmen list
3.e. Analysis based on Strike Rate and total average by each team.
Here, I calculated the average strike rate and the total average for each team. The average
strike rate is the sum of the strike rates of all the batsmen in a team divided by the number
of batsmen in that team. Similarly, the total average is the sum of the batting averages of
all the batsmen in a team divided by the number of batsmen in that team.
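The two formulas above can be sketched in plain Python as follows. The rows, the helper team_means and the column names Team, SR and AVG are assumptions made for illustration, and the values are invented, not IPL data.

```python
from collections import defaultdict
from statistics import mean

# Invented rows; "Team", "SR" and "AVG" are assumed column names.
batsmen = [
    {"Player": "p1", "Team": "A", "SR": 150.0, "AVG": 40.0},
    {"Player": "p2", "Team": "A", "SR": 130.0, "AVG": 30.0},
    {"Player": "p3", "Team": "B", "SR": 120.0, "AVG": 25.0},
]


def team_means(rows, column):
    """Sum of `column` over a team's batsmen divided by the number of batsmen."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["Team"]].append(row[column])
    return {team: mean(values) for team, values in grouped.items()}


avg_strike_rate = team_means(batsmen, "SR")   # {"A": 140.0, "B": 120.0}
total_average = team_means(batsmen, "AVG")    # {"A": 35.0, "B": 25.0}
```

On a dataframe with the same assumed column names, df_orange.groupby("Team")["SR"].mean() would give the same result. Note that this is an unweighted mean, so a batsman who faced only a few balls counts as much as one who batted all season.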
4. Analyzing the bowlers' data
I grouped the wickets taken into specific ranges and counted the number of bowlers
in each range.
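One way to do this kind of binning is sketched below. The report does not state which ranges were used, so the width of 5, the helper wicket_range and the wicket counts are assumptions for illustration only.

```python
from collections import Counter


def wicket_range(wickets, width=5):
    """Label the range a wicket count falls into, e.g. 7 -> '5-9' for width 5."""
    low = (wickets // width) * width
    return f"{low}-{low + width - 1}"


# Invented wicket counts, one per bowler.
wickets_taken = [3, 7, 12, 14, 21, 4]

# Number of bowlers in each range.
range_counts = Counter(wicket_range(w) for w in wickets_taken)
```

With pandas, pd.cut on the wickets column followed by value_counts() is a common alternative.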
Then, I found the total number of wickets taken by each team in the 2023 season.
From this data, we can say that GT is the most effective team at taking wickets.
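The team totals described above can be sketched as follows. The rows and the column names Team and WKTS are assumptions, and the numbers are invented rather than taken from the scraped data.

```python
from collections import Counter

# Invented rows; "Team" and "WKTS" are assumed column names.
bowlers = [
    {"Player": "b1", "Team": "A", "WKTS": 10},
    {"Player": "b2", "Team": "A", "WKTS": 6},
    {"Player": "b3", "Team": "B", "WKTS": 12},
]

# Add each bowler's wickets to the running total of their team.
team_wickets = Counter()
for row in bowlers:
    team_wickets[row["Team"]] += row["WKTS"]

# Teams ordered from most to fewest wickets.
ranking = team_wickets.most_common()
```

The first entry of ranking is the team with the most wickets, which is the comparison used to single out one team.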
As per the result, LSG has the most efficient bowlers in terms of runs conceded.
4.d. Analysis using 3-wicket hauls, 5-wicket hauls, and maiden overs.
Here, I calculated the total number of 3-wicket hauls, 5-wicket hauls, and maiden overs for
each team separately and compiled the output in a single line graph.
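A sketch of how the three per-team totals could be prepared for a single line graph is given below. MDNS is the abbreviation used in this report, while the column names 3W and 5W and all the values are assumptions for illustration.

```python
from collections import defaultdict

# Assumed column names for 3-wicket hauls, 5-wicket hauls and maiden overs.
METRICS = ("3W", "5W", "MDNS")

# Invented rows, one per bowler.
bowlers = [
    {"Team": "A", "3W": 1, "5W": 0, "MDNS": 2},
    {"Team": "A", "3W": 2, "5W": 1, "MDNS": 0},
    {"Team": "B", "3W": 0, "5W": 0, "MDNS": 1},
]

# Per-team totals for every metric.
totals = defaultdict(lambda: dict.fromkeys(METRICS, 0))
for row in bowlers:
    for metric in METRICS:
        totals[row["Team"]][metric] += row[metric]

# One list per metric, ordered by team, ready to draw as one line each.
teams = sorted(totals)
series = {metric: [totals[team][metric] for team in teams] for metric in METRICS}
```

Each list in series can then be drawn on the same axes, for example with one matplotlib plt.plot(teams, series[metric], label=metric) call per metric.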
5. Managerial Insights
Batsmen Implications:
Identify and retain batsmen who consistently perform well, especially those who score
the most runs.
Batsmen with a high number of boundaries (4s and 6s) are valuable. They can take
advantage of field restrictions and powerplays to score quickly.
Assess the distribution of top batsmen among different teams. This information can
help in strategic bidding during auctions to ensure a balanced team.
Rank the teams on several factors, including runs, power hitting (most boundaries),
strike rate, and total average.
Bowlers Implications:
Identify and retain bowlers who take a significant number of wickets (Purple Cap
holders). Wicket-takers can put pressure on the opposition and change the course of
the game.
Balance wicket-taking with the economy rate (EC). Teams that can take wickets
without leaking too many runs are crucial, especially in T20 cricket where runs are at
a premium.
Similarly, a bowler's average (AVG) is also important for the team.
In T20 cricket, it is important to score as many runs as possible while batting.
Restricting the runs conceded while bowling is equally important. So, we can
categorize the teams based on these factors too.
Teams that can deliver maiden overs (MDNS) are essential to build pressure on the
batsmen.