0% found this document useful (0 votes)
144 views

Project Python-1

The document describes a dataset containing information on teams that have participated in La Liga, the top professional football division in Spain. It includes attributes like goals scored/conceded, seasons qualified, best position, and more. The tasks involve analyzing patterns in the data like the most consistent team, highest goal difference, and best ranking by grouping and calculating various statistics.

Uploaded by

Mohd Arquam Khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
144 views

Project Python-1

The document describes a dataset containing information on teams that have participated in La Liga, the top professional football division in Spain. It includes attributes like goals scored/conceded, seasons qualified, best position, and more. The tasks involve analyzing patterns in the data like the most consistent team, highest goal difference, and best ranking by grouping and calculating various statistics.

Uploaded by

Mohd Arquam Khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Python

Domain: Sports

Context:

La Liga is the men's top professional football division of the Spanish football league system. The
dataset contains information on all the teams that have participated in all the past tournaments. It has
data about how many goals each team scored, conceded, how many times they came within the first 6
positions, how many seasons they have qualified, their best position in the past, etc.

Data Description:

Laliga.csv - The data set contains information on all the teams so far participated in all the past
tournaments

Attribute Information:

Pos - Position in among the list of all teams

Team Seasons - how many seasons team has played so far

Points - total number of points of the team

GamesPlayed- total number of games played so far

GamesWon- total number of games won so far

GamesDrawn- total number of games drawn so far

GamesLost- total number of games lost so far

GoalsFor- total number of goals by the team

GoalsAgains - total number of goals against the team

Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited. 1
Champion- total number of times it team is a champion

Runner-up- total number of times it team is a runner-up

Third / Fourth/ Fifth/ Sixt- total number of times it team came in a third/fourth/fifth/sixth position

Debut - debut year

BestPosition- best position of the team

Objective:
Using Python functions and we want to come up with metrics which can be used to gauge the winning
team in the upcoming La Liga cup (Football tournament).
Also we want to analyze a few patterns like which team has been most consistent across seasons. Which
team has the highest number of goal difference. Which team has the best ranking.

Steps and tasks:


1. Read the data set and replace dashes with 0 to make sure you can perform arithmetic operations on
the data. And check the distribution for the ‘Best Position’ and report the top position (7 points)
2. Print all the teams which have started playing between 1930-1980 using “Debut” column(Include
year 1930 only) (6 points)
3. Print the list of teams which came Top 5 in terms of points (5 points)
4. Write a function with the name “Goal_diff_count” which should return all the teams with their Goal
Differences. (5 points)
Goal_diff_count = GoalsFor - GoalsAgainst
5. Using the same function, find the team which has a maximum and minimum goal difference. (5
points)

6. Create a new column with the name “Winning Percent” and append it to the data set (7 points)

Percentage of Winning = (GamesWon / GamesPlayed)*100

If there are any numerical error, replace it with 0%

7. Print the top 5 teams which have the highest Winning percentage (5 points)

Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited. 2
8. Group teams based on their “Best position” and print the sum of their points for all positions (10
points)

Eg: Best Position Points

1 25000

2 7000

Learning Outcomes:

● Exploratory Data Analysis

● Practising statistics using Python

References:

● Applications of Data science in the sports domain

Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited. 3

You might also like