Capstone Project - Python Fundamentals
Capstone Project - Python Fundamentals
Python Fundamentals
Dataset Explanations
Attributes
1) id - Assigned number for Project head who will be in charge of the
project.
2) name - person handling the project.
3) gender – Male(M), Female(F)
4) city - locations of the project
5) age - number of years the project will be active.
6) status - status of the project
7) designation level - position of the project head
o excessive failures indicate designation grades to reduce.
o a person with a good reputation means a very high chance to
increase his designation.
Designation scale -
a) 1-highest
b) 2, 3 mid positions and 4 being the least
c) If anyone crosses 4 then he loses eligibility
for heading the project.
Dataframes
Task 1
There are three different tables as given above. Please make three dataframe in python
and save them as three .csv files. From Task 2 to Task 10, use the saved .csv files only.
Task 2
The cost column in the dataframe “Project” has some missing values. Your task is to
compute these missing values. Replace the missing values by running average. You
should use the “For” loop for this task.
Task 3
Split the name column in the Employee dataframe into two new columns “First Name”,
and “Last Name” and remove the older “name” column.
Task 4
Join all three dataframes in one single dataframe. Name it “Final”
Task 5
Add a new bonus column in the Final dataframe. Give a 5% bonus concerning project
cost only to employees who have finished the projects.
Task 6
Demote the designation level by 1, whose projects have status “fail”. Delete the
employees record whose designation level is above 4.
Task 7
Add “Mr.” and “Mrs.” to the first name column and drop the gender column.
Task 8
Promote designation level by 1 for the employees whose age is more than 29 years using
IF condition.
Task 9
Add the cost of all projects for each Employee and save it in new dataframe
“TotalProjCost” with three columns ID, First Name, and Total cost
Task 10
Print all the employee details whose city name contains the letter “o” in it.
Performance Evaluation
You need to complete all the given tasks. You should use the various functions from
Python libraries NumPy and Pandas to complete the assigned tasks. The goal of the
capstone project is to help us better understand your skills and give us an idea of how
you approach tasks relevant to data science. The capstone is a means for us to evaluate
what you have learned so far and are you ready for real-world challenges or not.
Requirements or Deliverables
• Submit .ipynb file where all your code and outputs will be residing.
• Each line must have comments.