0% found this document useful (0 votes)
37 views

Python Lab 2

This document provides instructions for performing operations on three different datasets using Python libraries like Numpy, Pandas, Matplotlib and Seaborn. The datasets include a student database, ecommerce purchase data, and employee salary data. Some key tasks outlined include displaying top and bottom rows, checking shapes and dtypes, finding null values and statistics, filtering rows based on conditions, and calculating aggregates.

Uploaded by

bedima2615
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
37 views

Python Lab 2

This document provides instructions for performing operations on three different datasets using Python libraries like Numpy, Pandas, Matplotlib and Seaborn. The datasets include a student database, ecommerce purchase data, and employee salary data. Some key tasks outlined include displaying top and bottom rows, checking shapes and dtypes, finding null values and statistics, filtering rows based on conditions, and calculating aggregates.

Uploaded by

bedima2615
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

Numpy ,Pandas, Matplotlib and Seaborn

1. Create Dictionary for student database perform :

1. Display top 3 rows of dataset.


2. Check last 3 rows of dataset.

3. Find Shape of Dataset.

4. Get information about dataset like total no. of rows, total no of column,
datatype of each column and memory requirment
5. Check null values in dataset, overall statics about the dataframe

6. Find total no of students having marks between 50 to 100 using between


method .
2. Perform following operation on Ecommerce purchase website:

Download database from


https://www.kaggle.com/datasets/utkarsharya/ecommerce-purchases

1. Display Top 10 Rows of The Dataset


2. Check Last 10 Rows of The Dataset

3. Check Datatype of Each Column


4. Check null values in the dataset

5. How many rows and columns are there in our Dataset?

6. Highest and Lowest Purchase Prices.


7. Average Purchase Price

8. How many people have French 'fr' as their Language?

9. Job Title Contains Engineer


10. Find The Email of the person with the following IP Address:
132.207.160.22

11. How many People have Mastercard as their Credit Card Provider and
made a purchase above 50?

12. Find the email of the person with the following Credit Card Number:
4664825258997302

13. How many people purchase during the AM and how many people
purchase during PM?
14. How many people have a credit card that expires in 2020?

15. What are the top 5 most popular email providers (e.g. gmail.com,
yahoo.com, etc...)
3. Perform following operation on Employee salary dataset .

Download database from: https://www.kaggle.com/datasets/kaggle/sf-salaries

1. Display Top 10 Rows of The Dataset


2. Check Last 10 Rows of The Dataset

3. Find Shape of Our Dataset (Number of Rows And Number of Columns)


4. Getting Information About Our Dataset Like Total Number Rows, Total
Number of Columns,Datatypes of Each Column And Memory Requirement

5. Check Null Values In The Dataset

6. Drop ID, Notes, Agency, and Status Columns

7. Get Overall Statistics About The Dataframe

8. Find Occurrence of The Employee Names (Top 5)


9. Find The Number of Unique Job Titles

10.Total Number of Job Titles Contain Captain

12. Find Minimum, Maximum, and Average BasePay

13. Replace 'Not Provided' in EmployeeName' Column to NaN


14. Drop The Rows Having 3 Missing Values

15. Find Job Title of ALBERT PARDINI

16. How Much ALBERT PARDINI Make (Include Benefits)?


18.Find Average TotalPay of All Employee Per Year

19. Find Average `BasePay of All Employee Per JobTitle

You might also like