DATASCIENCE

The document outlines a series of practical exercises focused on data science using Python and libraries like Pandas and NumPy. It includes tasks such as creating dataframes, performing statistical analysis, handling missing values, and generating various types of plots. Additionally, it covers operations on datasets, including importing, cleaning, and visualizing data.

Uploaded by

jesaboc231

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views3 pages

DATASCIENCE

Uploaded by

jesaboc231

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Subject:Foundation Of DataScience

Following questions for the practical

1. Write a Python program to create a dataframe containing columns name, age
and percentage. Add 10 rows to the dataframe. View the dataframe.
2. Write a Python program to print the shape, number of rows-columns, data
types, feature names and the description of the data
3. Write a Python program to view basic statistical details of the data.
4. Write a Python program to Add 5 rows with duplicate values and missing
values. Add a column ‘remarks’ with empty values. Display the data
5. Write a Python program to get the number of observations, missing values
and duplicate values.
6. Write a Python program to drop ‘remarks’ column from the dataframe. Also
drop all null and empty values. Print the modified data.
7. Write a Python program to generate a line plot of name vs percentage
8. Write a Python program to generate a scatter plot of name vs percentage.
9. Write a Python program to find the maximum and minimum value of a
given flattened array.
Expected Output:
Original flattened array: [[0 1] [2 3]]
Maximum value of the above flattened array: 3
Minimum value of the above flattened array: 0
10.Write a python program to compute Euclidian Distance between two data
points in a dataset. [Hint: Use linalgo.norm function from NumPy]
11.Create one dataframe of data values. Find out mean, range and IQR for this
data.
12.Write a python program to compute sum of Manhattan distance between all
pairs of points.
13.Write a NumPy program to compute the histogram of nums against the bins.
Sample Output:
nums: [0.5 0.7 1. 1.2 1.3 2.1]
bins: [0 1 2 3]
Result: (array([2, 3, 1], dtype=int64), array([0, 1, 2, 3]))
14.Create a dataframe for students’ information such name, graduation
percentage and age. Display average age of students, average of graduation
percentage. And, also describe all basic statistics of data. (Hint: use
describe()).
15.Import Dataset and do the followings: a) Describing the dataset b) Shape of
the dataset c) Display first 3 rows from dataset.
16.Handling Missing Value: a) Replace missing value of salary,age column
with mean of that column.
17.Data.csv have two categorical column (the country column, and the
purchased column). a. Apply OneHot coding on Country column. b. Apply
Label encoding on purchased column.
18.Generate a random array of 50 integers and display them using a line chart,
scatter plot, histogram and box plot. Apply appropriate color, labels and
styling options.
19.Add two outliers to the above data and display the box plot.
20.Create two lists, one representing subject names and the other representing
marks obtained in those subjects. Display the data in a pie chart and bar
chart.
21.Write a Python program to create a Bar plot to get the frequency of the three
species of the Iris data.
22.Write a Python program to create a Pie plot to get the frequency of the three
species of the Iris data.
23.Write a Python program to create a histogram of the three species of the Iris
data.
24.Write a Python program to create a graph to find relationship between the
petal length and petal width.
25.Download any dataset from UCI (do not repeat it from set B). Read this csv
file using read_csv() function. Describe the dataset using appropriate
function. Display mean value of numeric attribute. Check any data values
are missing or not.
26. Download nursery dataset from UCI. Split dataset on any one categorical
attribute. Compare the means of each split. (Use groupby)
27. Create one dataframe with 5 subjects and marks of 10 students for each
subject. Find arithmetic mean, geometric mean, and harmonic mean.
28.Download the heights and weights dataset and load the dataset from a given
csv file into a dataframe. Print the first, last 10 rows and random 20 rows.
(https://www.kaggle.com/burnoutminer/heightsand-weights-dataset)
29.Write a Python program to find the shape, size, datatypes of the dataframe
object.
30.Write a Python program to view basic statistical details of the data.
31.Write a Python program to get the number of observations, missing values
and nan values.
32.Write a Python program to add a column to the dataframe “BMI” which is
calculated as : weight/height2

Statistics Grade 12
No ratings yet
Statistics Grade 12
24 pages
XII IP Board Practical File
No ratings yet
XII IP Board Practical File
7 pages
Wts 11 & 12 Data Handling
No ratings yet
Wts 11 & 12 Data Handling
48 pages
DSML Problem Statements
No ratings yet
DSML Problem Statements
8 pages
IPFILE
No ratings yet
IPFILE
26 pages
XII Practical List IP 2022-23 Project.
No ratings yet
XII Practical List IP 2022-23 Project.
6 pages
Khadeeja - DS - PRACTICAL 4
No ratings yet
Khadeeja - DS - PRACTICAL 4
24 pages
English Programs
No ratings yet
English Programs
4 pages
DSBDA LAB - MANUAL (Autosaved) - Sd1-Converted-1-2
100% (1)
DSBDA LAB - MANUAL (Autosaved) - Sd1-Converted-1-2
256 pages
DSBDA Lab Plan
No ratings yet
DSBDA Lab Plan
5 pages
Document From Igd - Rabichandra
No ratings yet
Document From Igd - Rabichandra
4 pages
Xii Program List Ip 2024 25 KVJK
No ratings yet
Xii Program List Ip 2024 25 KVJK
3 pages
XII - IP - Practical - List 2023-24
No ratings yet
XII - IP - Practical - List 2023-24
4 pages
Practical List 2022-23
100% (1)
Practical List 2022-23
4 pages
CS3361 Set1
No ratings yet
CS3361 Set1
5 pages
Data Science
No ratings yet
Data Science
3 pages
XII IP Practical File 1 Complete
No ratings yet
XII IP Practical File 1 Complete
38 pages
CS3361 Set2
No ratings yet
CS3361 Set2
6 pages
2025 GRADE 12 MLIT INVESTIGATION Edited
No ratings yet
2025 GRADE 12 MLIT INVESTIGATION Edited
7 pages
2023 Data Analysis and Visualization Using Python
100% (2)
2023 Data Analysis and Visualization Using Python
9 pages
Practical List Ip
100% (1)
Practical List Ip
10 pages
XII IP Practical File 1 Complete
No ratings yet
XII IP Practical File 1 Complete
38 pages
wst01 01 Rms 20240815
No ratings yet
wst01 01 Rms 20240815
12 pages
Ip Practical File Akash Tripathi
No ratings yet
Ip Practical File Akash Tripathi
50 pages
Data Science Manual
No ratings yet
Data Science Manual
155 pages
21hcs4108 Davpracticals
No ratings yet
21hcs4108 Davpracticals
29 pages
Cs3361 Set3 Fds Anna University
No ratings yet
Cs3361 Set3 Fds Anna University
3 pages
Python Practical Questions@Subas
No ratings yet
Python Practical Questions@Subas
7 pages
Report File Questions
No ratings yet
Report File Questions
3 pages
IP Project 12A
No ratings yet
IP Project 12A
39 pages
Python 1
No ratings yet
Python 1
16 pages
CS 3361 SET 1 QN Only
No ratings yet
CS 3361 SET 1 QN Only
4 pages
IDS Syllabus
No ratings yet
IDS Syllabus
5 pages
SL-III Lab Manual
No ratings yet
SL-III Lab Manual
74 pages
Lab Questions IDSE 2024
No ratings yet
Lab Questions IDSE 2024
7 pages
DSBDA Sample Problem Statements
No ratings yet
DSBDA Sample Problem Statements
3 pages
Practical File - X (AI)
No ratings yet
Practical File - X (AI)
4 pages
PR List Dsbda
No ratings yet
PR List Dsbda
2 pages
Class 12 Practical Assignment Questions
No ratings yet
Class 12 Practical Assignment Questions
3 pages
XII IP Practical File 1 Complete
No ratings yet
XII IP Practical File 1 Complete
38 pages
ML Lab Manual
No ratings yet
ML Lab Manual
36 pages
XII IP Practical File 1 Complete
No ratings yet
XII IP Practical File 1 Complete
37 pages
Informatics Practices Practical List22-2323
100% (1)
Informatics Practices Practical List22-2323
7 pages
CLASS 10 PRACTICAL FILE-format
100% (1)
CLASS 10 PRACTICAL FILE-format
31 pages
Datascience
No ratings yet
Datascience
8 pages
Worksheet-1 (Python)
No ratings yet
Worksheet-1 (Python)
9 pages
Assignment 1
100% (1)
Assignment 1
16 pages
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
No ratings yet
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
16 pages
Manishadav
No ratings yet
Manishadav
27 pages
UBD Draft With Feedback
No ratings yet
UBD Draft With Feedback
2 pages
IP ASSIGNMENT (Class XII)
No ratings yet
IP ASSIGNMENT (Class XII)
4 pages
Term-I Practical Question Paper 2022-2023
No ratings yet
Term-I Practical Question Paper 2022-2023
8 pages
My Practical File
100% (1)
My Practical File
40 pages
5.3.4 Journal - Describing Distributions (Journal)
No ratings yet
5.3.4 Journal - Describing Distributions (Journal)
6 pages
Pert Q Python
No ratings yet
Pert Q Python
3 pages
DSBDA Manual
No ratings yet
DSBDA Manual
76 pages
(Ebook PDF) Statistics For Business Economics 13th Edition by David PDF Download
100% (2)
(Ebook PDF) Statistics For Business Economics 13th Edition by David PDF Download
50 pages
Pracfile Program Index XII-C IP 2023-24
No ratings yet
Pracfile Program Index XII-C IP 2023-24
6 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Ip Practical File GV
No ratings yet
Ip Practical File GV
46 pages
I.P Record File List
No ratings yet
I.P Record File List
2 pages
Python Practice Questions
No ratings yet
Python Practice Questions
5 pages
Syllabus AIML
No ratings yet
Syllabus AIML
14 pages
XII IP Practical List 2023-24
No ratings yet
XII IP Practical List 2023-24
4 pages
Data Analysis Lab - Final - 23-24
No ratings yet
Data Analysis Lab - Final - 23-24
11 pages
Assignmeant-1 Sharan S
No ratings yet
Assignmeant-1 Sharan S
20 pages
Stats Formulas
No ratings yet
Stats Formulas
54 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Business Statistics NOtes
No ratings yet
Business Statistics NOtes
46 pages
TSA Theory Part1
No ratings yet
TSA Theory Part1
98 pages
MATH 533 Project 1
No ratings yet
MATH 533 Project 1
15 pages
Gina Wilson Probability Worksheet 7
No ratings yet
Gina Wilson Probability Worksheet 7
63 pages
Complete Practical File of Class XII-IP 2020-21
No ratings yet
Complete Practical File of Class XII-IP 2020-21
80 pages
STA215 Test F 2014 A Solutions
No ratings yet
STA215 Test F 2014 A Solutions
6 pages
Chart Types and Their Uses
No ratings yet
Chart Types and Their Uses
13 pages
COM508 Reviewer
No ratings yet
COM508 Reviewer
26 pages
BUS270 Assignment 2
No ratings yet
BUS270 Assignment 2
28 pages
Calculation of External Trade Indices Based
No ratings yet
Calculation of External Trade Indices Based
48 pages
Q4 Module 5 III - Finding Answer - Quantitative
No ratings yet
Q4 Module 5 III - Finding Answer - Quantitative
31 pages
Data Science 6th Sem CS Engineesring Questions
No ratings yet
Data Science 6th Sem CS Engineesring Questions
35 pages
ML Labmanual
No ratings yet
ML Labmanual
33 pages
Food Hub Businees Report
No ratings yet
Food Hub Businees Report
15 pages
Module6 Statistical Tools
No ratings yet
Module6 Statistical Tools
29 pages
Data Cleaning
No ratings yet
Data Cleaning
28 pages
wst01 01 Que 20231013
No ratings yet
wst01 01 Que 20231013
13 pages
Math Lit 2020 P1
No ratings yet
Math Lit 2020 P1
20 pages
Assessment Task For Lesson 3.2: X X W W W W X X
No ratings yet
Assessment Task For Lesson 3.2: X X W W W W X X
2 pages
Assignment 3 - Exploratory Data Analysis
No ratings yet
Assignment 3 - Exploratory Data Analysis
2 pages
Mdm4u 4
No ratings yet
Mdm4u 4
2 pages
C Programs To Become Expert In Programming
From Everand
C Programs To Become Expert In Programming
Shubham Yadav
No ratings yet

DATASCIENCE

Uploaded by

DATASCIENCE

Uploaded by

Subject:Foundation Of DataScience

Following questions for the practical

You might also like