0% found this document useful (0 votes)

7 views7 pages

FDS Slips Solution

The document contains a series of Python programming tasks focused on data analysis and visualization using various datasets such as Iris, wine quality, and height-weight. It includes instructions for creating pie charts, handling missing values, generating statistical summaries, and applying data encoding techniques. Additionally, it covers generating plots like box plots, line charts, and histograms, as well as standardizing data and calculating distances.

Uploaded by

j03410581

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views7 pages

FDS Slips Solution

Uploaded by

j03410581

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

SLIP-1 / SLIP-11

Q.2 A) Write a Python program to create a Pie plot to get the frequency of the three
species of the Iris data (Use iris.csv)

from sklearn.datasets import load_iris

import pandas as pd
import matplotlib.pyplot as plt
iris_data pd.read_csv('Iris.csv')
species_counts iris_data['Species'].value_counts()
plt.ﬁgure(ﬁgsize=(8, 8))
plt.pie(species_counts, labels=species_counts.index, autopct='%1.1f%%',
startangle=90, colors=['green', 'blue', 'orange'])
plt.title('Iris Species Distribution')
plt.axis('equal')
plt.show()

B) Write a Python program to view basic statistical details of the data.(Use wineequality-
red.csv)

import pandas as pd
wine_data = pd.read_csv('winequality-red.csv')
statistical_details wine_data.describe()
print(statistical_details)

SLIP-2 / SLIP-6

Q.2 A) Write a Python program for Handling Missing Value. Replace missing value of
salary, age column with mean of that column.(Use Data.csv ﬁle).

import pandas as pd
data pd.read_csv('Data.csv')
print("Original Data:")
print(data)
mean_salary = data['salary'].mean()
mean_age data['age'].mean()
data['salary').ﬁllna(mean_salary, inplace=True)
data['age'].ﬁllna(mean_age, inplace=True)
print("\nData after handling missing values:")
print(data)

B) Write a Python program to generate a line plot of name Vs salary

import pandas as pd
import matplotlib.pyplot as plt
data = pd.read_csv('Datal.csv')
plt.ﬁgure(ﬁgsize=(10, 6))
plt.plot(data['name'], data['salary'), linestyle='dotted', marker='0')
plt.title('Name vs Salary')
plt.xlabel('Name')
plt.ylabel('Salary')
plt.xticks (rotation=45)
plt.show()

C) Download the heights and weights dataset and load the dataset froma given csv ﬁle
into a dataframe. Print the f irst, last 10 rows and random 20 rows also display shape of
the dataset.

import pandas as pd
df = pd.read_csv('height_weight.csv')
print("First 10 rows:")
print(df.head(10))
print("\nLast 10 rows:")
print(df.tail(10))
print("\nRandom 20 rows:")
print(df.sample(20))
print("\nShape of the dataset:")
print(df.shape)

SLIP-3 /SLIP-18

A) Write a Python program to create box plots to see how each feature i.e. Sepal Length,
Sepal Width, Petal Length, Petal Width are distributed across the three species. (Use
iris.csv dataset)

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
iris_data pd.read_csv('iris.csv')
plt.ﬁgure(ﬁgsize=(14, 8))
plt.subplots_adjust(wspace=0.5, hspace=0.5)
plt.subplot(2, 2, 1)
sns.boxplot(x='Species', y='SepalLengthCm', data=iris_data)
plt.title('Sepal Length Distribution')
plt.subplot(2, 2, 2)
sns.boxplot(x='Species', y='SepalWidthCm', data-iris_data)
plt.title('Sepal Width Distribution')
plt.subplot(2, 2, 3)
sns.boxplot(x='Species', y='PetalLengthCm', data=iris_data)
plt.title('Petal Length Distribution')
plt.subplot(2, 2, 4) sns.boxplot(x='Species', y='PetalWidthCm', data=iris_data)
plt.title('Petal Width Distribution')
plt.show()
B) Write a Python program to view basic statistical details of the data (Use Heights and
Weights Dataset)

import pandas as pd
df = pd.read_csv('height_weight.csv')
statistical_details = df.describe()
print (statistical_details)

SLIP-4 / SLIP-5

A) Generate a random array of 50 integers and display them using a line chart, scatter
plot, histogram and box plot. Apply appropriate color, labels and styling options.

import numpy as np
import matplotlib.pyplot as plt
np. random.seed (42)
random_array = np.random.randint(1, 100, 50)
plt. ﬁgure(ﬁgsize=(12, 4))
plt.subplot(1,4,1)
pit.plot(random_array,marker='o', color='blue')
plt.title('Line Chart')
plt.xlabel ('Index")
plt. ylabel ('Value')
plt. subplot(1, 4, 2)
plt. scatter (range(len(random_array)), random_array, color='green', marker=*)
plt. title( Scatter Plot')
plt. xlabel ('Index')
plt.ylabel( 'Value')
plt. subplot (1, 4, 3)
plt.hist(random_array, bins=10, color='orange', edgecolor='black')
plt. title( 'Histogram')
plt.xlabel( 'Value')
plt.ylabel ('Frequency')
plt. subplot(1, 4, 4)
plt.boxplot(random_array, vert=False, widths=0.7, patch_artist=True, boxprops=dict(facecolor='pink'))
plt. title(Box Plot')
plt. xlabel( 'Value')
plt.tight_layout()
plt. show()
B) Write a Python program to print the shape, number of rows-columns, data types,
feature names and the description of the data. (Use User_Data.csv)

import pandas as pd
df = pd.read_csv('user_data.csv')
print("Shape of the data:", df.shape)
print("Number of rows:", df.shape[0])
print("Number of columns:", df.shape[1])
print("\nData types:")
print(df.dtypes)
print("\nFeature names:")
print(df.columns)
print("\nDescription of the data:")
print(df.describe())

SLIP-7

Write a Python program to perform the following tasks :

a. Apply OneHot coding on Country column.
b. Apply Label encoding on purchased column
(Data.csv have two categorical column the country column, and the purchased column)

import pandas as pd
from sklearn.preprocessing import LabelEncoder, OneHotEncoder
df = pd.read_csv('data2.csv')
print("Original Dataset:")
print(df)
df_onehot = pd.get_dummies (df, columns=['Country'], preﬁx='Country')
print("\nDataset after OneHot encoding:")
print(df_onehot)
label_encoder LabelEncoder()
df['Purchased'] = label_encoder.ﬁt_transform(df ['Purchased'])
print("\nDataset after Label encoding:")
print(df)
SLIP-8
Write a program in python to perform following task Standardizing Data (transform them
into a standard Gaussian distribution with a mean of 0 and a standard deviation of 1)
(Use winequality-red.csv)

import pandas as pd
from sklearn.preprocessing import StandardScaler
df pd.read_csv('winequality-red.csv')
print("Original Dataset:")
print(df.head())
features df.drop('quality', axis=1)
scaler StandardScaler()
features_standardized = scaler.ﬁt_transform(features)
df_standardized pd. DataFrame (features_standardized, columns=features.columns)
df_standardized['quality'] = df ['quality']
print("\nDataset after Standardization:")
print(df_standardized.head())

SLIP-9
A) Generate a random array of 50 integers and display them using a line chart, scatter
plot. Apply appropriate color, labels and styling options……………………..
import numpy as np
import matplotlib.pyplot as plt
np.random.seed(42)
random_array np.random.randint(1, 100, 50)
plt.ﬁgure(ﬁgsize=(12, 4))
plt.subplot(1, 4, 1)
plt.plot(random_array, marker='o', color='blue')
plt.title('Line Chart')
plt.xlabel('Index')
plt.ylabel('Value')
plt.subplot(1, 4, 2)
plt.scatter(range (len(random_array)), random_array, color='green', marker='^')
plt.title('Scatter Plot')
plt.xlabel('Index')
plt.ylabel('Value')
plt.subplot(1, 4, 3)
plt.hist(random_array, bins-10, color'orange', edgecolor='black')
plt.title('Histogram')
plt.xlabel('Value')
plt.ylabel('Frequency')
plt.subplot(1, 4, 4)
plt.boxplot(random_array, vert=False, widths=0.7, patch_artist=True, boxprops
dict(facecolor='pink')) plt.title('Box Plot')
plt.xlabel('Value')
plt.tight_layout()
plt.show()
B) Create two lists, one representing subject names and the other representing marks
obtained in those subjects. Display the data in a pie chart.

import matplotlib.pyplot as plt

subjects ['Math', 'English', 'Science', 'History', 'Art']
marks [90, 85, 92, 88, 78]
plt.ﬁgure(ﬁgsize=(8, 8))
plt.pie(marks, labels subjects, autopct='%1.1f%%', startangle-90, colors=["#FF9999',
'#66B2FF', '#99FF99', '#FFCC99"])
plt.title('Subject-wise Marks Distribution')
plt.axis('equal')
plt.show()

C) Write a program in python to perform following task (Use winequality-red.csv ) Import

Dataset and do the followings: a) Describing the dataset b) Shape of the dataset c)
Display ﬁrst 3 rows from dataset

import pandas as pd
df = pd.read_csv('winequality-red.csv')
print("a) Describing the dataset:")
print(df.describe())
print("\nb) Shape of the dataset:")
print(df.shape)
print("\nc) Display ﬁrst 3 rows from the dataset:")
print(df.head(3))

SLIP-10

A) Write a python program to Display column-wise mean, and median for SOCR-
HeightWeight dataset.

import pandas as pd
df= pd.read_csv('height_weight.csv')
print("Column-wise Mean:")
mean_values = df.mean()
print(mean_values)
print("\nColumn-wise Median:")
median_values = df.median()
print(median_values)
B) Write a python program to compute sum of Manhattan distance between all pairs of points .
import itertools
def manhattan_distance (point1, point2):
return abs (point1 [0] point2[0]) + abs (point1 [1] point2[1])
points = [(1, 2), (3, 4), (5, 6), (7, 8)]
total_distance sum(manhattan_distance (point1, point2) for point1, point2 in
itertools.combinations (points, 2))
print("Sum of Manhattan distance between all pairs of points:", total_distance)
SLIP-21
A) Import dataset “iris.csv”. Write a Python program to create a Bar plot to get the
frequency of the three species of the Iris data.

import pandas as pd
import matplotlib.pyplot as plt
iris_data=pd.read_csv('iris.csv')
species_count=iris_data['Species'].value_counts()
plt.ﬁgure(ﬁgsize=(8, 6))
species_counts.plot(kind='bar', color=['skyblue', 'lightgreen', 'coral'])
plt.title('Frequency of Iris Species')
plt.xlabel('Species')
plt.ylabel('Count')
plt.xticks (rotation=0)
plt.show()

B) Write a Python program to create a histogram of the three species of the Iris data.

import pandas as pd
import matplotlib.pyplot as plt
iris_data = pd.read_csv('iris.csv')
plt.ﬁgure(ﬁgsize=(10, 6))
for species in iris_data['Species'].unique():
subset=iris_data[iris_data['Species'] == species]
plt.hist(subset ['SepalLengthCm'], bins=20, alpha=0.5, label=species)
plt.title('Histogram of Sepal Length for Each Iris Species')
plt.xlabel('Sepal Length')
plt.ylabel('Frequency')
plt.legend()
plt.show()

Markem Image Book For Service Engineers 2200 - v3.3
100% (6)
Markem Image Book For Service Engineers 2200 - v3.3
535 pages
V9-V9S IPTV Function User Guide For V9V9S-V1.0
No ratings yet
V9-V9S IPTV Function User Guide For V9V9S-V1.0
14 pages
Optimize Your Mobile Game Performance: Unity For Games Unity 2020 Lts Edition - E-Book
No ratings yet
Optimize Your Mobile Game Performance: Unity For Games Unity 2020 Lts Edition - E-Book
52 pages
New Latex PDF
No ratings yet
New Latex PDF
55 pages
Computer Project Hardy Cross Spring 2021
No ratings yet
Computer Project Hardy Cross Spring 2021
3 pages
Software Development With Visual Basic B.com Ca
No ratings yet
Software Development With Visual Basic B.com Ca
122 pages
Installation Process of Server and Its Roles
No ratings yet
Installation Process of Server and Its Roles
52 pages
Module 13 - Synchronous Replication of Volumes
No ratings yet
Module 13 - Synchronous Replication of Volumes
53 pages
Collabora Online Installation Guide
No ratings yet
Collabora Online Installation Guide
25 pages
DApp MVP Solution Project Proposal
No ratings yet
DApp MVP Solution Project Proposal
11 pages
Client Guide To Upwork
No ratings yet
Client Guide To Upwork
30 pages
Principles of Api Developement
No ratings yet
Principles of Api Developement
18 pages
Solved WT - DS
No ratings yet
Solved WT - DS
123 pages
Pcil E22b
No ratings yet
Pcil E22b
6 pages
Data Structures Unit 1
No ratings yet
Data Structures Unit 1
96 pages
Internet Vs WWW
No ratings yet
Internet Vs WWW
5 pages
Ai Tools and Applications-Lab
No ratings yet
Ai Tools and Applications-Lab
33 pages
Session 16M-Day 2 Review Session
No ratings yet
Session 16M-Day 2 Review Session
7 pages
Cluster Analysis-Unit 4
No ratings yet
Cluster Analysis-Unit 4
7 pages
Array Insert
No ratings yet
Array Insert
2 pages
PML Ex3
No ratings yet
PML Ex3
20 pages
DS Slips Solutions Sem 5
No ratings yet
DS Slips Solutions Sem 5
23 pages
Abhiml ML File
No ratings yet
Abhiml ML File
74 pages
Industrial Application of Microcontrollers in Agriculture
No ratings yet
Industrial Application of Microcontrollers in Agriculture
2 pages
Data Science
No ratings yet
Data Science
18 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
32 pages
Fds Mannual
No ratings yet
Fds Mannual
39 pages
Introduction To Data Science
No ratings yet
Introduction To Data Science
22 pages
AD3411 - 1 To 5
No ratings yet
AD3411 - 1 To 5
11 pages
Vaishnavi Resume
No ratings yet
Vaishnavi Resume
1 page
Data Visualization With Python
No ratings yet
Data Visualization With Python
34 pages
Fds Slips
No ratings yet
Fds Slips
6 pages
Practical File Question 28.09.2022
No ratings yet
Practical File Question 28.09.2022
15 pages
SQL DBA Curriculum OT V1
No ratings yet
SQL DBA Curriculum OT V1
8 pages
Python Slips
No ratings yet
Python Slips
9 pages
PostgreSQL Compare High Availability Frameworks Infographic ScaleGrid DBaaS
No ratings yet
PostgreSQL Compare High Availability Frameworks Infographic ScaleGrid DBaaS
1 page
21hcs4108 Davpracticals
No ratings yet
21hcs4108 Davpracticals
29 pages
FP Sage Business Cloud Paie
No ratings yet
FP Sage Business Cloud Paie
3 pages
WT 1 and FDS Practical Slips Solution Form WWW - Dailycover.live
No ratings yet
WT 1 and FDS Practical Slips Solution Form WWW - Dailycover.live
91 pages
2023 Data Analysis and Visualization Using Python
100% (2)
2023 Data Analysis and Visualization Using Python
9 pages
BDA File
No ratings yet
BDA File
26 pages
Data Science Practical Book - Ipynb
No ratings yet
Data Science Practical Book - Ipynb
21 pages
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
No ratings yet
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
16 pages
Data Preprocessing Assignments
No ratings yet
Data Preprocessing Assignments
6 pages
Lab Manual (DAV)
No ratings yet
Lab Manual (DAV)
33 pages
List of Experiment - Data Analysis Lab
No ratings yet
List of Experiment - Data Analysis Lab
2 pages
Lecture 4
No ratings yet
Lecture 4
60 pages
DS - Lab Manual
No ratings yet
DS - Lab Manual
31 pages
AECOM Case Study
No ratings yet
AECOM Case Study
6 pages
DAV Practical File 234003
No ratings yet
DAV Practical File 234003
14 pages
Manishadav
No ratings yet
Manishadav
27 pages
Data Visualization - 1 by Matplot Lib
No ratings yet
Data Visualization - 1 by Matplot Lib
19 pages
Python Practical Questions@Subas
No ratings yet
Python Practical Questions@Subas
7 pages
23HCS4142 PDF
No ratings yet
23HCS4142 PDF
24 pages
PCR 139 Property Level Doc Tab Vendor Cabinet More Add New Revision
No ratings yet
PCR 139 Property Level Doc Tab Vendor Cabinet More Add New Revision
9 pages
DA Lab ANSWERS
No ratings yet
DA Lab ANSWERS
10 pages
Grade 10 AI Practicals DATA SCIENCE-Solution
No ratings yet
Grade 10 AI Practicals DATA SCIENCE-Solution
6 pages
DSA Lab Manual Pgms - fINAL
No ratings yet
DSA Lab Manual Pgms - fINAL
34 pages
DP Prog
No ratings yet
DP Prog
10 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Data Science Practicals
No ratings yet
Data Science Practicals
47 pages
Nan Mudhalvan Project
No ratings yet
Nan Mudhalvan Project
4 pages
Advanced PHP Full Notes
100% (1)
Advanced PHP Full Notes
3 pages
ML Lab
No ratings yet
ML Lab
14 pages
Batch1 Ds
No ratings yet
Batch1 Ds
15 pages
DVA Lab Manual
No ratings yet
DVA Lab Manual
20 pages
GE Practical Sem 2
No ratings yet
GE Practical Sem 2
28 pages
Data Science Algorithmen Master - 02 Data Handling
No ratings yet
Data Science Algorithmen Master - 02 Data Handling
76 pages
Ai Class 12 Practical
No ratings yet
Ai Class 12 Practical
21 pages
Assembly Tips
No ratings yet
Assembly Tips
18 pages
Mayank Chaudhary DEV Practicals
No ratings yet
Mayank Chaudhary DEV Practicals
14 pages
ML (Sudhanshu)
No ratings yet
ML (Sudhanshu)
24 pages
Week2 Lab
No ratings yet
Week2 Lab
8 pages
Guidelines DAVP
No ratings yet
Guidelines DAVP
3 pages
Quantitative Social Science Data With R An Introduction 1st Edition Brian J Fogarty Download
No ratings yet
Quantitative Social Science Data With R An Introduction 1st Edition Brian J Fogarty Download
89 pages
Gec Practicals
No ratings yet
Gec Practicals
31 pages
DAV Practicle File
No ratings yet
DAV Practicle File
28 pages
Dav Lab Manual
No ratings yet
Dav Lab Manual
28 pages
Ai Class 12 Practical 2
No ratings yet
Ai Class 12 Practical 2
21 pages
Vanshika Goyal Gec Practicals
No ratings yet
Vanshika Goyal Gec Practicals
31 pages
Ankit Python
No ratings yet
Ankit Python
26 pages
Python 1
No ratings yet
Python 1
16 pages
Fds QB
No ratings yet
Fds QB
6 pages
23bet10114 Naman Gupta Assignment-1
No ratings yet
23bet10114 Naman Gupta Assignment-1
17 pages
DXE 24gksmknvj
No ratings yet
DXE 24gksmknvj
16 pages
(Ebook PDF) SAS Certified Specialist Prep Guide: Base Programming Using SAS 9.4 PDF Download
100% (2)
(Ebook PDF) SAS Certified Specialist Prep Guide: Base Programming Using SAS 9.4 PDF Download
55 pages
ML Record
No ratings yet
ML Record
19 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet

FDS Slips Solution

Uploaded by

FDS Slips Solution

Uploaded by

SLIP-1 / SLIP-11

from sklearn.datasets import load_iris

B) Write a Python program to generate a line plot of name Vs salary

Write a Python program to perform the following tasks :

import matplotlib.pyplot as plt

C) Write a program in python to perform following task (Use winequality-red.csv ) Import

You might also like