Minor Assignment - 4 (Machine Learning-Classification, Regression and Clustering)

The document outlines a minor assignment for a course on Machine Learning, focusing on classification, regression, and clustering tasks using Python. It includes specific tasks such as performing dimensionality reduction on the Iris dataset, creating visualizations for the California Housing dataset, implementing linear regression on temperature data, and classifying students based on exam scores using the KNN algorithm. Additionally, it involves K-Means clustering on various datasets and working with pandas Series for data manipulation.

Uploaded by

shaktibarik2004

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views2 pages

Minor Assignment - 4 (Machine Learning-Classification, Regression and Clustering)

Uploaded by

shaktibarik2004

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Centre for Data Science

Institute of Technical Education & Research, SOA, Deemed to be University

Python for Computer Science and Data Science 2 (CSE 3652)

M INOR A SSIGNMENT-4: M ACHINE L EARNING - C LASSIFICATION , R EGRESSION
AND C LUSTERING

1. Perform dimensionality reduction using scikit-learn’s TSNE estimator on the Iris dataset, then graph
the results.

2. Create a Seaborn pairplot graph for the California Housing dataset. Try the Matplotlib features to
panning and zoom in on the diagram. These are accessible via the icons in the Matplotlib window.

3. Go to NOAA’s Climate at a Glance page (Link) and download the available time series data for the
average annual temperatures of New York City from 1895 to today (1895-2025). Implement simple
linear regression using average annual temperature data. Also, show how does the temperature trend
compare to the average January high temperatures?

4. Load the Iris dataset from the scikit-learn library and perform classification on it with the k-nearest
neighbors algorithm. Use a KNeighborsClassifier with the default k value. What is the prediction
accuracy?

5. You are given a dataset of 2D points with their corresponding class labels. The dataset is as follows:

Point ID x y Class
A 2.0 3.0 0
B 1.0 1.0 0
C 4.0 4.0 1
D 5.0 2.0 1

A new point P with coordinates (3.0, 3.0) needs to be classified using the KNN algorithm. Use the
Euclidean distance to calculate the distance between points.

6. A teacher wants to classify students as ”Pass” or ”Fail” based on their performance in three exams.
The dataset includes three features:
Exam 1 Score Exam 2 Score Exam 3 Score Class (Pass/Fail)
85 90 88 Pass
70 75 80 Pass
60 65 70 Fail
50 55 58 Fail
95 92 96 Pass
45 50 48 Fail

A new student has the following scores:

• Exam 1 Score: 72
• Exam 2 Score: 78
• Exam 3 Score: 75

Classify this student using the K-Nearest Neighbors (KNN) algorithm with k = 3.

7. Using scikit-learn’s KFold class and the cross val score function, determine the optimal value for k
to classify the Iris dataset using a KNeighborsClassifier.
1
Centre for Data Science
Institute of Technical Education & Research, SOA, Deemed to be University

8. Write a Python script to perform K-Means clustering on the following dataset:

Dataset: {(1, 1), (2, 2), (3, 3), (8, 8), (9, 9), (10, 10)}

Use k=2 and visualize the clusters.

9. Write a Python script to perform K-Means clustering on the following dataset: Mall Customer Seg-
mentation. Use k = 5 (also, determine optimal k via the Elbow Method) and visualize the clusters to
identify customer segments.
Expected Output:

• Scatter plot showing clusters (e.g., “High Income-Low Spenders,” “Moderate Income-Moderate
Spenders”).
• Insights for targeted marketing strategies.

10. Perform the following tasks using the pandas Series object:

(a) Create a Series from the list [7, 11, 13, 17].
(b) Create a Series with five elements where each element is 100.0.
(c) Create a Series with 20 elements that are all random numbers in the range 0 to 100. Use the
describe method to produce the Series’ basic descriptive statistics.
(d) Create a Series called temperatures with the following floating-point values: 98.6, 98.9,
100.2, and 97.9. Use the index keyword argument to specify the custom indices ’Julie’,
’Charlie’, ’Sam’, and ’Andrea’.
(e) Form a dictionary from the names and values in Part (d), then use it to initialize a Series.

Information Security Fundamental Weaknesses Place EPA Data and Operations at Risk 1st Edition by Government Accountability Office ISBN 1508400784 9781508400783 Instant Download
100% (6)
Information Security Fundamental Weaknesses Place EPA Data and Operations at Risk 1st Edition by Government Accountability Office ISBN 1508400784 9781508400783 Instant Download
75 pages
03 - PowerScale Hardware Installation-SSP - Participant Guide
No ratings yet
03 - PowerScale Hardware Installation-SSP - Participant Guide
98 pages
BCSL606 Machine Learning Lab
No ratings yet
BCSL606 Machine Learning Lab
33 pages
6.EBS1-PTFA27-SAQA-PLQA-1002-D00 - Project Quality Plan
No ratings yet
6.EBS1-PTFA27-SAQA-PLQA-1002-D00 - Project Quality Plan
28 pages
Develop Disaster Recovery Back-Up Procedures and Recovery Instructions
No ratings yet
Develop Disaster Recovery Back-Up Procedures and Recovery Instructions
4 pages
Process Analysis and Simulation in Chemical Engineering
No ratings yet
Process Analysis and Simulation in Chemical Engineering
5 pages
Machine Learning
100% (5)
Machine Learning
56 pages
Introduction To Python and Computer Programming 1704298503
No ratings yet
Introduction To Python and Computer Programming 1704298503
44 pages
Lecture 4 KNN
No ratings yet
Lecture 4 KNN
17 pages
Simulation Model For The Study of Maintenance Actions I
No ratings yet
Simulation Model For The Study of Maintenance Actions I
19 pages
DPR - Surat Metro (Ceo Ieccl - June-23
No ratings yet
DPR - Surat Metro (Ceo Ieccl - June-23
332 pages
Basic Exception Handling
No ratings yet
Basic Exception Handling
7 pages
18-Article Text-30-2-10-20210321
No ratings yet
18-Article Text-30-2-10-20210321
11 pages
38asb Air Cooled Condensing Units
No ratings yet
38asb Air Cooled Condensing Units
1 page
Research Methodology
100% (1)
Research Methodology
10 pages
Digital Progress and Trends Report 2023
No ratings yet
Digital Progress and Trends Report 2023
177 pages
Lab7.ipynb - Colaboratory
100% (1)
Lab7.ipynb - Colaboratory
5 pages
Hand-Over List
No ratings yet
Hand-Over List
88 pages
XD Series Catalouge
No ratings yet
XD Series Catalouge
90 pages
K Means
No ratings yet
K Means
3 pages
Class Notes
No ratings yet
Class Notes
54 pages
ML - Datascience Manual
No ratings yet
ML - Datascience Manual
64 pages
Java Assignment 31 To 60
No ratings yet
Java Assignment 31 To 60
51 pages
Machine Learning Final Manual
No ratings yet
Machine Learning Final Manual
45 pages
ML Lab Programs (1-13)
No ratings yet
ML Lab Programs (1-13)
44 pages
ML Lab Manual
No ratings yet
ML Lab Manual
60 pages
ML Lab Manual
No ratings yet
ML Lab Manual
43 pages
Rahul Raj - Ipynb - Colab
No ratings yet
Rahul Raj - Ipynb - Colab
50 pages
ML Lab Mannual1
No ratings yet
ML Lab Mannual1
37 pages
BCSL606 Machine Learning Lab Final Draft
No ratings yet
BCSL606 Machine Learning Lab Final Draft
32 pages
Exercise and Experiment 3
No ratings yet
Exercise and Experiment 3
14 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
33 pages
MGT460 Assignment
No ratings yet
MGT460 Assignment
15 pages
Machine Learning Lab Manaul BCSL606
No ratings yet
Machine Learning Lab Manaul BCSL606
27 pages
MLLab Manual
No ratings yet
MLLab Manual
24 pages
Lecture 12 K-Nearest Neighbors
No ratings yet
Lecture 12 K-Nearest Neighbors
24 pages
ML Lab Manual
No ratings yet
ML Lab Manual
25 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
26 pages
Consent Letter For Society
No ratings yet
Consent Letter For Society
3 pages
ML Lab Manual
No ratings yet
ML Lab Manual
24 pages
Variable Stepper Motor 2k22 Ec 58 Ec 59 Ec 41 Ec 54
No ratings yet
Variable Stepper Motor 2k22 Ec 58 Ec 59 Ec 41 Ec 54
26 pages
Act 8
No ratings yet
Act 8
20 pages
Instructions
No ratings yet
Instructions
20 pages
New Data Science Module Nearest Neighbors
No ratings yet
New Data Science Module Nearest Neighbors
22 pages
Drag Force: The Basics of Transport Phenomena
No ratings yet
Drag Force: The Basics of Transport Phenomena
12 pages
Mlalllabprgs
No ratings yet
Mlalllabprgs
17 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
18 pages
L5 K Nearest Neighbor
No ratings yet
L5 K Nearest Neighbor
10 pages
Argha's ML LAB - 240927 - 121838
No ratings yet
Argha's ML LAB - 240927 - 121838
13 pages
Final
No ratings yet
Final
13 pages
Shubham Pract 6 - Merged
No ratings yet
Shubham Pract 6 - Merged
12 pages
3 Framework PACT 21 A
No ratings yet
3 Framework PACT 21 A
12 pages
B-56 Sanket Jambhulkar MLA-7
No ratings yet
B-56 Sanket Jambhulkar MLA-7
9 pages
Machine Learning Programs
No ratings yet
Machine Learning Programs
10 pages
DM 2023
No ratings yet
DM 2023
8 pages
Recursion, Searching, Sorting
No ratings yet
Recursion, Searching, Sorting
14 pages
M PDF
No ratings yet
M PDF
13 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
9 pages
Assignment 3 B
No ratings yet
Assignment 3 B
7 pages
Homework 1
No ratings yet
Homework 1
9 pages
KNN Cookbook
No ratings yet
KNN Cookbook
8 pages
V
No ratings yet
V
8 pages
Carreon WS06
No ratings yet
Carreon WS06
4 pages
2.3 Aiml Rishit
No ratings yet
2.3 Aiml Rishit
7 pages
Ad Lab 9
No ratings yet
Ad Lab 9
6 pages
ML 3
No ratings yet
ML 3
6 pages
ML Experiment - 9 - Final
No ratings yet
ML Experiment - 9 - Final
6 pages
KNN Practice Set
No ratings yet
KNN Practice Set
5 pages
MS6711 Data Mining Homework 1: 1.1 Implement K-Means Manually (8 PTS)
No ratings yet
MS6711 Data Mining Homework 1: 1.1 Implement K-Means Manually (8 PTS)
6 pages
Assignment 02
No ratings yet
Assignment 02
5 pages
Problems: Measure Value City Block Distance
No ratings yet
Problems: Measure Value City Block Distance
3 pages
ML Cat QNS
No ratings yet
ML Cat QNS
4 pages
Assignment No 2 AI
No ratings yet
Assignment No 2 AI
4 pages
Boookkk
No ratings yet
Boookkk
4 pages
Assignment I
No ratings yet
Assignment I
4 pages
Minor Assignment-3 (NLP)
No ratings yet
Minor Assignment-3 (NLP)
2 pages
KNN Lab
No ratings yet
KNN Lab
4 pages
51 DA5400 - FML51 - 20250501 ProblemSet06
No ratings yet
51 DA5400 - FML51 - 20250501 ProblemSet06
4 pages
Implementation of K-Nearest Neighbor Classification Model: Background
No ratings yet
Implementation of K-Nearest Neighbor Classification Model: Background
4 pages
Lab 4 AI (Nafay)
No ratings yet
Lab 4 AI (Nafay)
3 pages
8051 Microcontroller Based RFID Car Parking System
No ratings yet
8051 Microcontroller Based RFID Car Parking System
4 pages
178 hw1
No ratings yet
178 hw1
4 pages
Q1S 1
No ratings yet
Q1S 1
2 pages
RedTeamAcademy Form
No ratings yet
RedTeamAcademy Form
2 pages
ENGS 25 LabExercise01 Pacing
No ratings yet
ENGS 25 LabExercise01 Pacing
3 pages
How To Setup Cashiering Management System in Code: and Mysql Database Is A Great Advantage When Used Properly Within The
No ratings yet
How To Setup Cashiering Management System in Code: and Mysql Database Is A Great Advantage When Used Properly Within The
3 pages
Date Sheet For The BS in Computer Science 4 Year Programe 1st 2nd 3rd 69039
No ratings yet
Date Sheet For The BS in Computer Science 4 Year Programe 1st 2nd 3rd 69039
3 pages
Process of ML Code/Algorithm: KNN Type I - Input Test Sample Method
No ratings yet
Process of ML Code/Algorithm: KNN Type I - Input Test Sample Method
3 pages
Assignment 1 - Machine Learning
No ratings yet
Assignment 1 - Machine Learning
2 pages
LC 33
No ratings yet
LC 33
2 pages
Midterm - APS1070 - 2019 - 09 Fall
No ratings yet
Midterm - APS1070 - 2019 - 09 Fall
2 pages
BMW Case
No ratings yet
BMW Case
2 pages
Ranks and Experience - Tanki Online Wiki
No ratings yet
Ranks and Experience - Tanki Online Wiki
1 page
Academic Calendar For MBA Modular System
No ratings yet
Academic Calendar For MBA Modular System
1 page
IGNOU BCA Computer Oriented Numerical Technique Previous Year Unsolved Papers BCS 054
From Everand
IGNOU BCA Computer Oriented Numerical Technique Previous Year Unsolved Papers BCS 054
Manish Soni
No ratings yet

Minor Assignment - 4 (Machine Learning-Classification, Regression and Clustering)

Uploaded by

Minor Assignment - 4 (Machine Learning-Classification, Regression and Clustering)

Uploaded by

Centre for Data Science

Institute of Technical Education & Research, SOA, Deemed to be University

Python for Computer Science and Data Science 2 (CSE 3652)

A new student has the following scores:

8. Write a Python script to perform K-Means clustering on the following dataset:

Use k=2 and visualize the clusters.

You might also like