DMG Exam 3

Uploaded by

organizedmessy.anan

Total Marks: 30 Time: 60 min.

Instructions

1. Create a PDF file with your answers, name the file <Name>-<RollNo>.pdf,
and upload it to the classroom page.
2. Any submission made at or after 11:10 will be marked as late. No submissions after
11:10 will be considered.
3. Join the Zoom link (lecture) from a camera-enabled device. Attendance will be
taken on Zoom before evaluation. Absent students will be given zero marks.
4. For plagiarism, institute policy will be followed. Any case of plagiarism from online sources or
from your colleagues will result in an "F" grade.
5. Do not round off any values. You are strongly advised to truncate any intermediate or final
decimal values.
6. If you face issues while submitting to Google Classroom, please mail your responses to
[email protected] with the subject "DMG Exam 3". The above timings also apply to mailed
responses.

Question-1 [5 Marks] The Bisecting k-Means algorithm starts by dividing the points into two
clusters. It may consider several bisections and pick the best one. Let us take "best" to mean the
lowest SSE (Sum of Squared Errors). The SSE of a cluster is defined as the sum of the squares of the
distances between each point of the cluster and the centroid of the cluster.

Suppose that the data set consists of nine points arranged in a square grid, as suggested by the figure
below:

Although it doesn't matter for this question, you may take the grid spacing to be 1 (i.e., the squares
are 2-by-2) and the lower-left corner to be the point (0,0). In the figure, we see three possible
bisections. (a) would be the bisection if we chose the two initial centroids to be 3 and 7, for example,
and broke ties in favor of 7. (b) would be the split if we chose initial centroids 1 and 2. (c) would be
the split for initial choice 2 and 7.

State whether each of the following choices is correct or incorrect. If a choice is incorrect,
explain why.
I. (b) is better than (a)

II. (c) is the worst choice.

III. (a) and (c) are equally good choices.
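
The SSE comparison described above can be sketched in a few lines of Python. This is a minimal illustration, assuming the grid spacing of 1 and the lower-left corner at (0,0) mentioned in the question. The split used here (left column versus the other two columns) is a hypothetical example and is not claimed to be bisection (a), (b), or (c), since those depend on the figure. The function names are illustrative.

```python
from itertools import product

def centroid(points):
    """Mean of a list of 2-D points."""
    n = len(points)
    return (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)

def sse(points):
    """Sum of squared distances from each point to the cluster centroid."""
    cx, cy = centroid(points)
    return sum((x - cx) ** 2 + (y - cy) ** 2 for x, y in points)

def bisection_sse(cluster_a, cluster_b):
    """Total SSE of a bisection: the SSE of both clusters added together."""
    return sse(cluster_a) + sse(cluster_b)

# Nine points on a 3-by-3 grid with spacing 1, lower-left corner at (0, 0).
grid = [(x, y) for x, y in product(range(3), repeat=2)]

# Hypothetical split for illustration only: left column vs. the rest.
left = [p for p in grid if p[0] == 0]
rest = [p for p in grid if p[0] > 0]

total = bisection_sse(left, rest)
print(total)  # 7.5
```

To compare candidate bisections, pass each pair of point lists to `bisection_sse` and prefer the split with the smaller total.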

Question-2 [9 marks] Suppose that the true data consists of three clusters, as suggested by the
diagram below:

There is a large cluster B centered around the origin (0,0), with 8000 points uniformly distributed in a
circle of radius 2. There are two small clusters, A and C, each with 1000 points uniformly distributed
in a circle of radius 1. The center of A is at (-10,0) and the center of C is at (10,0).

Suppose we choose three initial centroids x, y, and z, and cluster the points according to which of x, y,
or z they are closest to. The result will be three apparent clusters, which may or may not coincide with
the true clusters A, B, and C. Depending on which clusters x, y, and z are chosen from, it is possible that
all and only the points in B will be assigned to one of these three centroids. Another possibility is that
one of these centroids will be assigned all of B and all of A, but none of C. A third possibility is
that one centroid will be assigned all of B and all of C, but none of A. Compute the probability of each of
these events.
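
One ingredient of such a calculation is the probability that the three initial centroids come from a given combination of true clusters. The sketch below computes that, assuming the centroids are three distinct points drawn uniformly at random from the 10,000 points. The question does not state the sampling scheme, so this is an assumption; drawing with replacement would instead give, for example, 0.8^3 for all three centroids falling in B. Deciding which combinations lead to each event still requires the geometric argument and is not done here.

```python
from fractions import Fraction
from math import comb

# Cluster sizes taken from the question.
SIZES = {"A": 1000, "B": 8000, "C": 1000}
TOTAL = sum(SIZES.values())

def composition_probability(counts):
    """Probability that the sampled centroids have the given per-cluster counts.

    Assumes sampling without replacement, uniformly over all points.
    """
    ways = 1
    for name, k in counts.items():
        ways *= comb(SIZES[name], k)
    return Fraction(ways, comb(TOTAL, sum(counts.values())))

p_all_b = composition_probability({"A": 0, "B": 3, "C": 0})
p_one_each = composition_probability({"A": 1, "B": 1, "C": 1})
print(float(p_all_b), float(p_one_each))
```

Exact fractions are used so that intermediate values are not rounded; convert to a decimal only at the end.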

Question-3 [5 Marks] Perform a hierarchical clustering of the following six points:

Use the single-link proximity measure (the distance between clusters is the shortest distance
between any pair of points, one from each cluster). Give the dendrogram showing the correct merge
sequence.
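
The single-link merge procedure can be sketched as follows. This is a minimal, unoptimized illustration; the four coordinates are hypothetical and are not the six points from the question, which are not reproduced in this text. The function name `single_link` is illustrative.

```python
from itertools import combinations
from math import dist

def single_link(points):
    """Return the merge sequence as (cluster_a, cluster_b, distance) tuples."""
    clusters = [(name,) for name in points]
    merges = []

    def gap(pair):
        # Single link: distance between clusters is the smallest
        # distance between any two points, one from each cluster.
        a, b = pair
        return min(dist(points[p], points[q]) for p in a for q in b)

    while len(clusters) > 1:
        a, b = min(combinations(clusters, 2), key=gap)
        merges.append((a, b, gap((a, b))))
        merged = tuple(sorted(a + b))
        clusters = [c for c in clusters if c not in (a, b)] + [merged]
    return merges

# Hypothetical coordinates for illustration only.
points = {"p1": (0.0, 0.0), "p2": (1.0, 0.0), "p3": (5.0, 0.0), "p4": (5.0, 3.0)}
merges = single_link(points)
for a, b, d in merges:
    print(a, b, d)
```

The recorded distances are the heights at which the corresponding branches join in the dendrogram.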

Question-4 [5 Marks] In the following, 1 through 7 are items. Which of the following association
rules has a confidence that is certain to be at least as great as the confidence of 12=>34567 and no
greater than the confidence of 1234=>5? Give reasons.

a) 134=>567

b) 123=>456

c) 134=>257

d) 134=>256

e) 124=>356
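
For reference, the confidence of a rule X=>Y is support(X ∪ Y) / support(X), and adding items to an itemset can never increase its support. The sketch below computes confidence on a small set of hypothetical transactions (invented for illustration, not part of the question) and checks the two bounding rules named above.

```python
def support(transactions, items):
    """Number of transactions that contain every item in `items`."""
    items = set(items)
    return sum(1 for t in transactions if items <= t)

def confidence(transactions, lhs, rhs):
    """Confidence of lhs => rhs: support(lhs and rhs together) / support(lhs)."""
    denom = support(transactions, lhs)
    if denom == 0:
        raise ValueError("left-hand side never occurs")
    return support(transactions, set(lhs) | set(rhs)) / denom

# Hypothetical transactions for illustration only.
transactions = [
    {1, 2, 3, 4, 5, 6, 7},
    {1, 2, 3, 4, 5},
    {1, 2, 3, 4},
    {1, 2},
    {1, 3, 4, 5, 6, 7},
]

low = confidence(transactions, {1, 2}, {3, 4, 5, 6, 7})   # 1/4 on this data
high = confidence(transactions, {1, 2, 3, 4}, {5})        # 2/3 on this data
print(low, high)
```

A single dataset can only refute a candidate rule, not prove it; the "certain to be" part of the question needs the support argument to hold for every possible dataset.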

Question-5 [6 Marks] Consider the task of anomaly detection using the classic DBSCAN algorithm
and the Silhouette score. The dataset has clusters of different densities, and anomalies are rare but
form a group with a higher density than the cluster of regular points.
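
To experiment with this setting, a minimal pure-Python DBSCAN can be sketched as below. It is a simplified illustration, not a reference implementation; the toy points and the `eps` and `min_pts` values are hypothetical, and a point counts as its own neighbor here. DBSCAN reports as noise only those points that are not density-reachable from any core point.

```python
from math import dist

def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or -1 for noise."""
    # Neighborhoods include the point itself (distance 0).
    neighbors = [[j for j, q in enumerate(points) if dist(p, q) <= eps]
                 for p in points]
    labels = [None] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        if len(neighbors[i]) < min_pts:
            labels[i] = -1  # provisional noise; may become a border point
            continue
        cluster += 1
        labels[i] = cluster
        frontier = list(neighbors[i])
        while frontier:
            j = frontier.pop()
            if labels[j] == -1:
                labels[j] = cluster  # border point: joined but not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster
            if len(neighbors[j]) >= min_pts:
                frontier.extend(neighbors[j])  # core point: keep expanding
    return labels

# Hypothetical data: a sparse regular cluster, a tight group, one isolated point.
regular = [(0, 0), (1, 0), (0, 1), (1, 1), (2, 0), (2, 1)]
tight_group = [(10, 10), (10.1, 10), (10, 10.1), (10.1, 10.1)]
isolated = [(20, 0)]
labels = dbscan(regular + tight_group + isolated, eps=1.1, min_pts=3)
print(labels)  # [0, 0, 0, 0, 0, 0, 1, 1, 1, 1, -1]
```

On this toy data the tight group receives its own cluster label, and only the isolated point is marked as noise.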

a) How well will the DBSCAN algorithm perform in detecting anomalies?

b) The Silhouette score is usually defined using the distance of a given point from the
centroid of its cluster. Comment on the Silhouette coefficient values of the anomalous
objects under the DBSCAN algorithm.
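
The centroid-based coefficient described in part b) can be sketched as follows. This follows the question's centroid-based description (often called the simplified silhouette); the widely used definition instead compares mean pairwise distances within and between clusters. Skipping noise points (label -1) is an assumption of this sketch, and the sample points and labels are hypothetical.

```python
from math import dist

def centroid(points):
    """Coordinate-wise mean of a list of points."""
    n = len(points)
    return tuple(sum(c) / n for c in zip(*points))

def simplified_silhouette(points, labels):
    """Centroid-based silhouette per point; noise (label -1) gets None."""
    groups = {}
    for p, lab in zip(points, labels):
        if lab != -1:
            groups.setdefault(lab, []).append(p)
    centers = {lab: centroid(ps) for lab, ps in groups.items()}
    scores = []
    for p, lab in zip(points, labels):
        if lab == -1 or len(centers) < 2:
            scores.append(None)
            continue
        a = dist(p, centers[lab])  # distance to own centroid
        b = min(dist(p, c) for other, c in centers.items() if other != lab)
        scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return scores

# Hypothetical points and labels for illustration only.
points = [(0, 0), (2, 0), (10, 0), (12, 0), (50, 50)]
labels = [0, 0, 1, 1, -1]
scores = simplified_silhouette(points, labels)
print(scores)
```

Values near 1 indicate a point much closer to its own centroid than to any other; values near 0 or below indicate a point sitting between clusters or closer to another one.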
