Quiz 7 Data Sci
I. To remedy the unit-of-measurement issue, the attributes may be rescaled by dividing each
attribute value by its variance.
II. The choice for the unit of measurement of a particular object is important because it directly affects
the cluster membership of the data points.
I only
II only
both I and II
neither I nor II
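As a small illustration of why the unit of measurement matters, the sketch below rescales one attribute recorded in two different units. It assumes rescaling by the standard deviation, which is one common choice; the height values and the function name rescale_by_std are made up for this example.

```python
import statistics

def rescale_by_std(values):
    # Divide every value by the (population) standard deviation of the attribute.
    sd = statistics.pstdev(values)
    return [v / sd for v in values]

# Hypothetical heights of four objects, recorded in centimetres and in metres.
heights_cm = [150.0, 160.0, 170.0, 180.0]
heights_m = [v / 100.0 for v in heights_cm]

scaled_cm = rescale_by_std(heights_cm)
scaled_m = rescale_by_std(heights_m)

# After rescaling, the two versions agree up to floating-point rounding,
# so the original unit no longer influences distances between objects.
for a, b in zip(scaled_cm, scaled_m):
    print(round(a, 6), round(b, 6))
```

Without rescaling, the centimetre version of the attribute would contribute distances 100 times larger than the metre version, which can change which points end up in the same cluster.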
Which of the following is ALWAYS TRUE about the considerations regarding the object attributes when
performing cluster analysis?
I. On the choice of which attributes to use, it is important to understand what attributes will be known
at the time a new object will be assigned to a cluster.
II. Whenever possible and based on the data, it is best to reduce the number of attributes to the extent
possible.
I only
II only
both I and II
neither I nor II
This study source was downloaded by 100000845580997 from CourseHero.com on 05-04-2022 20:23:28 GMT -05:00
https://www.coursehero.com/file/96439049/Quiz7DataScidocx/
In general, the following questions should be asked whenever performing diagnostics of the results.
Are the clusters well separated from each other?
Do any of the clusters have only a few points?
Do any of the centroids appear to be too close to each other?
All of the Above
This is an analytical technique that identifies k clusters of objects based on the objects’ proximity to the
centers of the k groups, where each center is the arithmetic average of the group's n-dimensional vectors of
attributes.
K-modes
K-means
K-medians
None of the Above
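To make the phrase "arithmetic average of n-dimensional vectors" concrete, the sketch below computes a cluster center as a component-wise mean and, for contrast, as a component-wise median. The three points and the function names are assumptions chosen only for illustration.

```python
import statistics

def centroid_mean(points):
    # Component-wise arithmetic average of the attribute vectors.
    return tuple(statistics.fmean(axis) for axis in zip(*points))

def centroid_median(points):
    # Component-wise median, shown only for comparison with the mean.
    return tuple(statistics.median(axis) for axis in zip(*points))

# Hypothetical cluster of three objects with two attributes each.
cluster = [(1.0, 2.0), (2.0, 4.0), (9.0, 6.0)]

mean_center = centroid_mean(cluster)      # (4.0, 4.0)
median_center = centroid_median(cluster)  # (2.0, 4.0)
print(mean_center, median_center)
```

The point (9.0, 6.0) pulls the mean-based center toward it, while the median-based center is less affected.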
II only
both I and II
neither I nor II
II only
both I and II
neither I nor II
The following is a result from R when running k-means on a dataset of 620 high school students, with
attributes for their grades in English, Math and Science. Based on this result, to which cluster does a
student belong whose grades in English, Math and Science are 90, 81 and 88, respectively?
Cluster 1
Cluster 2
Cluster 3
Cannot be determined
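Assigning an object to a cluster once the centroids are known amounts to finding the nearest centroid. The sketch below shows that step with Euclidean distance. The centroid values and the labels "A", "B" and "C" are invented for illustration; they are NOT the values from the R output the question refers to, which is not reproduced here.

```python
import math

# Hypothetical centroids (English, Math, Science). These numbers are made up
# and do not come from the R output mentioned in the question.
hypothetical_centroids = {
    "A": (95.0, 92.0, 94.0),
    "B": (85.0, 80.0, 86.0),
    "C": (72.0, 70.0, 68.0),
}

def assign_cluster(point, centroids):
    # Return the label of the centroid with the smallest Euclidean distance.
    return min(centroids, key=lambda name: math.dist(point, centroids[name]))

student = (90.0, 81.0, 88.0)
assigned = assign_cluster(student, hypothetical_centroids)
print(assigned)
```

With real output, the same comparison would be made against the cluster means reported by R.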
Which of the following is ALWAYS TRUE about considerations regarding the implementation of k-means?
I. The k-means algorithm is sensitive to the starting positions of the initial centroids.
II. K-means can handle all types of variables.
I only
II only
both I and II
neither I nor II
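The effect of starting positions can be seen on a tiny one-dimensional example. The sketch below is a minimal k-means loop, not a reference implementation; the data values, the two sets of starting centroids and the function name k_means_1d are assumptions made for illustration.

```python
def k_means_1d(values, centroids, max_iter=100):
    # Minimal k-means on one attribute, starting from the given centroids.
    centroids = list(centroids)
    labels = []
    for _ in range(max_iter):
        # Assignment step: each value goes to its nearest centroid.
        labels = [min(range(len(centroids)), key=lambda i: abs(v - centroids[i]))
                  for v in values]
        # Update step: each centroid becomes the mean of its members.
        updated = []
        for i, old in enumerate(centroids):
            members = [v for v, lab in zip(values, labels) if lab == i]
            updated.append(sum(members) / len(members) if members else old)
        if updated == centroids:
            break
        centroids = updated
    # Within-cluster sum of squares for the final assignment.
    wss = sum((v - centroids[lab]) ** 2 for v, lab in zip(values, labels))
    return centroids, wss

values = [0.0, 1.0, 10.0, 11.0, 20.0, 21.0]

good_centroids, good_wss = k_means_1d(values, [0.0, 10.0, 20.0])
poor_centroids, poor_wss = k_means_1d(values, [0.0, 1.0, 10.0])
print(good_centroids, good_wss)
print(poor_centroids, poor_wss)
```

Both runs use the same data and the same k = 3, yet they converge to different centroids and different WSS values. This is why k-means is often run from several starting positions.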
The process of identifying the appropriate value of k is referred to as finding the ‘elbow’ of the WSS
curve.
True
False
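The WSS curve mentioned above can be sketched by running k-means for several values of k and recording the within-cluster sum of squares each time. The code below is a minimal sketch under stated assumptions: the nine points are made up, and the first k points are used as starting centroids purely to keep the example deterministic.

```python
import math

def k_means(points, k, max_iter=100):
    # Assumption: the first k points serve as the starting centroids.
    centroids = list(points[:k])
    labels = []
    for _ in range(max_iter):
        # Assignment step: nearest centroid by Euclidean distance.
        labels = [min(range(k), key=lambda i: math.dist(p, centroids[i]))
                  for p in points]
        # Update step: component-wise mean of each cluster's members.
        updated = []
        for i in range(k):
            members = [p for p, lab in zip(points, labels) if lab == i]
            if members:
                updated.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                updated.append(centroids[i])
        if updated == centroids:
            break
        centroids = updated
    return centroids, labels

def wss(points, centroids, labels):
    # Sum of squared distances from each point to its assigned centroid.
    return sum(math.dist(p, centroids[lab]) ** 2 for p, lab in zip(points, labels))

# Hypothetical data with three visually obvious groups.
points = [(1.0, 1.0), (9.0, 1.0), (4.0, 8.0),
          (1.5, 1.0), (9.0, 1.5), (4.0, 8.5),
          (1.0, 1.5), (9.5, 1.0), (4.5, 8.0)]

wss_by_k = {}
for k in range(1, 5):
    centroids, labels = k_means(points, k)
    wss_by_k[k] = wss(points, centroids, labels)

for k, value in wss_by_k.items():
    print(k, round(value, 3))
```

For this data the WSS drops sharply up to k = 3 and only slightly afterwards, so the bend in the curve sits at k = 3.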