We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 6
Perfect Python Homework Help
For Any Homework related queries, Call us at : - +1 678 648 4277
You can mail us at : - [email protected] or reach us at : - https://www.pythonhomeworkhelp.com/ 1) Select the correct option for a data set with 7 objects and an interval-scaled variable 1 'we have the following measurements: f (1, 2, 3, 4, 5, 8, 50) containing one outlying value.
(a) Std deviation(std f) is more affected by the outlier
(b) Mean absolute deviation (s f) is more affected by the outlier (c) Std deviation (std f) is less affected by the outlier (d) Std deviation (std_ f) and mean absolute deviation (s_ f) are having the same effect of the outlier
Answer:- (a) Std deviation(std f) is more affected by the outlier
2) Euclidean and Manhattan distances between the objects P, 0 and
(b) Supervised learning technique (c) Both (a) and (b) (d) None of these
Answer:- (b) Supervised learning technique
4) Select the correct statement about the standardization in the
following options-
(e) Standardizing the data always gives inefficient result while
making clusters (f) Standardizing the data always beneficial during clustering analysis (c) The variables having an absolute value may not efficient after standardization during clustering analysis (d) Outliers can not be detected by standardized data
Answer:- (c) The variables having an absolute value may not
efficient after standardization during clustering analysis 5) Dissimilarity can be defined as –
(a) How much certain objects differ from each other
(b) How much certain objects similar from each other (C) dissimilarities are non-negative numbers d( i, j) that are small (dose to zero) when i and j are 'near" to each other and that become large when i and j are very different (d) both (a) and (c)
Answer:- (d) both (a) and (c)
6) Select incorrect statement about similarity –
(a) Similarity j) typically takes or values between 0 and 1, where 0
means that i and j are not similar at all and 1 reflects the maximal similarity (b) Si, i) = 1 (c) Similarities between variables can be defined using Pearson or the Spearman correlation coefficient (d) It is not necessary to trarsform the similarities into dissimilarities while data consists of the similarity matrix
Answer:- It is not necessary to trarsform the similarities into
dissimilarities while data consists of the similarity matrix 7) Formula for dissimilarity computation between two objects 'or categorical variable is -Here p is a categorical variable and m denotes number of matches
(a) Di, j) p-m / p
(b) Di, j) = p-m / m (c) Di, j) = m-p / p (d) D(i, j) = m-p / m
Answer:- (a) Di, j) p-m / p
8) Which library is used for calculating distance measure in
9) Selects some facts about k-means clustering – (a) It comes under the partitiorirg method (b) Number of clusters are predefined for this method (c) Cluster similarity is measured in regard to the mean value of the objects in a cluster (d) All of the above
Answer:- (d) All of the above
10) Selects some facts about hierarchical clustering –
(a) A hierarchical method comes under either agglomerative or divisive algorithms (b) Hlierarchical methods suffer from the fact that once a step (merge or split) is done, it can rever be undone (c)A tree structure called a dendrogram is commonly used to represert the process of hierarchical clustering (e) All of the above