Data Mining Assignment
Data Mining Assignment
RPubs
Dummy
brought to you by RStudio
by subba reddy Yeruva Last updated 12 months ago Comments () Share
Sign in Register
Hide Toolbars
1.1 Apply hierarchical clustering with Euclidean distance and Wards method. Make sure to standardize the
data first. How many clusters appear?
1.3 Compare the cluster centroids to characterize the different clusters and try to give each cluster a label.
1.4 To check the stability of the clusters, remove a random 5% of the data (by taking a random sample of 95%
of the records), and repeat the analysis. Does the same picture emerge?
1.5 Use k-means clustering with the number of clusters that you found above in Part (a). Does the same
picture emerge? If not, how does it contrast or validate the finding in Part c above?
1.6 Which cluster(s) would you target for offers, and what type of offers would you target to customers in that
cluster? Include proper reasoning in support of your choice of cluster(s) and the corresponding offer(s).
As per the generated dendogram below, if slicing is done at height 100 then there are 3 clusters formed.
As per the dendogram branching is done into 2 parts and left branch alone formed one cluster and 2nd
branch(Right Branch) again splitted into two more branches. The two branches of right branch are formed into
an cluster. Hence in total 3 clusters with maximum number of customers.
https://rpubs.com/yeruva_s_reddy/194667 1/1