Question Bank
Semester: IV Sem
Subject: Data Science
Sub Code: 17MCA441
Sl. No. Questions Marks
DEPARTMENT OF MCA
Module 1
Module 2
Module 3
Module 5
1 What is cluster analysis? List the application areas of cluster analysis in practical problems. 10 marks
2 Explain the different types of clustering. 8 marks
3 Describe the basic K-means algorithm with an example. 8 marks
4 Mention the ways of choosing the initial centroids. 8 marks
5 Determine the time and space complexity of the K-means algorithm. 5 marks
6 Explain the bisecting K-means algorithm. 8 marks
7 Mention the strengths and weaknesses of the K-means algorithm. 5 marks
8 Describe the agglomerative hierarchical clustering algorithm. 8 marks
9 Write a basic agglomerative hierarchical clustering algorithm. 8 marks
10 Explain the different ways of defining the proximity between clusters. 8 marks
11 Determine the time and space complexity of the hierarchical clustering algorithm. 8 marks
12 Illustrate Ward’s method in finding the proximity between two clusters. 8 marks
13 Describe the DBSCAN clustering algorithm with an example. 8 marks
14 How are clusters evaluated? Explain. 8 marks
15 What is anomaly detection? Illustrate applications for which anomalies are of interest. 8 marks
16 Mention the causes of anomalies. 8 marks
17 Explain different approaches to anomaly detection. 8 marks
18 Explain the different issues that need to be addressed when dealing with anomaly detection. 8 marks
19 Describe the statistical approaches to outlier detection. 8 marks
20 Explain proximity-based outlier detection. 8 marks
21 Explain the clustering-based techniques for outlier detection. 8 marks