MTech (DS) Sem-II Data Mining and Predictive Analytics - Out
MTech (DS) Sem-II Data Mining and Predictive Analytics - Out
MARKING SCHEME
Amity University Haryana
M.Tech. (DS)
Second Semester End Term Examinations – April-May, 2019
Course Title: Data Mining and Predictive Analytics
1. In real-world data, tuples with missing values for some attributes are a common
occurrence.Describe methods for handling this problem.
Answer: Name of methods -2 Marks, Explanation- 4 Marks.
2. Describe the steps that are required in a data mining process and explain the importance
of each step.
Answer: Steps – 3 Marks, Explanation of importance of steps – 3 Marks.
3. Managers want to know by next week whether deployment will take place.
Therefore,analysts meet to discuss how useful and accurate their model is. Explain which
phase in the CRISP-DM process is represented in the stated scenario.
Answer: Name of steps – 3 Marks, Explanation – 3 Marks.
4. Suppose that our target variable is continuous numeric. Can we apply decision trees
directly to classify it? How can we work around this?
Answer: Answer andWorking Mechanism – 6 Marks.
6. Describe some of the similarities between Kohonen networks and the neural networks.
Answer: 4 Similarities – 6 marks.
7. Explain Decision Tree Induction.Describe basic algorithm for inducing a decision tree
from training tuples?
Answer: Definition decision tree – 3 marks, Algorithm – 3 Marks.
Page 1 of 2
8. Outliers are often discarded as noise. However, one person’s garbage could be another’s
treasure. For example, exceptions in credit card transactions can help us detect the
fraudulent use of credit cards. Taking fraudulence detection as an example, propose two
methodsthat can be used to detect outliers and discuss which one is more reliable.
Answer: Name of methods-4 Marks, Explanation and name of outlier – 6 Marks.
9. Discuss Cluster Analysis. What are some typical applications of clustering? What are
some typical requirements of clustering in data mining?
Answer: Definition- 3 Marks, Application – 3 Marks, Requirement – 4 Marks.
Suppose minimum support count is fixed as 50% and minimum confidence required is
70%. Find out the frequent item set using the apriori algorithm. (10)
Answer: Demonstration of algorithm – 10 Marks
b) Using the data in below table, find the k-nearest neighbour for record 5 and record 10
using k=3. (10)
Page 2 of 2