Subject Name: Data Mining and Data Visualization Time:02:30 PM TO 05:00 PM Total Marks: 70 Instructions:
1. Attempt all questions.
2. Make Suitable assumptions wherever necessary. 3. Figures to the right indicate full marks. 4. Use of simple calculators and non-programmable scientific calculators are permitted.
Q.1 (a) Do as directed. 07
1. _________ is the input to KDD. 2. Dimensionality reduction reduces the data set size by removing ____________. 3. The first phase of APriori algorithm is _______. 4. Clustering is a Unsupervised learning. [True/False] 5. What is an example of an ordinal value? 6. What is Market Basket Analysis? 7. Define Data Cube. (b) List the various Data Pre-processing methods. Discuss various Data Cleaning 07 techniques for Missing value and Noisy data. Q.2 (a) Discuss major issues in data mining. 07 (b) What is meant by data reduction? Explain dimensionality reduction. 07 OR (b) Describe the steps involved in data mining when viewed as a process of 07 knowledge discovery. Q.3 (a) Discuss confusion matrix by example. 07 (b) Write a short note on the categorization of clustering methods. 07 OR Q.3 (a) What is Bayes theorem? Explain the working of Naïve Bayesian Classifier. 07 (b) Discuss the Algorithm for Inducing a Decision Tree from Training Tuples in 07 detail. Q.4 (a) What is the purpose of Apriori algorithm? Explain with suitable example. 07 (b) Explain K-Means partitioning method of clustering. 07 OR Q.4 (a) Discuss the different categories related to clustering techniques. 07 (b) What is a canvas chart? How to add animations in canvas chart? 07 Q.5 (a) Write a short note: Google Charts for advanced visualizations. 07 (b) What are Outliers? Explain Challenges of Outliers Detection. 07 OR Q.5 (a) Write a short note on different charting primitives 07 (b) Discuss the applications of data visualization. 07