BHASKAR ENGINEERING COLLEGE
B. Tech. IV Year I Sem. II Mid Examinations, Jan- 2025
Subject Name: DATA MINING
Branch: COMPUTER SCIENCE AND ENGINEERING
SET-I
Time : 1 1/2hr Max.Marks:20
Answer all questions in Part A. Each Question carries 1 mark
PART -A:
1. Define Support Vector Machines (SVM).
2. What is Outlier Analysis, and why is it important?
3. Describe Spatial Data Mining.
4. What is a decision tree?
5. What is Web Structure Mining?
PART-B:
Answer any one of the following 5M
1. Explain the concept of classification in machine learning. Discuss various classification techniques including
Decision Tree Induction, Naive Bayes Classifier, and Rule-Based Classification.
2. Discuss the different types of clustering methods. Explain partitioning methods like k-Means and k-Medoids in
detail.
3. Explain the concepts of Web Mining. Discuss Web Content Mining, Web Structure Mining, and Web Usage Mining
with examples.
Objective Exam
Name: ________________________ Hall Ticket No. A
Answer All Questions. All Questions Carry Equal Marks. Time: 20 Min. Marks: 10.
I. Choose the correct alternative:
1) Which clustering method requires the number of clusters (k) to be specified in advance . [ ]
a) DBSCAN b) k-Means c) both d) clustering
2) In DBSCAN, points that are not part of any cluster are known as: [ ]
a) Noise b) outliers c) points d) all of these
3) Web mining is typically divided into ____ categories: [ ]
a) 1 b) 2 c) 3 d) 4
4) _______ is the proportion of correct predictions out of the total predictions: [ ]
a) accuracy b) precision c) both d)recall
5) _______Correctly predicted positive instances [ ]
a) TP b) TN c) both d) none of these
6) ______Incorrectly predicted as negative. [ ]
a)TP b)TN c)FP d)FN
7) ________ Focuses on mining the content of web pages, such as text, images, videos. [ ]
a)WSM b)WUM c)WCM d)None of these
8) K-means is similar to _____________. [ ]
a) K-clustering b) K-mediods c) K-medoids d) none of these
9) __________ Involves analyzing the structure of hyperlinks between web pages to understand relationships and
patterns techniques. [ ]
a) WSM b) WCM c) both a and b d) WUM
10) Hierarchical Methods are of ____types. [ ]
a) 2 b) 3 c) 4 d) 5
II. Fill in the blanks:
11) A classification technique where the data is divided into different classes based on certain decision rules is
known as a ________________.
12) The method used to combine multiple weak learners to form a strong classifier is called ____________.
13) ____________ is a type of classifier that assigns labels based on the nearest neighbor in the feature space.
14) ____________ is a density-based clustering algorithm that can detect outliers in data.
15) In k-Means, the number of clusters, denoted as ____________, must be pre-specified.
16) Web Content Mining focuses on extracting information from the ____________ of websites.
17) The analysis of patterns and relationships over time in time-series data is known as ____________ Mining.
18) Temporal Association Rules are used to detect patterns in ____________ data.
19) _________is the proportion of true positives to the total actual positives.
20) __________ is an unsupervised learning task used to group similar objects together.