0% found this document useful (0 votes)

41 views8 pages

Data Mining Question Bank

Uploaded by

saniyaa.fatimaa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views8 pages

Data Mining Question Bank

Uploaded by

saniyaa.fatimaa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 8

Question Bank

2
3
3
3

Name of the Course: DATA MINING Course Code:

Branch: CSE/CSD/CSM Academic Year: 2024-2025

Name & Details of the Course Coordinator:

Dept. of. CSE-

K1-Remembering; K2-Understanding; K3-Applying; K4-Analyzing; K5-Evaluating; K6-

Creating
UNIT –I
Short Answer Question:
Bloom’s
Course
Q.No. Question Taxonomy Marks
Outcome
Level
What are the main components of data
1 CO1 K1 2
warehouse architecture?

Describe the role of the data staging area in

2 CO1 K1 3
data warehouse architecture.

3 What is data mining and why is it important? CO1 K2 2

What types of data are typically used in data

4 CO1 K1 3
mining?

What are the primary functionalities of data

5 CO1 K1 3
mining?

How are data mining systems classified based

6 CO1 K1 2
on data types?
Long Answer Questions:
Bloom’s
Course
S.No. Question Taxonomy Marks
Outcome
Level
Explain the concept of a data warehouse and its
1 CO1 K3 10
significance in business intelligence.
Describe the multidimensional model and its role in
2 data warehousing. Include an explanation of CO1 K4 10
dimensions, measures, and data cubes.
What are the advantages of using the multidimensional
3 CO1 K4 10
model in OLAP systems?
Describe the typical architecture of a data warehouse,
4 CO1 K4 10
including its key components and their functions.
Define data mining and discuss its importance in
5 extracting valuable insights from large datasets. CO1 K4 10
Structures in the context of document processing.
Explain the different functionalities of data mining and
6 CO1 K3 10
provide examples of each
UNIT: 2

Short Answer Questions:

Course Bloom’s
S. No. Question Marks
Outcome Taxonomy Level

What is association analysis in data

1 CO2 2
mining? K1

What is the Apriori algorithm used

2 CO2 K1 2
for?

What is an FP-tree and how does it

3 CO2 2
relate to frequent itemset mining? K1

What is meant by multilevel

4 CO2 K1 2
association rule mining?

What are multi-dimensional

5 CO2 K1 2
association rules?

Long Answer Questions:

Course Bloom’s
S.No. Question Marks
Outcome Taxonomy Level

Explain the concept of association

analysis in data mining. How does it
help in discovering relationships
1 among items in large datasets? Provide CO2 K4 10
examples of practical applications
where association analysis can be
beneficial.

Elaborate on the Apriori algorithm for

frequent itemset mining. How does the
algorithm generate candidate itemsets,
2 and what role does the concept of CO2 K2 10
support play in pruning the search
space? Discuss its advantages and
limitations.

Describe the FP-tree (Frequent Pattern

Tree) structure and explain how it
facilitates efficient frequent itemset
3 mining. What are the key features of CO2 K3 10
the FP-tree, and how do they differ
from the Apriori algorithm’s
approach?
Course Bloom’s
S.No. Question Marks
Outcome Taxonomy Level

Describe is multilevel association rule

mining, and how does it address
hierarchical data structures? Explain
4 CO2 K3 10
the process of mining rules at different
levels of abstraction and the challenges
associated with this approach.

Discuss the concept of multi-

dimensional association rule mining.
How does it differ from traditional
5 CO1 K5 10
association rule mining, and what are
the benefits of incorporating multiple
dimensions into the analysis?

Explain the role of correlation

analysis in the context of association
rule mining. How does correlation
6 CO2 K4 10
complement other measures like
support and confidence in evaluating
the strength of associations?

UNIT: 3

Short Answer Questions:

Bloom’s
Course
S.No. Question Taxonomy Marks
Outcome
Level

What is the primary goal of classification

1 CO3 2
in machine learning? K1

What is a class label in the context of K1

2 CO3 2
classification?

MID II

What are the typical steps involved in the K1

3 CO3 2
classification process?

How is the training set used in a K1

4 CO3 3
classification problem?

What is the Gini index, and how is it used K1

5 CO3 2
in decision tree induction?

Long Answer Questions:

Bloom’s
Course
S.No. Question Taxonomy Marks
Outcome
Level

Describe the classification problem in

machine learning. How does it differ from
1 other types of predictive modeling, such as CO3 K3 10
regression and clustering? Provide examples
to illustrate your explanation.

Outline the general approach to solving a K4

classification problem. Discuss each step in
2 CO3 10
detail, from data collection and preprocessing
to model selection, training, and evaluation.

MID II

Explain the process of decision tree induction.

How do decision trees split data at each node,
and what criteria are commonly used for
3 CO3 10
making these splits? Discuss the advantages K4
and disadvantages of using decision trees for
classification.

Describe the rule-based classifiers, and how

are classification rules generated from data?
4 Discuss the methods for evaluating and CO3 K4 10
refining rules to improve the accuracy and
interpretability of the classifier.

Explain the k-nearest neighbor (k-NN)

algorithm. How does it determine the class of K4
a new instance, and what factors influence the
5 CO3 10
performance of the classifier? Discuss the
impact of the choice of k and distance metrics
on the results.
UNIT: 4

Short Answer Questions:

Bloom’s
Course
S.No. Question Taxonomy Marks
Outcome
Level

1 What is the primary goal of cluster analysis? CO4 K1 2

Why are similarity and distance metrics crucial K2

2 CO4 2
in clustering?

What are the key characteristics of a good K1

3 CO4 2
clustering algorithm?

What is the basic principle of partition-based K1

4 CO4 2
clustering?

How does BIRCH handle large datasets K2

5 CO4 2
efficiently?

Long Answer Questions:

Bloom’s
Course
S.No. Question Taxonomy Marks
Outcome
Level

Explain the main objectives of cluster analysis and

discuss how it is utilized in various fields such as
6 marketing, biology, and image processing. How does CO4 K4 10
cluster analysis contribute to the discovery of
underlying patterns in large datasets?.

Describe the importance of similarity and distance

metrics in the context of clustering. How do these K3
7 metrics influence the formation of clusters, and what CO4 10
are the common challenges associated with choosing
an appropriate metric?

Identify and discuss the key characteristics that define

a clustering algorithm. How do factors such as
8 scalability, interpretability, and cluster shape influence CO4 K3 10
the selection of a clustering algorithm for a particular
application?
Bloom’s
Course
S.No. Question Taxonomy Marks
Outcome
Level

Examine the challenges associated with clustering

large-scale datasets. What strategies can be employed
9 to ensure that a clustering algorithm remains efficient CO4 K4 10
and effective when dealing with massive amounts of
data?

Compare k-means with other partition-based clustering

techniques such as k-medoids and fuzzy c-means.
10 Discuss the advantages and disadvantages of each CO4 K3 10
method, and provide examples of scenarios where one
might be preferred over the others.

UNIT: 5

Short Answer Questions:

Bloom’s
Course
S.No. Question Taxonomy Marks
Outcome
Level

What is a data stream, and how does it

1 CO5 2
differ from traditional data processing? K1

What is a time series, and how is it used

2 CO5 2
in data mining? K2

How do time series forecasting models

3 like ARIMA and Exponential CO5 K1 2
Smoothing work??

What is the goal of sequence pattern

4 CO5 2
mining in transactional databases? K1

Explain the difference between frequent

5 itemset mining and sequential pattern CO5 K2 2
mining..
Long Answer Questions:

Course Bloom’s
S.No. Question Marks
Outcome Taxonomy Level

Describe the
fundamental challenges
and strategies involved
in mining data streams.
How do concepts such
as concept drift and data
1 stream fragmentation CO5 K4 10
impact the effectiveness
of data stream mining
algorithms? Discuss how
incremental learning
approaches are used to
address these challenges.

Discuss the key

techniques for
summarizing and
approximating data in
data stream mining, such
as sketching and K3
2 CO5 10
sampling. How do these
techniques help in
handling the scalability
and memory limitations
inherent in data stream
environments?

Compare and contrast

time series forecasting
models such as ARIMA,
Exponential Smoothing, K4
and machine learning
approaches like LSTM
3 CO5 10
(Long Short-Term
Memory) networks.
How do these models
handle seasonality, trend
components, and noise
in time series data?
Course Bloom’s
S.No. Question Marks
Outcome Taxonomy Level

Describe the process of

mining sequential
patterns in transactional
databases, focusing on
algorithms such as
Prefix Span and SPADE. K3
4 C05 10
How do these algorithms
efficiently find frequent
sequential patterns and
handle the challenges of
large-scale sequence
data?

Explain how the

concept of constraint-
based mining can be
applied to sequential
pattern mining. What
5 types of constraints are C05 K4 10
commonly used, and
how do they impact the
efficiency and relevance
of the discovered
sequential patterns?

Analyze the unique

challenges associated
with mining object data
compared to traditional
tabular data. How do
object-oriented data K4
6 C05 10
models and complex
data structures affect the
mining process? Discuss
specific algorithms or
techniques used for
mining object data.

BR235 - EN - Col18 SAP Convergent Charging
No ratings yet
BR235 - EN - Col18 SAP Convergent Charging
195 pages
Project Management Decision Trees
100% (1)
Project Management Decision Trees
38 pages
JNTUK R20 ML UNIT-I Final
No ratings yet
JNTUK R20 ML UNIT-I Final
22 pages
Data Mining Question Bank
0% (1)
Data Mining Question Bank
7 pages
Emerging Artificial Intelligence Applications in Computer Engineering_ Real Word AI Systems With Applications in EHealth, HCI, Information Retrieval and ... in Artificial Intelligence and Applications) ( PDFDrive )
No ratings yet
Emerging Artificial Intelligence Applications in Computer Engineering_ Real Word AI Systems With Applications in EHealth, HCI, Information Retrieval and ... in Artificial Intelligence and Applications) ( PDFDrive )
421 pages
Decision Theory
No ratings yet
Decision Theory
67 pages
Unit 3
No ratings yet
Unit 3
86 pages
DM Important Questions
100% (1)
DM Important Questions
2 pages
Question Bank Semester: IV Sem Subject: Data Science Sub Code: 17MCA441 SL - No. Questions Marks
No ratings yet
Question Bank Semester: IV Sem Subject: Data Science Sub Code: 17MCA441 SL - No. Questions Marks
4 pages
Lesson 8 INDIVIDUAL TASK
No ratings yet
Lesson 8 INDIVIDUAL TASK
3 pages
DMDA Viva Questions-1
No ratings yet
DMDA Viva Questions-1
7 pages
Data Mining Question Bank
No ratings yet
Data Mining Question Bank
4 pages
17CS651 DMDW
No ratings yet
17CS651 DMDW
302 pages
Datamining Quiz
No ratings yet
Datamining Quiz
173 pages
BDA3
No ratings yet
BDA3
61 pages
DWDM Unitwise Qns
No ratings yet
DWDM Unitwise Qns
3 pages
Class 16 Decision Tree
No ratings yet
Class 16 Decision Tree
45 pages
Ansh Rohatgi 20csu169 AI ML WorkTag
No ratings yet
Ansh Rohatgi 20csu169 AI ML WorkTag
68 pages
Saikiran
No ratings yet
Saikiran
28 pages
Short Answer Type Questions: Question Bank
No ratings yet
Short Answer Type Questions: Question Bank
26 pages
Hillier 7e Ch12 PPT Accessible
No ratings yet
Hillier 7e Ch12 PPT Accessible
56 pages
Open AI and Its Impact On Fraud Detection in Financial Industry
No ratings yet
Open AI and Its Impact On Fraud Detection in Financial Industry
24 pages
Dcs 7302
No ratings yet
Dcs 7302
17 pages
UNCERTIANITY
No ratings yet
UNCERTIANITY
114 pages
DWDM
No ratings yet
DWDM
18 pages
AOR Questions
No ratings yet
AOR Questions
33 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
13 pages
Tutorial Pres 1
No ratings yet
Tutorial Pres 1
28 pages
DW Model Questions
No ratings yet
DW Model Questions
8 pages
Advanced Machine Learning Techniques For Cardiovascular Disease Early Detection and Diagnosis
No ratings yet
Advanced Machine Learning Techniques For Cardiovascular Disease Early Detection and Diagnosis
29 pages
Unit
No ratings yet
Unit
13 pages
Comp 414 Revision
No ratings yet
Comp 414 Revision
9 pages
MIT Communicating With Data
No ratings yet
MIT Communicating With Data
19 pages
Learning Analytics in Education For The Twenty-Fir
No ratings yet
Learning Analytics in Education For The Twenty-Fir
22 pages
Unit 5
No ratings yet
Unit 5
25 pages
DM QB
No ratings yet
DM QB
25 pages
Answers PDF
No ratings yet
Answers PDF
9 pages
R2032051
No ratings yet
R2032051
7 pages
Sentiment Identification in Football-Specific Tweets: Corresponding Author: Samah Aloufi (Salou102@uottawa - Ca)
No ratings yet
Sentiment Identification in Football-Specific Tweets: Corresponding Author: Samah Aloufi (Salou102@uottawa - Ca)
13 pages
ML Lab - V Sem - Bca
No ratings yet
ML Lab - V Sem - Bca
22 pages
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
No ratings yet
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
31 pages
Classification
No ratings yet
Classification
14 pages
HW1
No ratings yet
HW1
4 pages
Script of E - Previous Question Papers - URR18 03.08.2023 - VI Semester - U18CS605 PDF
No ratings yet
Script of E - Previous Question Papers - URR18 03.08.2023 - VI Semester - U18CS605 PDF
10 pages
Iris Classification
No ratings yet
Iris Classification
6 pages
DWDM-CSE-Question Bank
No ratings yet
DWDM-CSE-Question Bank
11 pages
DM-Question Bank 2024-25 Objective Question Bank
No ratings yet
DM-Question Bank 2024-25 Objective Question Bank
14 pages
DM 100
No ratings yet
DM 100
17 pages
DM
No ratings yet
DM
7 pages
Chapter 12 Decision-Making Under Conditions of Risk and Uncertainty
No ratings yet
Chapter 12 Decision-Making Under Conditions of Risk and Uncertainty
5 pages
Gujarat Technological University: Page 1 of 2
No ratings yet
Gujarat Technological University: Page 1 of 2
2 pages
ML Papers
No ratings yet
ML Papers
10 pages
Gujarat Technological University: Page 1 of 2
No ratings yet
Gujarat Technological University: Page 1 of 2
2 pages
Project Proposal
No ratings yet
Project Proposal
11 pages
CBC An Associative Classifier With A Small Number of Rules
No ratings yet
CBC An Associative Classifier With A Small Number of Rules
8 pages
Wa0001
No ratings yet
Wa0001
6 pages
DM IV YR MID2 Set2
No ratings yet
DM IV YR MID2 Set2
4 pages
DWDM
No ratings yet
DWDM
2 pages
DM Questions
No ratings yet
DM Questions
7 pages
DWDM Unit Wise Question Bank
No ratings yet
DWDM Unit Wise Question Bank
8 pages
DM Question Bank
No ratings yet
DM Question Bank
5 pages
Question Bank 2
No ratings yet
Question Bank 2
4 pages
Unit4 Mcqs
No ratings yet
Unit4 Mcqs
7 pages
Subject Question Bank-1
No ratings yet
Subject Question Bank-1
6 pages
Session 11 and Session 12-Decision Trees Construction Worked Examples
No ratings yet
Session 11 and Session 12-Decision Trees Construction Worked Examples
5 pages
9th 2ut Partb PDF
No ratings yet
9th 2ut Partb PDF
6 pages
Sample Question DMW
No ratings yet
Sample Question DMW
4 pages
Q1R Ext
No ratings yet
Q1R Ext
4 pages
191CSC503T - Data Mining-Cat 2-Question Bank
No ratings yet
191CSC503T - Data Mining-Cat 2-Question Bank
6 pages
Data Mining and Warehousing
No ratings yet
Data Mining and Warehousing
7 pages
Theory Format 1
No ratings yet
Theory Format 1
2 pages
Data Warehousing and Data Mining Dec 2023
No ratings yet
Data Warehousing and Data Mining Dec 2023
7 pages
DWDM QB
No ratings yet
DWDM QB
6 pages
DMDW QB
No ratings yet
DMDW QB
4 pages
Data Mining (Gtu Sem-6) 002
No ratings yet
Data Mining (Gtu Sem-6) 002
5 pages
Updated DWDM Question Bank 2021-22, I Sem
No ratings yet
Updated DWDM Question Bank 2021-22, I Sem
4 pages
QB Data Mining
No ratings yet
QB Data Mining
5 pages
Syllabus CSE 7th Sem
No ratings yet
Syllabus CSE 7th Sem
3 pages
3-2 Rec Cse DWDM
No ratings yet
3-2 Rec Cse DWDM
4 pages
DMBI All Pyqs
No ratings yet
DMBI All Pyqs
4 pages
DM DW Assignment (17775) PDF
No ratings yet
DM DW Assignment (17775) PDF
3 pages
Bca IV Data Minining Qestion Bank - Dr. KK Sharma Socsa
No ratings yet
Bca IV Data Minining Qestion Bank - Dr. KK Sharma Socsa
5 pages
Write Your Roll Number: Time: Hours Max. Marks
No ratings yet
Write Your Roll Number: Time: Hours Max. Marks
2 pages

Data Mining Question Bank

Uploaded by

Data Mining Question Bank

Uploaded by

Question Bank

Name of the Course: DATA MINING Course Code:

Name & Details of the Course Coordinator:

K1-Remembering; K2-Understanding; K3-Applying; K4-Analyzing; K5-Evaluating; K6-

Describe the role of the data staging area in

3 What is data mining and why is it important? CO1 K2 2

What types of data are typically used in data

What are the primary functionalities of data

How are data mining systems classified based

Short Answer Questions:

What is association analysis in data

What is the Apriori algorithm used

What is an FP-tree and how does it

What is meant by multilevel

What are multi-dimensional

Long Answer Questions:

Explain the concept of association

Elaborate on the Apriori algorithm for

Describe the FP-tree (Frequent Pattern

Describe is multilevel association rule

Discuss the concept of multi-

Explain the role of correlation

Short Answer Questions:

What is the primary goal of classification

What is a class label in the context of K1

What are the typical steps involved in the K1

How is the training set used in a K1

What is the Gini index, and how is it used K1

Long Answer Questions:

Describe the classification problem in

Outline the general approach to solving a K4

Explain the process of decision tree induction.

Describe the rule-based classifiers, and how

Explain the k-nearest neighbor (k-NN)

Short Answer Questions:

1 What is the primary goal of cluster analysis? CO4 K1 2

Why are similarity and distance metrics crucial K2

What are the key characteristics of a good K1

What is the basic principle of partition-based K1

How does BIRCH handle large datasets K2

Long Answer Questions:

Explain the main objectives of cluster analysis and

Describe the importance of similarity and distance

Identify and discuss the key characteristics that define

Examine the challenges associated with clustering

Compare k-means with other partition-based clustering

Short Answer Questions:

What is a data stream, and how does it

What is a time series, and how is it used

How do time series forecasting models

What is the goal of sequence pattern

Explain the difference between frequent

Discuss the key

Compare and contrast

Describe the process of

Explain how the

Analyze the unique

You might also like