Data Mining and Warehousing
f. Find the cosine similarity between the following two term-frequency vectors: CO-2 PO-1
X=[3,2,0,5,0,0,0,2,0,0]
Y=[1,0,0,0,0,0,0,1,0,2]
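For reference only, a minimal Python sketch of the cosine-similarity computation in part (f) above; the vectors come from the question, and the formula used is cos(X, Y) = (X . Y) / (||X|| ||Y||):

```python
import math

X = [3, 2, 0, 5, 0, 0, 0, 2, 0, 0]
Y = [1, 0, 0, 0, 0, 0, 0, 1, 0, 2]

dot = sum(x * y for x, y in zip(X, Y))        # X . Y
norm_x = math.sqrt(sum(x * x for x in X))     # ||X||
norm_y = math.sqrt(sum(y * y for y in Y))     # ||Y||

cosine = dot / (norm_x * norm_y)
print(round(cosine, 3))   # dot = 5, ||X|| = sqrt(42), ||Y|| = sqrt(6), so about 0.315
```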
g. What is an attribute selection measure? CO-3 PO-1
i. Give two examples of activation functions used in neural networks. CO-3 PO-2
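Purely as an illustration for part (i) above, two activation functions commonly given as answers (the logistic sigmoid and ReLU), written as a small Python sketch:

```python
import math

def sigmoid(x):
    # logistic activation, output in (0, 1)
    return 1.0 / (1.0 + math.exp(-x))

def relu(x):
    # rectified linear unit, output in [0, infinity)
    return max(0.0, x)

print(sigmoid(0.0), relu(-2.0), relu(3.0))   # 0.5 0.0 3.0
```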
3.a. Briefly outline how to compute the dissimilarity between objects described by 5 CO-1 PO-2
the following types of variables:
i. Numerical (interval-scaled) variables
ii. Categorical variables
iii. Ratio-scaled variables
iv. Nonmetric vector objects
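As an informal companion to 3(a) (not part of the paper), one plausible coding of the four cases, assuming Euclidean distance for interval-scaled variables, simple matching for categorical variables, a log transform followed by Euclidean distance for ratio-scaled variables, and 1 - cosine similarity for nonmetric vector objects:

```python
import math

def interval_dissim(x, y):
    # i. Interval-scaled: Euclidean distance (Minkowski with q = 2)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def categorical_dissim(x, y):
    # ii. Categorical: simple matching, d = (p - m) / p
    p = len(x)
    m = sum(1 for a, b in zip(x, y) if a == b)
    return (p - m) / p

def ratio_scaled_dissim(x, y):
    # iii. Ratio-scaled: log-transform, then treat as interval-scaled
    return interval_dissim([math.log(a) for a in x], [math.log(b) for b in y])

def vector_dissim(x, y):
    # iv. Nonmetric vector objects: 1 - cosine similarity
    dot = sum(a * b for a, b in zip(x, y))
    norms = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y))
    return 1 - dot / norms

print(categorical_dissim(["red", "M"], ["red", "S"]))   # 0.5 on a toy pair
```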
b. Explain the steps of KDD, with the help of a diagram. 5 CO-1 PO-1
(OR)
c. Suppose that a hospital tested the age and body fat data for 18 randomly 10 CO-2 PO-2
selected adults with the following results:
Age 23 23 27 27 39 41 47 49 50
% fat 9.5 26.5 7.8 17.8 31.4 25.9 27.4 27.2 31.2
Age 52 54 54 56 57 58 58 60 61
% fat 34.6 42.5 28.8 33.4 30.2 34.1 32.9 41.2 35.7
i. Calculate the mean, median, and standard deviation of age and % fat.
ii. Find the covariance and correlation between these two attributes.
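A quick Python sketch for 3(c); the age and % fat values are copied from the question, and the population (n) formulas are assumed here, although the expected answer may use the sample (n - 1) versions instead:

```python
import statistics as st

age = [23, 23, 27, 27, 39, 41, 47, 49, 50, 52, 54, 54, 56, 57, 58, 58, 60, 61]
fat = [9.5, 26.5, 7.8, 17.8, 31.4, 25.9, 27.4, 27.2, 31.2,
       34.6, 42.5, 28.8, 33.4, 30.2, 34.1, 32.9, 41.2, 35.7]

n = len(age)
mean_age, mean_fat = st.mean(age), st.mean(fat)
print(mean_age, st.median(age), st.pstdev(age))   # mean, median, population std of age
print(mean_fat, st.median(fat), st.pstdev(fat))   # mean, median, population std of % fat

cov = sum((a - f_mean) for a, f_mean in [(0, 0)])  # placeholder removed below
cov = sum((a - mean_age) * (f - mean_fat) for a, f in zip(age, fat)) / n
corr = cov / (st.pstdev(age) * st.pstdev(fat))
print(cov, corr)                                  # covariance and Pearson correlation
```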
4.a. Explain how the Apriori algorithm is used for mining frequent itemsets. 5 CO-2 PO-1
b. What are the measures of interestingness for an association rule? Define a 5 CO-2 PO-2
strong association rule.
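To make the measures in 4(b) concrete, a toy Python sketch (the transactions here are invented for illustration): support(A => B) = P(A and B together) and confidence(A => B) = P(B | A); a rule is strong when both meet the user-specified minimum support and minimum confidence thresholds.

```python
# Toy transaction database, not taken from the paper.
transactions = [{"A", "C", "D"}, {"B", "C", "E"}, {"A", "B", "C", "E"}, {"B", "E"}]

def support(itemset):
    # fraction of transactions containing every item in the itemset
    return sum(1 for t in transactions if itemset <= t) / len(transactions)

def confidence(antecedent, consequent):
    # support of the whole rule divided by support of the antecedent
    return support(antecedent | consequent) / support(antecedent)

print(support({"A", "C"}))        # 0.5
print(confidence({"A"}, {"C"}))   # 1.0
```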
(OR)
c. There are five transactions (T1, T2, T3, T4, T5) with items (A, B, C, D) purchased 10 CO-3 PO-2
as T1(B, C), T2(A, C, D), T3(B, C), T4(A, B, C, D), T5(B, D). The min_sup = 2.
Show how the Apriori rule mining algorithm generates the association rules
for the above dataset.
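A rough Apriori sketch for 4(c); the transactions and min_sup = 2 are taken from the question, while the join/prune coding details are only illustrative:

```python
from itertools import combinations

transactions = [{"B", "C"}, {"A", "C", "D"}, {"B", "C"}, {"A", "B", "C", "D"}, {"B", "D"}]
MIN_SUP = 2   # absolute support count from the question

def count(itemset):
    return sum(1 for t in transactions if itemset <= t)

# L1: frequent 1-itemsets
items = sorted({i for t in transactions for i in t})
frequent = [{frozenset([i]) for i in items if count(frozenset([i])) >= MIN_SUP}]

# Lk: join L(k-1) with itself, then prune candidates below min_sup
k = 2
while frequent[-1]:
    candidates = {a | b for a in frequent[-1] for b in frequent[-1] if len(a | b) == k}
    frequent.append({c for c in candidates if count(c) >= MIN_SUP})
    k += 1

# Rule generation: X -> (itemset - X) with confidence = sup(itemset) / sup(X)
all_frequent = set().union(*frequent)
for itemset in sorted(all_frequent, key=len):
    for r in range(1, len(itemset)):
        for antecedent in map(frozenset, combinations(itemset, r)):
            conf = count(itemset) / count(antecedent)
            print(set(antecedent), "->", set(itemset - antecedent),
                  "sup =", count(itemset), "conf =", round(conf, 2))
```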
5.a. What is the decision tree algorithm? List the attribute selection measures 5 CO-2 PO-2
used by the ID3 algorithm to construct a decision tree.
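For 5(a), a small Python sketch of information gain, the attribute selection measure used by ID3; the toy rows and labels below are invented purely for illustration:

```python
import math
from collections import Counter

def entropy(labels):
    # Info(D) = -sum p_i * log2(p_i) over the class proportions
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def information_gain(rows, labels, attr_index):
    # Gain(A) = Info(D) - sum_j (|D_j| / |D|) * Info(D_j), splitting on attribute A
    total = len(labels)
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[attr_index], []).append(label)
    expected = sum(len(part) / total * entropy(part) for part in partitions.values())
    return entropy(labels) - expected

rows = [("yes",), ("yes",), ("no",), ("no",)]   # one toy binary attribute
labels = ["+", "+", "-", "+"]
print(round(information_gain(rows, labels, 0), 3))   # about 0.311
```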
b. Write a short note on the Naïve Bayes classifier. 5 CO-2 PO-1
(OR)
c. A multilayer feed-forward neural network is shown in the figure below. Let the 10 CO-3 PO-2
learning rate be 0.9. The initial weight and bias values of the network are given
in the table below, along with the first training tuple, X = (1, 0, 1), with a class
label of 1. Compute the net input, output, and error at each node, and update the
weight and bias values just once. Use the logistic activation function at nodes 4, 5, and 6.
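For 5(c), a sketch of one forward and one backward pass with the logistic activation and learning rate 0.9; because the figure and table are not reproduced in this text, the weight and bias values below are placeholders and must be replaced by the ones given in the question:

```python
import math

LEARNING_RATE = 0.9
x = {1: 1.0, 2: 0.0, 3: 1.0}            # training tuple X = (1, 0, 1)
target = 1.0                            # class label

inputs = {4: (1, 2, 3), 5: (1, 2, 3), 6: (4, 5)}   # assumed feed-forward topology
# PLACEHOLDER weights w[(i, j)] (node i -> node j) and biases b[j]:
# substitute the values from the table in the question.
w = {(1, 4): 0.1, (2, 4): 0.1, (3, 4): 0.1,
     (1, 5): 0.1, (2, 5): 0.1, (3, 5): 0.1,
     (4, 6): 0.1, (5, 6): 0.1}
b = {4: 0.1, 5: 0.1, 6: 0.1}

sigmoid = lambda v: 1.0 / (1.0 + math.exp(-v))

# Forward pass: net input I_j = sum_i w_ij * O_i + b_j, output O_j = sigmoid(I_j)
out = dict(x)
for j in (4, 5, 6):
    net = sum(w[(i, j)] * out[i] for i in inputs[j]) + b[j]
    out[j] = sigmoid(net)
    print(f"node {j}: net input = {net:.4f}, output = {out[j]:.4f}")

# Backward pass: error at the output node, then at the hidden nodes (old weights)
err = {6: out[6] * (1 - out[6]) * (target - out[6])}
for j in (4, 5):
    err[j] = out[j] * (1 - out[j]) * err[6] * w[(j, 6)]
print("errors:", {j: round(e, 4) for j, e in err.items()})

# Single update of weights and biases
for (i, j) in w:
    w[(i, j)] += LEARNING_RATE * err[j] * out[i]
for j in b:
    b[j] += LEARNING_RATE * err[j]
```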
6.a. Why is outlier mining important? Briefly describe the different approaches 5 CO-2 PO-2
behind distance-based outlier detection and density-based local outlier
detection.
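To illustrate the distance-based approach in 6(a), a toy Python sketch of the DB(pct, dmin) outlier notion: an object is flagged when at least a fraction pct of the remaining objects lie more than dmin away (the data points and thresholds here are invented):

```python
import math

points = [(1, 1), (1, 2), (2, 1), (2, 2), (10, 10)]   # toy data; (10, 10) is isolated
PCT, DMIN = 0.8, 3.0                                  # illustrative thresholds

for i, p in enumerate(points):
    others = [q for j, q in enumerate(points) if j != i]
    far = sum(1 for q in others if math.dist(p, q) > DMIN)
    if far / len(others) >= PCT:
        print(p, "is a DB(0.8, 3.0) distance-based outlier")
```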
b. Given two objects represented by the tuples (22, 1, 42, 10) and (20, 0, 36, 8): 5 CO-2 PO-1
Compute the Minkowski distance between the two objects, using q = 3.
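A quick numeric check for 6(b), using the two tuples and q = 3 from the question:

```python
x = (22, 1, 42, 10)
y = (20, 0, 36, 8)
q = 3

# Minkowski distance: (sum |x_i - y_i|^q) ^ (1/q)
d = sum(abs(a - b) ** q for a, b in zip(x, y)) ** (1 / q)
print(round(d, 3))   # 2^3 + 1^3 + 6^3 + 2^3 = 233, cube root is about 6.153
```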
(OR)
c. Both the k-means and k-medoids algorithms can perform effective clustering. 5 CO-3 PO-2
Illustrate the strengths and weaknesses of k-means in comparison with the
k-medoids algorithm.
d. Suppose that the data mining task is to cluster the following eight points (with 5 CO-3 PO-2
(x, y) representing location) into three clusters:
A1(2, 10), A2(2, 5), A3(8, 4), B1(5, 8), B2(7, 5), B3(6, 4), C1(1, 2), C2(4, 9).
The distance function is Euclidean distance. Suppose initially we assign A1,
B1, and C1 as the center of each cluster, respectively.
Use the k-means algorithm to show only
i. The three cluster centers after the first round of execution
ii. The final three clusters
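A Python sketch for 6(d); the eight points, the Euclidean distance function, and the initial centers A1, B1, C1 are taken from the question, and the loop prints the centers after each round until they stop changing (the coding details are only illustrative):

```python
import math

points = {"A1": (2, 10), "A2": (2, 5), "A3": (8, 4), "B1": (5, 8),
          "B2": (7, 5), "B3": (6, 4), "C1": (1, 2), "C2": (4, 9)}
centers = [points["A1"], points["B1"], points["C1"]]   # initial centers

for round_no in range(1, 20):
    # assign each point to its nearest center (Euclidean distance)
    clusters = {i: [] for i in range(3)}
    for name, p in points.items():
        nearest = min(range(3), key=lambda i: math.dist(p, centers[i]))
        clusters[nearest].append(name)
    # recompute each center as the mean of its cluster
    new_centers = [tuple(sum(points[n][d] for n in clusters[i]) / len(clusters[i])
                         for d in (0, 1)) for i in range(3)]
    print(f"round {round_no}: centers = {new_centers}, clusters = {clusters}")
    if new_centers == centers:      # converged: these are the final three clusters
        break
    centers = new_centers
```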
--- End of Paper ---