DMDW Midsem Question

Uploaded by

K27 Sneha Bharti

Sub Name Code: DMDW

Subject Code: IT-4037


Program Name: B.Tech
Semester: V (Special Suppl.)
Special supplementary MID-SEMESTER - 2023 Year - 2023
KIIT, Deemed to be University, Bhubaneswar-24
Data Mining and Data Warehousing
IT 4037
Time: 1½ Hours Full Marks: 20
The figures in the margin indicate full marks. Candidates are required to give their answers in their
own words as far as practicable and all parts of a question should be answered at one place only.

(Answer any four questions including question No.1 which is compulsory)


Q1. Answer all the following questions. Provide appropriate examples, if necessary. [1×5 = 5]
(a) Describe the steps of the knowledge discovery process in data mining. CO1
(b) Define temporal, sequence and time-series databases. CO1
(c) What is a data warehouse? List the OLAP operations. CO3
(d) What methods are used to improve the efficiency of the Apriori algorithm? CO2
(e) What are the techniques for handling missing data in a database? CO1
Q2. (a) For the following data set, find the five-number summary, IQR, Tukey fences, [3]
and outliers (if any). CO1
Draw a box plot to describe the distribution of the data in the data set.
18, 34, 76, 29, 15, 41, 46, 25, 54, 38, 20, 32, 43, 22
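
A minimal Python sketch for checking an answer to Q2(a). It assumes the median-of-halves convention for quartiles and the usual 1.5 × IQR multiplier for the Tukey fences; other quartile conventions can give slightly different Q1 and Q3 values. The function name `five_number_summary` is illustrative only.

```python
from statistics import median

data = [18, 34, 76, 29, 15, 41, 46, 25, 54, 38, 20, 32, 43, 22]


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the median-of-halves rule."""
    xs = sorted(values)
    n = len(xs)
    half = n // 2
    lower = xs[:half]              # values below the median position
    upper = xs[half + (n % 2):]    # values above it (skip the median if n is odd)
    return xs[0], median(lower), median(xs), median(upper), xs[-1]


minimum, q1, med, q3, maximum = five_number_summary(data)
iqr = q3 - q1

# Tukey fences (assumed multiplier: 1.5)
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in sorted(data) if x < lower_fence or x > upper_fence]

print("five-number summary:", (minimum, q1, med, q3, maximum))
print("IQR:", iqr)
print("fences:", (lower_fence, upper_fence))
print("outliers:", outliers)
```

Under these assumptions the summary is 15, 22, 33, 43, 76 with IQR 21, fences at -9.5 and 74.5, and 76 as the only outlier.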

(b) Given two objects represented by the tuples (22, 1, 42, 10) and (20, 0, 36, 8), [2]
compute the following distances: CO2
i. Euclidean distance ii. Manhattan distance
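
As a quick check for Q2(b), the two distances can be computed directly from their definitions. This is only a sketch; the variable names are illustrative.

```python
import math

x = (22, 1, 42, 10)
y = (20, 0, 36, 8)

# Euclidean distance: square root of the sum of squared differences
euclidean = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

# Manhattan distance: sum of absolute differences
manhattan = sum(abs(a - b) for a, b in zip(x, y))

print("Euclidean:", round(euclidean, 3))  # sqrt(45), about 6.708
print("Manhattan:", manhattan)            # 11
```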

Q3. (a) Consider the following age values: 23, 23, 27, 27, 39, 41, 47, 49, 50. Use each of the [3]
following normalization methods to transform the age value 39. CO2
i) min-max normalization onto the range [0.0, 1.0]
ii) z-score normalization
iii) decimal scaling
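
The three normalizations in Q3(a) can be sketched in Python as follows. One assumption to note: the z-score here uses the population standard deviation (`statistics.pstdev`); using the sample standard deviation instead would give a slightly smaller value.

```python
from statistics import mean, pstdev

ages = [23, 23, 27, 27, 39, 41, 47, 49, 50]
v = 39

# i) min-max normalization onto [new_min, new_max] = [0.0, 1.0]
new_min, new_max = 0.0, 1.0
min_max = (v - min(ages)) / (max(ages) - min(ages)) * (new_max - new_min) + new_min

# ii) z-score normalization (assumption: population standard deviation)
z_score = (v - mean(ages)) / pstdev(ages)

# iii) decimal scaling: divide by 10^j, with j the smallest integer
#      such that every scaled absolute value is below 1
j = 0
while max(abs(a) for a in ages) / 10 ** j >= 1:
    j += 1
decimal_scaled = v / 10 ** j

print("min-max:", round(min_max, 4))        # 16/27, about 0.5926
print("z-score:", round(z_score, 3))        # about 0.261
print("decimal scaling:", decimal_scaled)   # j = 2
```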
(b) Illustrate the dimensionality reduction techniques and their importance. CO2 [2]
Q4.(a) Demonstrate the major tasks in data pre-processing. CO2 [3]
(b) What issues must be considered during data integration? CO2 [2]
Q5. Consider the transactional database below with minimum support 22% and [5]
minimum confidence 70%. CO4
Find out
a) Maximal frequent itemsets
b) Strong association rules

TID     List of item_IDs
T100    I1, I2, I5
T200    I2, I4
T300    I2, I3
T400    I1, I2, I4
T500    I1, I3
T600    I2, I3
T700    I1, I3
T800    I1, I2, I3, I5
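
For a database this small, an answer to Q5 can be checked by brute-force enumeration of all itemsets. The sketch below is not the Apriori algorithm itself (it does no level-wise candidate pruning); it simply counts support for every combination and then applies the 22% support and 70% confidence thresholds. All variable names are illustrative.

```python
from itertools import combinations

transactions = {
    "T100": {"I1", "I2", "I5"},
    "T200": {"I2", "I4"},
    "T300": {"I2", "I3"},
    "T400": {"I1", "I2", "I4"},
    "T500": {"I1", "I3"},
    "T600": {"I2", "I3"},
    "T700": {"I1", "I3"},
    "T800": {"I1", "I2", "I3", "I5"},
}
MIN_SUPPORT = 0.22
MIN_CONFIDENCE = 0.70

n = len(transactions)
items = sorted(set().union(*transactions.values()))

# Count support for every possible itemset and keep the frequent ones
frequent = {}
for k in range(1, len(items) + 1):
    for combo in combinations(items, k):
        itemset = frozenset(combo)
        count = sum(1 for t in transactions.values() if itemset <= t)
        if count / n >= MIN_SUPPORT:
            frequent[itemset] = count

# Maximal frequent itemsets: frequent sets with no frequent proper superset
maximal = [s for s in frequent if not any(s < other for other in frequent)]

# Strong rules A -> B: confidence = support(A ∪ B) / support(A)
rules = []
for itemset, count in frequent.items():
    for k in range(1, len(itemset)):
        for antecedent in combinations(sorted(itemset), k):
            a = frozenset(antecedent)
            confidence = count / frequent[a]  # every subset of a frequent set is frequent
            if confidence >= MIN_CONFIDENCE:
                rules.append((a, itemset - a, confidence))

print("maximal frequent itemsets:", [sorted(s) for s in maximal])
for a, b, conf in rules:
    print(sorted(a), "->", sorted(b), f"confidence = {conf:.2f}")
```

With 8 transactions, a 22% minimum support corresponds to a support count of at least 2.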
-------------------------------------------------------ALL THE BEST---------------------------------------------------------
