End Sem

This document summarizes an end semester exam for a course on data warehousing and data mining. It provides 8 questions covering topics like text mining, decision trees, clustering, association rule mining, and frequent itemset mining. Students are instructed to answer any 2 out of the first 3 questions and any 4 of the remaining questions.

Uploaded by

Sathiya Jothi

End Semester Exam Course Id: 406035 Data Warehousing and Data Mining Date: November 19th, 2003.

Total Time: 3 Hours

Max. Marks: 100

Answer any two questions from 1-3 and any four from the rest. Clearly state any reasonable assumptions you make.

1. (a) In a text mining application, 20 documents are retrieved for a given query. 7 of the retrieved documents are relevant. The total number of relevant documents in the database is 30. When 30 documents are retrieved for the same query, 10 are found to be relevant. Plot recall vs. precision for this text retrieval system.

(b) A classifier is tested on a set of test data. The classifier output and the correct class are shown below. Draw the confusion matrix for the classifier.

Srl No.            1   2   3   4   5   6   7   8   9
Classifier Output  C1  C1  C1  C2  C2  C2  C3  C3  C3
Correct Class      C2  C1  C3  C2  C2  C2  C1  C1  C1

[5+5=10]

2. What are semi-additive and non-additive facts? Give one example of each type. Give one example each of distributive, algebraic and holistic measures. [5X2=10]

3. Suppose that a data warehouse contains three dimensions: date, doctor and patient. There is only one measure, charge, which is the fee that a doctor charges a patient for a visit. Design a star schema for the data warehouse, assuming some concept hierarchy for each dimension. Starting with the base cuboid [date, doctor, patient], which sequence of OLAP operations do you need to list the total fee collected by each doctor in the year 2002? [8+2=10]
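As an illustration of the quantities behind question 1(a), the sketch below computes precision and recall at the two retrieval cutoffs given in the question. The function name `precision_recall` and the variable names are assumptions introduced for this example; the definitions used are the standard ones (precision = relevant retrieved / retrieved, recall = relevant retrieved / total relevant).

```python
def precision_recall(relevant_retrieved, retrieved, total_relevant):
    """Return (precision, recall) for one retrieval cutoff."""
    precision = relevant_retrieved / retrieved
    recall = relevant_retrieved / total_relevant
    return precision, recall


TOTAL_RELEVANT = 30               # relevant documents in the database
cutoffs = [(20, 7), (30, 10)]     # (documents retrieved, relevant among them)

# One (precision, recall) point per cutoff; these are the points to plot.
points = [precision_recall(hits, n, TOTAL_RELEVANT) for n, hits in cutoffs]

for (n, _), (p, r) in zip(cutoffs, points):
    print(f"top {n}: precision={p:.3f}, recall={r:.3f}")
```

Each tuple in `points` is one point on the recall vs. precision plot.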

4. Build a Decision Tree using the training data in the table given below. Divide the Height attribute into ranges as follows: (0, 1.6], (1.6, 1.7], (1.7, 1.8], (1.8, 1.9], (1.9, 2.0], (2.0, 5.0] [20]

Gender  Height   Class
F       1.6 m    Short
M       2 m      Tall
F       1.9 m    Medium
F       1.88 m   Medium
F       1.7 m    Short
M       1.85 m   Medium
F       1.6 m    Short
M       1.7 m    Short
M       2.2 m    Tall
M       2.1 m    Tall
F       1.8 m    Medium
M       1.95 m   Medium
F       1.9 m    Medium
F       1.8 m    Medium
F       1.75 m   Medium
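Question 4 does not name a splitting criterion. The sketch below assumes an ID3-style choice based on information gain (entropy reduction) and only compares the two candidate root attributes; it is not a full tree builder. The names `height_bin`, `entropy` and `info_gain` are hypothetical helpers introduced for this example.

```python
from bisect import bisect_left
from collections import Counter
from math import log2

# Upper edges of the height ranges given in the question: (0,1.6], (1.6,1.7], ...
EDGES = [0, 1.6, 1.7, 1.8, 1.9, 2.0, 5.0]

genders = "F M F F F M F M M M F M F F F".split()
heights = [1.6, 2.0, 1.9, 1.88, 1.7, 1.85, 1.6, 1.7,
           2.2, 2.1, 1.8, 1.95, 1.9, 1.8, 1.75]
classes = ("Short Tall Medium Medium Short Medium Short Short Tall Tall "
           "Medium Medium Medium Medium Medium").split()


def height_bin(h):
    """Map a height to its half-open range (low, high]."""
    i = bisect_left(EDGES, h)
    return (EDGES[i - 1], EDGES[i])


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum(c / n * log2(c / n) for c in Counter(labels).values())


def info_gain(attribute_values, labels):
    """Entropy of the labels minus the weighted entropy after the split."""
    n = len(labels)
    groups = {}
    for value, label in zip(attribute_values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


gain_gender = info_gain(genders, classes)
gain_height = info_gain([height_bin(h) for h in heights], classes)
print(f"gain(Gender)={gain_gender:.3f}, gain(Height)={gain_height:.3f}")
```

Under this assumption, the attribute with the larger gain would be placed at the root, and the same comparison would be repeated on each impure branch.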

5. There are 5 documents in a text database: A, B, C, D and E. The inter-document distance matrix is shown in the following table. Using an agglomerative hierarchical clustering algorithm, build and draw the dendrogram. You should use a step of 0.5. [20]

Document  A  B  C  D  E
A         0  1  2  2  3
B         1  0  2  4  3
C         2  2  0  1  5
D         2  4  1  0  3
E         3  3  5  3  0
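Question 5 does not say which linkage to use. The sketch below assumes single linkage (distance between two clusters is the smallest pairwise distance) and records the distance at which each merge happens; it merges at exact distances rather than stepping a threshold by 0.5. The function name `single_link` is an assumption introduced for this example.

```python
from itertools import combinations

labels = ["A", "B", "C", "D", "E"]
matrix = [
    [0, 1, 2, 2, 3],
    [1, 0, 2, 4, 3],
    [2, 2, 0, 1, 5],
    [2, 4, 1, 0, 3],
    [3, 3, 5, 3, 0],
]
# Nested dict so distances can be looked up by document name.
dist = {a: dict(zip(labels, row)) for a, row in zip(labels, matrix)}


def single_link(labels, dist):
    """Agglomerative clustering with single linkage.

    Returns a list of (cluster_a, cluster_b, merge_distance) tuples in
    the order the merges occur.
    """
    clusters = [frozenset([label]) for label in labels]
    merges = []
    while len(clusters) > 1:
        # Closest pair of clusters under single linkage.
        d, a, b = min(
            ((min(dist[x][y] for x in a for y in b), a, b)
             for a, b in combinations(clusters, 2)),
            key=lambda t: t[0],
        )
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
        merges.append((sorted(a), sorted(b), d))
    return merges


merges = single_link(labels, dist)
for a, b, d in merges:
    print(f"merge {a} + {b} at distance {d}")
```

The merge distances give the heights of the dendrogram joins; a different linkage (complete or average) would change the later merges.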

6. There are two clusters C1 and C2 formed from a dataset. The Clustering Feature (CF) vectors of these two clusters are: CF1 = (2, 8, 18) and CF2 = (3, 6, 14). Determine the following: [4X5=20]

a) Centroids of C1 and C2
b) Radii of C1 and C2
c) Diameters of C1 and C2
d) Average inter-cluster distance between C1 and C2, defined as:

\sqrt{ \frac{1}{n_1 n_2} \sum_{i \in C_1} \sum_{j \in C_2} (O_i - O_j)^2 }

{If the values become complex, work out till the last step and leave it there}
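The quantities in question 6 can be derived from the CF vectors alone. The sketch below assumes the BIRCH convention CF = (N, LS, SS) for one-dimensional data, where LS is the linear sum and SS the sum of squares, and assumes the square root form of the radius, diameter and average inter-cluster distance. It returns squared values and uses `cmath.sqrt` so that negative squares yield complex results instead of raising an error. All function names are hypothetical.

```python
import cmath

CF1 = (2, 8, 18)   # (N, LS, SS), assumed one-dimensional
CF2 = (3, 6, 14)


def centroid(cf):
    n, ls, _ = cf
    return ls / n


def radius_sq(cf):
    """Mean squared distance of members to the centroid: SS/N - (LS/N)^2."""
    n, ls, ss = cf
    return ss / n - (ls / n) ** 2


def diameter_sq(cf):
    """Mean squared pairwise distance: (2*N*SS - 2*LS^2) / (N*(N-1))."""
    n, ls, ss = cf
    return (2 * n * ss - 2 * ls ** 2) / (n * (n - 1))


def avg_inter_cluster_sq(cf_a, cf_b):
    """(n_b*SS_a + n_a*SS_b - 2*LS_a*LS_b) / (n_a*n_b)."""
    na, lsa, ssa = cf_a
    nb, lsb, ssb = cf_b
    return (nb * ssa + na * ssb - 2 * lsa * lsb) / (na * nb)


for name, cf in (("C1", CF1), ("C2", CF2)):
    print(name, "centroid:", centroid(cf),
          "radius:", cmath.sqrt(radius_sq(cf)),
          "diameter:", cmath.sqrt(diameter_sq(cf)))
print("average inter-cluster distance:",
      cmath.sqrt(avg_inter_cluster_sq(CF1, CF2)))
```

A negative squared value means the CF vector cannot come from real-valued points, which is the case the question's note about complex values covers.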

7. Consider the 5 transactions given below. If minimum support is 30% and minimum confidence is 80%, determine the frequent itemsets and association rules using the a priori algorithm. [15+5=20]

Transaction  Items
T1           Bread, Jelly, Butter
T2           Bread, Butter
T3           Bread, Milk, Butter
T4           Coke, Bread
T5           Coke, Milk
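The level-wise search asked for in question 7 can be sketched as follows. This is a minimal illustration, not an optimized implementation: the function names `apriori` and `association_rules` are assumptions introduced here, and support is recounted by scanning all transactions for every candidate.

```python
from itertools import combinations

transactions = [
    {"Bread", "Jelly", "Butter"},
    {"Bread", "Butter"},
    {"Bread", "Milk", "Butter"},
    {"Coke", "Bread"},
    {"Coke", "Milk"},
]


def apriori(transactions, min_support):
    """Return {frequent itemset: support} using level-wise candidate generation."""
    n = len(transactions)

    def support(itemset):
        return sum(itemset <= t for t in transactions) / n

    items = sorted({i for t in transactions for i in t})
    level = [frozenset([i]) for i in items
             if support(frozenset([i])) >= min_support]
    frequent = {}
    k = 1
    while level:
        for s in level:
            frequent[s] = support(s)
        prev = set(level)
        # Join step: unions of two frequent k-itemsets that have size k+1.
        candidates = {a | b for a in level for b in level if len(a | b) == k + 1}
        # Prune step: every k-subset must be frequent, then check support.
        level = [c for c in candidates
                 if all(frozenset(sub) in prev for sub in combinations(c, k))
                 and support(c) >= min_support]
        k += 1
    return frequent


def association_rules(frequent, min_conf):
    """Return (antecedent, consequent, confidence) for rules meeting min_conf."""
    rules = []
    for itemset, sup in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                lhs = frozenset(lhs)
                conf = sup / frequent[lhs]
                if conf >= min_conf:
                    rules.append((set(lhs), set(itemset - lhs), conf))
    return rules


frequent = apriori(transactions, min_support=0.30)
rules = association_rules(frequent, min_conf=0.80)
print(frequent)
print(rules)
```

With 5 transactions, a 30% minimum support means an itemset must appear in at least 2 transactions.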

8. Consider the following table of transactions. Each row represents a transaction and each column represents an item. If an item is present in a transaction, it is marked as 1, else it is marked as 0. Determine the Frequent Itemsets using the Dynamic Itemset Counting algorithm. Use intervals of 5 transactions and min_support = 20%. [20]

A1  A2  A3  A4  A5  A6  A7  A8  A9
1   0   0   0   1   1   0   1   0
0   1   0   1   0   0   0   1   0
0   0   0   1   1   0   1   0   0
0   0   1   0   0   0   0   0   0
0   0   0   0   1   1   1   0   0
0   1   1   1   0   0   0   0   0
0   1   0   0   0   1   1   0   1
0   0   0   0   1   0   0   0   0
0   0   0   0   0   0   0   1   0
0   0   1   0   1   0   1   0   0
0   0   1   0   1   0   1   0   0
0   0   0   0   1   1   0   1   0
0   1   0   1   0   1   1   0   0
1   0   1   0   1   0   1   0   0
0   1   1   0   0   0   0   0   1
