Welcome to Scribd!

0% found this document useful (0 votes)

25 views

2020-Dec CS-719 40

Uploaded by

This document contains a summary of 6 questions from a Data Mining exam. The questions cover topics like hierarchical clustering, association rule mining using Apriori and FP-Growth algorithms, data warehouse modeling, PageRank algorithm, and decision tree induction. The questions ask students to perform tasks like computing metrics, drawing diagrams, enumerating schemas, finding frequent itemsets, and calculating information gain.

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

2020-Dec CS-719 40

Uploaded by

Sahil Choudhary

0% found this document useful (0 votes)

25 views3 pages

Original Title

2020-Dec_CS-719_40

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

0% found this document useful (0 votes)

25 views3 pages

2020-Dec CS-719 40

Uploaded by

Sahil Choudhary

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

Jump to Page

You are on page 1of 3

Search inside document

Roll Number:__________________________ No.

of Pages: 03

National Institute of Technology, Hamirpur (H.P.)

Department of Computer Science and Engineering

M. Tech./Ph.D.: Semester- I (AI) Course Code: CS-719

Course Name: Data Mining
December 17, 2020 Thursday, 15:00 – 17:00 Hrs
Time: 2 Hours, M. Marks: 50 Marks Name Of Faculty: VKC
Note: Attempt all questions in proper sequence. Assume missing data, if any, suitably.

Q1 Consider an organization tested eighteen randomly chosen persons. The age and fat 9 Marks
data are mentioned in the following table:

Age 23 23 27 27 39 41 47 49 50
%Fat 9.5 26.5 7.8 17.8 31.4 25.9 27.4 27.2 31.2
Age 52 54 54 56 57 58 58 60 61
%Fat 34.6 42.5 28.8 33.4 30.2 34.1 32.9 41.2 35.7

a) Compute the mean, median and standard deviation of age and %fat.
b) Draw the boxplots for Age and %Fat.
c) Normalize the two variables based on z-score normalization.
d) Compute the Pearson correlation coefficient. Are these two variables
positively or negatively correlated?

Q2 Compute the hierarchical F-measure for the eight objects {p1, p2, p3, p4, p5, p6, p7, p8} 9 Marks
and hierarchical clustering shown in Fig. 1. Class A contains points p1, p2, and p3,
while p4, p5, p6, p7, and p8 belong to class B.

Fig. 1. Hierarchical Clustering

1
Q3 Consider the dataset consists of five transactions. Assume min_suppport = 60% and 8 Marks
min_confidence= 80%.

TID Items Bought

T1 {M,O,N,K,E,Y}
T2 {D,O,N,K,E,Y}
T3 {M,A,K,E}
T4 {M,U,C,K,Y}
T5 {C,O,O,K,I,E}

a) Find all frequent itemsets using Apriori and FP-growth, respectively. Compare the
efficiency of the two mining processes.
b) List all of the strong association rules (with support s and confidence c) matching the
following meta rule, where X is a variable representing customers, and itemi denotes
variables representing items (e.g., "A", "B", etc.):
x  transcation, buys  X , item1   buys  X , item2   buys  X , item3   s, c 

Q4 Consider a data warehouse consists of the three dimensions time, doctor, and patient, 8 Marks
and the two measures count and charge, where charge is the fee that a doctor charges a
patient for a visit.
(a) Enumerate three classes of schemas that are popularly used for modeling data
warehouses.
(b) Draw a schema diagram for the above data warehouse using one of the schema
classes listed in Question 4(a).
(c) Starting with the base cuboid [day; doctor; patient], what specific OLAP operations
should be performed in order to list the total fee collected by each doctor in 2020?
(d) To obtain the same list, write an SQL query assuming the data is stored in a
relational database with the schema fee (day, month, year, doctor, hospital, patient,
count, charge).

Q5 Consider the following four pages with damping factor = 0.85 and their links in context 8 Marks
to the Page Rank algorithm.

Page A has page rank of 1 and has one link to B.

Page B has page rank of 2 and has two links to C and D.
Page C has Page rank of 3 and has two links to B and D.
Page D has page rank of 2 and has three links to A, B, and C.

a) Find page rank for all the web pages using page rank algorithm.
b) Which page has the highest page rank?

2
Q6 Consider the following data set for class problem. Calculate the gain in the Gini index 8 Marks
when splitting on attributes X, Y and Z having values T=True and F=False.

X Y Z Class
T T T I
F F F II
T T F III
T F T I
F T F III
F F F II
F F T I
T F F II
F T F III
T T F III

a) Which attribute would the decision tree induction algorithm choose as root attribute
in the decision tree?
b) Build the decision tree for class problem.

Boxplot Activity
Document8 pages
Boxplot Activity
Gourav Singhal
0% (1)
Assignment-2 3
Document4 pages
Assignment-2 3
botiwa
No ratings yet
B.Tech May2022 Comp CSPE-64 Sem4
Document4 pages
B.Tech May2022 Comp CSPE-64 Sem4
ankit12012064
No ratings yet
Test 12020
Document2 pages
Test 12020
Ajay Kumar
No ratings yet
Important Questions of Machine Learning
Document5 pages
Important Questions of Machine Learning
zeeshanahmad12030
No ratings yet
DWBI Assignment 17B
Document3 pages
DWBI Assignment 17B
Siri Penmetsa
0% (1)
Reg. No.: Name:: Architecture You Would Choose. What Is The Purpose of Each Component of This Architecture?
Document2 pages
Reg. No.: Name:: Architecture You Would Choose. What Is The Purpose of Each Component of This Architecture?
Delvin company
No ratings yet
SYDSc SEM III (OCT 2023)
Document8 pages
SYDSc SEM III (OCT 2023)
dheerajv5610
No ratings yet
Assign em NT
Document2 pages
Assign em NT
Kholiator sss
No ratings yet
640005
Document4 pages
640005
Swetang Khatri
No ratings yet
10 Questions BBA (Stat - 1) (19 Pages)
Document28 pages
10 Questions BBA (Stat - 1) (19 Pages)
Bangladesh Gonit Foundation
No ratings yet
ML Question Bank
Document7 pages
ML Question Bank
arunwaghmare5
No ratings yet
DAV Guidelines
Document4 pages
DAV Guidelines
akshatswamiisro
No ratings yet
15A05602 Data Warehousing & Mining
Document2 pages
15A05602 Data Warehousing & Mining
Chitra Madhuri Yashoda
No ratings yet
ML QP
Document6 pages
ML QP
Ashok
No ratings yet
DWM - Assignment 2 - July2024
Document2 pages
DWM - Assignment 2 - July2024
kalulalu144
No ratings yet
Model Paper of Business Statics MBA
Document5 pages
Model Paper of Business Statics MBA
Piyush Kumar
100% (1)
DWDM Ii Mid Paper
Document2 pages
DWDM Ii Mid Paper
adapureddisaranya
No ratings yet
STATISTICS
Document6 pages
STATISTICS
shobha mahadeva
No ratings yet
April, 2007 Fundamental IT Engineer Examination (Morning) : Questions Must Be Answered in Accordance With The Following
Document32 pages
April, 2007 Fundamental IT Engineer Examination (Morning) : Questions Must Be Answered in Accordance With The Following
Denz Tajo
No ratings yet
Dwm Question Bank Winter 2024
Document4 pages
Dwm Question Bank Winter 2024
sahillanjewar294
No ratings yet
Document
Document4 pages
Document
Nathaniel Adika
No ratings yet
COMP1942 Question Paper
Document5 pages
COMP1942 Question Paper
pakaMuziki
No ratings yet
DWDM Unit Wise Question Bank
Document8 pages
DWDM Unit Wise Question Bank
beastboy232472
No ratings yet
MATHS XII COM - Pre Prelim Paper (2023)
Document8 pages
MATHS XII COM - Pre Prelim Paper (2023)
SALMA ANSARI
No ratings yet
Assignment Booklet PGDAST Jan-Dec 2018
Document35 pages
Assignment Booklet PGDAST Jan-Dec 2018
sumit_waghmare
No ratings yet
Prelim in Are Exam Maths II
Document3 pages
Prelim in Are Exam Maths II
AMIN BUHARI ABDUL KHADER
No ratings yet
Mid Term Question 241 CSE4889 a DMF
Document1 page
Mid Term Question 241 CSE4889 a DMF
Nafis Hossain
No ratings yet
Economics Sample Paper Class 11th 23-24 - 240205 - 124435
Document5 pages
Economics Sample Paper Class 11th 23-24 - 240205 - 124435
Gauransh Dhing
No ratings yet
Midterm F07 Solutions
Document4 pages
Midterm F07 Solutions
Kamal Jack
No ratings yet
II PUC Statistics Mock Paper I
Document4 pages
II PUC Statistics Mock Paper I
mohanraokp2279
No ratings yet
15A05602 Data Warehousing & Mining
Document2 pages
15A05602 Data Warehousing & Mining
Chitra Madhuri Yashoda
No ratings yet
Mid Term Paper 2019-20
Document8 pages
Mid Term Paper 2019-20
bushrakhan85883
No ratings yet
Mid Semster Exam QP
Document2 pages
Mid Semster Exam QP
Kriti Goyal
100% (2)
CST466 DATA MINING, OCTOBER 2023.pdf - Crdownload
Document3 pages
CST466 DATA MINING, OCTOBER 2023.pdf - Crdownload
20b739
No ratings yet
3) Algorithms, Data Structures and Computability
Document18 pages
3) Algorithms, Data Structures and Computability
bacexam229
No ratings yet
It 404 Data Analytics Semester Viii Assignment - I (Modules I & Ii)
Document3 pages
It 404 Data Analytics Semester Viii Assignment - I (Modules I & Ii)
midhuna
No ratings yet
Matlab-Exercises 2 PDF
Document4 pages
Matlab-Exercises 2 PDF
Ahmed Jamal
No ratings yet
II PUC-StatisticsPracticeQPEng23-24 I - IV
Document16 pages
II PUC-StatisticsPracticeQPEng23-24 I - IV
lahariaullal
No ratings yet
I) Solve Any Two Questions From Each Section. Ii) Assume Suitable Data If Necessary and State Clearly
Document1 page
I) Solve Any Two Questions From Each Section. Ii) Assume Suitable Data If Necessary and State Clearly
Sujay Hv
No ratings yet
IP Program File
Document21 pages
IP Program File
ps2128128
No ratings yet
Cost QB
Document4 pages
Cost QB
ajs3313746
No ratings yet
Math102-Problem Set 3.1-Sem2-Sy2018-2019
Document5 pages
Math102-Problem Set 3.1-Sem2-Sy2018-2019
Girard Immanuel Soriano
No ratings yet
CST 204 Database Management Systems, June 2023
Document4 pages
CST 204 Database Management Systems, June 2023
binnytmz
No ratings yet
Assignment 2018-19 DBMS
Document9 pages
Assignment 2018-19 DBMS
santa
No ratings yet
Unit
Document11 pages
Unit
muneeb.engineer10
No ratings yet
ML - TH - Assignment 2 - 2024-25 - TA1728472836250
Document4 pages
ML - TH - Assignment 2 - 2024-25 - TA1728472836250
skubuntu27
No ratings yet
Priority Questions
Document12 pages
Priority Questions
sainaresh2727
No ratings yet
Computer Science_sr Centum Work Sheet
Document19 pages
Computer Science_sr Centum Work Sheet
som555129
No ratings yet
CE003323 ADS Exam2
Document5 pages
CE003323 ADS Exam2
Ramesh Bishwas
No ratings yet
ECO4016F 2011 Tutorial 7
Document6 pages
ECO4016F 2011 Tutorial 7
Swazzy12
No ratings yet
QP Final XI CS SET A
Document6 pages
QP Final XI CS SET A
Rakesh Soni
No ratings yet
6th Sem Pyq Paper
Document12 pages
6th Sem Pyq Paper
Ajay Kumar
No ratings yet
END SEM LAST YEAR CSE+IT (Except OOPs)
Document10 pages
END SEM LAST YEAR CSE+IT (Except OOPs)
John Cena
No ratings yet
Mba 3 Sem Business Analytics 18mba302e 2020
Document2 pages
Mba 3 Sem Business Analytics 18mba302e 2020
HoD MBA
100% (1)
CSC220 356 133-CSC220
Document5 pages
CSC220 356 133-CSC220
Aniket Ambekar
No ratings yet
Btech Cse 6 Sem Compiler Design Pcs6i102 2019
Document2 pages
Btech Cse 6 Sem Compiler Design Pcs6i102 2019
bipulswain365
No ratings yet
Math Practice Tests For The ACT
From Everand
Math Practice Tests For The ACT
Vibrant Publishers
No ratings yet
100 Puzzles to Learn Data Warehousing
From Everand
100 Puzzles to Learn Data Warehousing
Cristian Scutaru
No ratings yet
Master Fundamental Concepts of Math Olympiad: Maths, #1
From Everand
Master Fundamental Concepts of Math Olympiad: Maths, #1
Subbalakshmi Devaki
No ratings yet
2020-Dec CHD-314 110
Document2 pages
2020-Dec CHD-314 110
Sahil Choudhary
No ratings yet
2020-Dec CHD-413 93
Document1 page
2020-Dec CHD-413 93
Sahil Choudhary
No ratings yet
2020-Dec CHD-412 113
Document2 pages
2020-Dec CHD-412 113
Sahil Choudhary
No ratings yet
2020-Dec ECD-412 50
Document2 pages
2020-Dec ECD-412 50
Sahil Choudhary
No ratings yet
2020-Dec AR-613 5
Document8 pages
2020-Dec AR-613 5
Sahil Choudhary
No ratings yet
2020-Dec CE-306 74
Document3 pages
2020-Dec CE-306 74
Sahil Choudhary
No ratings yet
11 - Chapter 6 PDF
Document14 pages
11 - Chapter 6 PDF
Mansoor Khanali
No ratings yet
3.3.3. LAB PRACTICE - MSF Hacking Windows10 Lab1 v1-1
Document12 pages
3.3.3. LAB PRACTICE - MSF Hacking Windows10 Lab1 v1-1
CONTACTS CONTACTS
No ratings yet
(MPPO) : Managing People & Performance in Organisations
Document19 pages
(MPPO) : Managing People & Performance in Organisations
Gaurav Kumar
No ratings yet
SOLAR THERMAL POWER PLANT Revanth
Document20 pages
SOLAR THERMAL POWER PLANT Revanth
A Akash
No ratings yet
MCQ International Business
Document19 pages
MCQ International Business
Zeeshan Ahmad
100% (10)
EBS Pre Built Content UPK
Document205 pages
EBS Pre Built Content UPK
thiagoom
100% (1)
Well Services QHSE Standard 23 Guideline 09: CT Reel Swivel and Stub Shaft Inspection and Test
Document11 pages
Well Services QHSE Standard 23 Guideline 09: CT Reel Swivel and Stub Shaft Inspection and Test
CiprianHn
No ratings yet
What Is 5S Principle?: Training On Housekeeping
Document56 pages
What Is 5S Principle?: Training On Housekeeping
ALOKE GANGULY
No ratings yet
Final - SHS - OrgMgt Q2 Module 1
Document32 pages
Final - SHS - OrgMgt Q2 Module 1
TOOTSY BOY Ahyong
No ratings yet
Calibration Dryer
Document1 page
Calibration Dryer
A.R.
No ratings yet
Incorporation of Companies PDF
Document16 pages
Incorporation of Companies PDF
vandana gupta
No ratings yet
Leave No Trace - Powerpoint
Document33 pages
Leave No Trace - Powerpoint
api-377875752
No ratings yet
Selection Factors: Loads, Speeds, and Life Requirements. Load Magnitude (And Direction) Affects The Selection of
Document1 page
Selection Factors: Loads, Speeds, and Life Requirements. Load Magnitude (And Direction) Affects The Selection of
deecrankson
No ratings yet
AISC DG31 Example 002
Document12 pages
AISC DG31 Example 002
alejandro mantilla
No ratings yet
Example of Intrinsic Motivation Strategies
Document3 pages
Example of Intrinsic Motivation Strategies
dotdotPindot
No ratings yet
Daftar Riwayat Hidup Prof. Dr. Ketut Buda Artana, S.T., M.Sc.
Document3 pages
Daftar Riwayat Hidup Prof. Dr. Ketut Buda Artana, S.T., M.Sc.
Donny Agustian
No ratings yet
Marine Flushing Oil A
Document2 pages
Marine Flushing Oil A
Jicheng Piao
No ratings yet
22 0252 VIP MenuUpdate QC EN NC
Document9 pages
22 0252 VIP MenuUpdate QC EN NC
AmA Channel
No ratings yet
Automatic Air Suspension System
Document16 pages
Automatic Air Suspension System
Vijay Sai
No ratings yet
Tesla 1
Document68 pages
Tesla 1
Julian Erlandson
No ratings yet
Petitioner
Document29 pages
Petitioner
Rooshna
0% (1)
Common Computer Connector Types
Document36 pages
Common Computer Connector Types
Daisy Mangahas Balerite
No ratings yet
Als Dmea Q3 Cy 2023
Document91 pages
Als Dmea Q3 Cy 2023
Nilo Zolina
No ratings yet
Company List-1
Document7 pages
Company List-1
Yash gupta
No ratings yet
Project Budgets Export - 86
Document3 pages
Project Budgets Export - 86
hardik
No ratings yet
49 - Spinning - Amsler
Document1 page
49 - Spinning - Amsler
Wajih Hashmi
No ratings yet
ITR 2021-22 - Compressed
Document36 pages
ITR 2021-22 - Compressed
joshikrishnakumar19
No ratings yet
Fair Tax
Document1 page
Fair Tax
Merritt Marcella
No ratings yet
TWT 1445 Target Sample
Document216 pages
TWT 1445 Target Sample
Mohammed
No ratings yet
Se 322
Document24 pages
Se 322
mm1251838
No ratings yet