Introduction to Data Mining Assignment 2

This document is an assignment on data mining that includes questions about frequent itemset mining using Apriori and FP-growth algorithms, as well as implementation tasks for these algorithms in programming languages like C++ or Java. It also explores association rules, correlation relationships, and various measures of confidence in the context of supermarket transaction data. The assignment requires analysis of algorithm performance and correlation relationships based on given data sets.

Uploaded by

Ayesha Rahim

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views

Introduction to Data Mining Assignment 2

Uploaded by

Ayesha Rahim

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

Introduction to Data Mining

Assignment #2

Q#1: A database has five transactions. Let min-sup=60% and min-conf=80%

TID Items-bought
T100 {M, O, N, K, E, Y}
T200 {D, O, N, K, E, Y}
T300 {M, A, K, E}
T400 {M, U, C, K, Y}
T500 {C, O, O, K, I, E}
Find all frequent itemsets using Apriori and FP-growth, respectively. Compare the efficiency of
the two mining processes.
List all the strong association rules (with support s and confidence c) matching the following
metarule, where X is a variable representing customers, and item I denotes variables representing
items(e.g, “A,” “B”);)
∀x ∈ transaction, buys(X,item1) ∧ buys(X,item2) ⇒ buys(X,item3) [s,c]
Q#2: (Implementation project) Using a programming language that you are familiar with, such
as C++ or Java, implement three frequent itemset mining algorithms introduced in this chapter:
(1) Apriori [AS94b], (2) FP-growth [HPY00], and (3) Eclat [Zak00] (mining using the
vertical data format). Compare the performance of each algorithm with various kinds of large
data sets. Write a report to analyze the situations (e.g., data size, data distribution, minimal
support threshold setting, and pattern density) where one algorithm may perform better than the
others, and state why?
Q#3: Give a short example to show that items in a strong association rule actually may be
negatively correlated.
Q#4: The following contingency table summarizes supermarket transaction data, where hot dogs
refers to the transactions containing hot dogs, hot dogs refers to the transactions that do not
contain hot dogs, hamburgers refers to the transactions containing hamburgers, and hamburgers
refers to the transactions that do not contain hamburgers.

(a) Suppose that the association rule “hot dogs ⇒ hamburgers” is mined. Given a minimum
support threshold of 25% and a minimum confidence threshold of 50%, is this association rule
strong?
(b) Based on the given data, is the purchase of hot dogs independent of the purchase of
hamburgers? If not, what kind of correlation relationship exists between the two?
(c) Compare the use of the all confidence, max confidence, Kulczynski, and cosine measures
with lift and correlation on the given data.

FLU GSN 2021R1 EN WS06 Duct Vanes
No ratings yet
FLU GSN 2021R1 EN WS06 Duct Vanes
48 pages
Module 5 - Frequent Pattern Mining
No ratings yet
Module 5 - Frequent Pattern Mining
111 pages
Unit 2 Question and Answers Bdhdns
No ratings yet
Unit 2 Question and Answers Bdhdns
15 pages
Assignment 03
No ratings yet
Assignment 03
9 pages
Suppose A Student Collected The Price and Weight of 20 Products in A Shop With The Following Result
No ratings yet
Suppose A Student Collected The Price and Weight of 20 Products in A Shop With The Following Result
4 pages
I. Review Questions Chapter 4: Mining Frequent Patterns, Associations, Ad Corelations
No ratings yet
I. Review Questions Chapter 4: Mining Frequent Patterns, Associations, Ad Corelations
19 pages
Data Mining Practice Final Sol
No ratings yet
Data Mining Practice Final Sol
5 pages
DMDW Unit 4 Association 29.12.2020
No ratings yet
DMDW Unit 4 Association 29.12.2020
31 pages
DW Model Questions
No ratings yet
DW Model Questions
8 pages
ML Unit - Iii
No ratings yet
ML Unit - Iii
64 pages
Thabet Slimani - Efficiant Analysis of Pattern and Association Rule Mining Approaches
No ratings yet
Thabet Slimani - Efficiant Analysis of Pattern and Association Rule Mining Approaches
14 pages
Assignment (Association Rule Mining)
No ratings yet
Assignment (Association Rule Mining)
2 pages
Data Mining Unit 2 1
No ratings yet
Data Mining Unit 2 1
15 pages
Mining Frequent Patterns, Association and Correlations - Basic Concepts and Methods
No ratings yet
Mining Frequent Patterns, Association and Correlations - Basic Concepts and Methods
55 pages
Data Cube Computation and Data Generation
No ratings yet
Data Cube Computation and Data Generation
54 pages
CS 515 Data Warehousing and Data Mining
No ratings yet
CS 515 Data Warehousing and Data Mining
5 pages
DMDW Qa-3.2
No ratings yet
DMDW Qa-3.2
11 pages
Assignment 2
No ratings yet
Assignment 2
13 pages
Mining Frequent Itemset-Association Analysis
No ratings yet
Mining Frequent Itemset-Association Analysis
59 pages
Unit II
No ratings yet
Unit II
22 pages
GTU-COMPUTER-3160714-SUMMER-2023
No ratings yet
GTU-COMPUTER-3160714-SUMMER-2023
3 pages
p139 Data Mining Mafia
No ratings yet
p139 Data Mining Mafia
13 pages
Unit-4_Part-1
No ratings yet
Unit-4_Part-1
152 pages
Association Rule-A Tool For Data Mining: Praveen Ranjan Srivastava
No ratings yet
Association Rule-A Tool For Data Mining: Praveen Ranjan Srivastava
6 pages
DMDW_Association Analysis
No ratings yet
DMDW_Association Analysis
12 pages
Association Rule Mining
No ratings yet
Association Rule Mining
10 pages
Comparison of Two Association Rule Mining Algorith PDF
No ratings yet
Comparison of Two Association Rule Mining Algorith PDF
9 pages
Question Bank: Q1) What Is Data Warehouse?
No ratings yet
Question Bank: Q1) What Is Data Warehouse?
17 pages
association rule mapping -unit-4
No ratings yet
association rule mapping -unit-4
11 pages
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
No ratings yet
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
31 pages
Vi Sem Bca Qbank - Wcms - Fds
0% (1)
Vi Sem Bca Qbank - Wcms - Fds
11 pages
Closet - An Efficient Algorithm For Mining Frequent
No ratings yet
Closet - An Efficient Algorithm For Mining Frequent
8 pages
DWDM Unit 2 and 3
No ratings yet
DWDM Unit 2 and 3
31 pages
06 FPBasic
No ratings yet
06 FPBasic
69 pages
DM_U_2
No ratings yet
DM_U_2
16 pages
Unit_3 Mining Frequent Patterns
No ratings yet
Unit_3 Mining Frequent Patterns
10 pages
Data Mining UNIT 3 LECTURE NOTES
No ratings yet
Data Mining UNIT 3 LECTURE NOTES
13 pages
Understanding Association Rule in Data Mining
No ratings yet
Understanding Association Rule in Data Mining
4 pages
s13042-013-0172-6
No ratings yet
s13042-013-0172-6
11 pages
Unit-5 DWDM
No ratings yet
Unit-5 DWDM
7 pages
Mining Frequent Patterns and Associations
No ratings yet
Mining Frequent Patterns and Associations
52 pages
DWDM - Unit - IV
No ratings yet
DWDM - Unit - IV
67 pages
Module5 DMW
No ratings yet
Module5 DMW
13 pages
Unit 3 1
No ratings yet
Unit 3 1
34 pages
Comparative Evaluation of Association Rule Mining Algorithms With Frequent Item Sets
No ratings yet
Comparative Evaluation of Association Rule Mining Algorithms With Frequent Item Sets
7 pages
DM Unit - 2
No ratings yet
DM Unit - 2
14 pages
Data Mining-Knowledge Presentation 2: Prof. Sin-Min Lee
No ratings yet
Data Mining-Knowledge Presentation 2: Prof. Sin-Min Lee
54 pages
03. UNIT-III(DMWH6EM)
No ratings yet
03. UNIT-III(DMWH6EM)
24 pages
Association Rule Mining Using Apriori Al PDF
No ratings yet
Association Rule Mining Using Apriori Al PDF
11 pages
Mining Frequent Patterns, Associations and Correlations: Basic Concepts and Methods
No ratings yet
Mining Frequent Patterns, Associations and Correlations: Basic Concepts and Methods
20 pages
Data Analytics Unit 4
No ratings yet
Data Analytics Unit 4
22 pages
Association Rule Mining
No ratings yet
Association Rule Mining
72 pages
DWDM-UNIT-4
No ratings yet
DWDM-UNIT-4
12 pages
CH - 5
No ratings yet
CH - 5
43 pages
Online Course Assignments
No ratings yet
Online Course Assignments
8 pages
3final CH 5 Concept
No ratings yet
3final CH 5 Concept
101 pages
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Digital Signal Processing (DSP) with Python Programming
From Everand
Digital Signal Processing (DSP) with Python Programming
Maurice Charbit
No ratings yet
Co-Clustering: Models, Algorithms and Applications
From Everand
Co-Clustering: Models, Algorithms and Applications
Gérard Govaert
No ratings yet
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Mathematical Formulas for Economics and Business: A Simple Introduction
From Everand
Mathematical Formulas for Economics and Business: A Simple Introduction
K.H. Erickson
4/5 (4)
DOC-20221116-WA0070.
No ratings yet
DOC-20221116-WA0070.
3 pages
ch6 Cpu Scheduling
No ratings yet
ch6 Cpu Scheduling
29 pages
Lecture # 1 - Android Introduction
No ratings yet
Lecture # 1 - Android Introduction
31 pages
CH 4
No ratings yet
CH 4
14 pages
CH 2 Complete
No ratings yet
CH 2 Complete
39 pages
ch3 Part1
No ratings yet
ch3 Part1
26 pages
Web Application Assignment No 3
No ratings yet
Web Application Assignment No 3
4 pages
CH 1
No ratings yet
CH 1
52 pages
National Income
No ratings yet
National Income
6 pages
Computer Graphics: Week 4: Presentation By: Ms. Ifrah Mansoor
No ratings yet
Computer Graphics: Week 4: Presentation By: Ms. Ifrah Mansoor
48 pages
Monopoly and Perfect Competition
No ratings yet
Monopoly and Perfect Competition
5 pages
CGPresentation Week2 (API, GPU&OpenGLInstallation)
No ratings yet
CGPresentation Week2 (API, GPU&OpenGLInstallation)
55 pages
Scanned With Camscanner
No ratings yet
Scanned With Camscanner
14 pages
Inter Process Communication
No ratings yet
Inter Process Communication
25 pages
CGPresentation Week1 (IntroductiontoCGandBasicConcepts)
No ratings yet
CGPresentation Week1 (IntroductiontoCGandBasicConcepts)
41 pages
Single Mode and Multi-Mode Optical Fibers Optiwave
No ratings yet
Single Mode and Multi-Mode Optical Fibers Optiwave
15 pages
Active Forums Admin Guide
No ratings yet
Active Forums Admin Guide
16 pages
3ds Max Design Essentials
No ratings yet
3ds Max Design Essentials
4 pages
MS28937
No ratings yet
MS28937
1 page
Lecture #2 - Data Warehouse Architecture
No ratings yet
Lecture #2 - Data Warehouse Architecture
6 pages
Earn 200$-300$ Per Month With ShareCash
No ratings yet
Earn 200$-300$ Per Month With ShareCash
8 pages
Tarak Nath Das CV
No ratings yet
Tarak Nath Das CV
2 pages
TTDS Lecture 2
No ratings yet
TTDS Lecture 2
40 pages
Rev F
No ratings yet
Rev F
27 pages
3 Steps To Time Series Forecasting LSTM With TensorFlow KerasA Practical Example in Python With Usefu
No ratings yet
3 Steps To Time Series Forecasting LSTM With TensorFlow KerasA Practical Example in Python With Usefu
15 pages
PIXEL CP Operators Manual
No ratings yet
PIXEL CP Operators Manual
81 pages
Right To Work Right Check Manager Guide
No ratings yet
Right To Work Right Check Manager Guide
23 pages
Jeppesen Mobile FliteDeck FAQ
No ratings yet
Jeppesen Mobile FliteDeck FAQ
5 pages
CBSE Practicals Manual
No ratings yet
CBSE Practicals Manual
24 pages
Brugermanual SACKit Speak 200
No ratings yet
Brugermanual SACKit Speak 200
24 pages
ResearchReport 60070701701 PDF
No ratings yet
ResearchReport 60070701701 PDF
7 pages
Expected Simplification Questions PDF For Bank PO/ Clerk Prelims Exam
100% (1)
Expected Simplification Questions PDF For Bank PO/ Clerk Prelims Exam
18 pages
Introduction To Innovation Management: By: Prof. Leah B. Bitao
No ratings yet
Introduction To Innovation Management: By: Prof. Leah B. Bitao
24 pages
Unit-2-Converter and HVDC System Control
No ratings yet
Unit-2-Converter and HVDC System Control
13 pages
ABW2011IQM15
No ratings yet
ABW2011IQM15
21 pages
ZMS
No ratings yet
ZMS
2 pages
IMCAM261 DP Station Keeping Events Summary 2022 nj3zf0
No ratings yet
IMCAM261 DP Station Keeping Events Summary 2022 nj3zf0
17 pages
FAQs - Applanix POSPac Term License
No ratings yet
FAQs - Applanix POSPac Term License
2 pages
Intranets and Extranets
No ratings yet
Intranets and Extranets
12 pages
Final Learning Journal
No ratings yet
Final Learning Journal
31 pages
23 sep delhi to patna
No ratings yet
23 sep delhi to patna
3 pages
Bits Pilani Course Handout Ael
No ratings yet
Bits Pilani Course Handout Ael
6 pages
Latex Beamer Theme Following Aiming To Incorporate The Best of L Tex
No ratings yet
Latex Beamer Theme Following Aiming To Incorporate The Best of L Tex
5 pages
Web Information Gathering
No ratings yet
Web Information Gathering
5 pages

Introduction to Data Mining Assignment 2

Uploaded by

Introduction to Data Mining Assignment 2

Uploaded by

Introduction to Data Mining

Q#1: A database has five transactions. Let min-sup=60% and min-conf=80%

You might also like