Data Mining Experiment 4
Roll No : 160622733182
Name : Tabasum Syed Tajamul
Date: 03/03/2025
Aim: Create the following supermarket data in .arff format
4(a) Apply the Apriori algorithm with support = 0.2, confidence = 0.5, and generate 5
frequent itemsets and rules
4(b) Apply the Apriori algorithm with support = 0.2, lift = 0.5, and generate 5 frequent
patterns and rules
(a) Apply the Apriori algorithm with support = 0.2, confidence = 0.5, and generate 5 frequent
itemsets and rules
Description:
Association Rule Mining: Association Rule Mining is a data mining technique used to identify
relationships between items in large datasets. It helps uncover patterns, such as which products
are frequently bought together in a store. Key metrics include support, which measures how
often an itemset appears in transactions; confidence, which indicates the likelihood of one item
appearing when another does; and lift, which evaluates the strength of an association beyond
random chance.
For example, a supermarket may discover that 80% of customers who buy bread also purchase
butter. This insight can help businesses optimize product placement and marketing strategies.
Popular algorithms for association rule mining include Apriori, which generates frequent
itemsets iteratively, and FP-Growth, which builds a tree structure to find patterns more
efficiently.
Market Basket Analysis: Market Basket Analysis (MBA) is a data mining technique used to identify
patterns in customer purchasing behavior. It helps businesses understand which products are
frequently bought together, enabling better decision-making in sales, marketing, and inventory
management. MBA uses association rule mining to discover relationships between items in
transaction data.
Frequent Item: A frequent item is an item or set of items that appears in a dataset with
a frequency above a specified threshold. In association rule mining, frequent items are identified
using the support metric, which measures how often an item or itemset appears in transactions.
An itemset is a collection of one or more items. If the occurrence of an itemset exceeds a
predefined minimum support threshold, it is considered frequent.
Support: The proportion of transactions that contain a particular item or itemset. It helps identify
frequently bought items.
Formula:
Support(X) = (Transactions containing X) / (Total Transactions)
Confidence: The probability that a customer who buys item X also buys item Y. It measures the
reliability of the association rule.
Formula:
Confidence(X → Y) = Support(X ∪ Y) / Support(X)
Lift: Measures how much more likely two items are bought together compared to random
chance.
Formula:
Lift(X → Y) = Confidence(X → Y) / Support(Y)
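To make these definitions concrete, the short Python sketch below computes support, confidence,
and lift by hand for a small made-up transaction list; the items and transactions are placeholders
used only to illustrate the formulas, not the lab dataset.

# Hypothetical transactions used only to illustrate the three metrics.
transactions = [
    {"bread", "butter"},
    {"bread", "butter", "milk"},
    {"bread", "juice"},
    {"butter", "milk"},
    {"bread", "butter", "juice"},
]
total = len(transactions)

def support(itemset):
    # Fraction of transactions that contain every item in the itemset.
    return sum(1 for t in transactions if itemset <= t) / total

def confidence(x, y):
    # Support of X and Y together divided by support of X.
    return support(x | y) / support(x)

def lift(x, y):
    # Confidence of X -> Y divided by the support of Y.
    return confidence(x, y) / support(y)

x, y = {"bread"}, {"butter"}
print(support(x | y))      # 3/5 = 0.60
print(confidence(x, y))    # 0.60 / 0.80 = 0.75
print(lift(x, y))          # 0.75 / 0.80 = 0.9375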
Algorithm Apriori:
1) Collect the dataset: Gather transactional data where each transaction contains a set of
items.
2) Generate frequent 1-itemsets (L1): Compute support for individual items and discard
those below the minimum support threshold.
3) Generate k-itemsets iteratively:
● Use frequent (k-1)-itemsets (Lk-1) to generate candidate k-itemsets (Ck).
● Prune non-frequent subsets and compute support for Ck.
● Retain itemsets meeting the minimum support threshold, forming Lk.
4) Repeat step 3 until no more frequent itemsets can be generated.
5) Extract association rules from the frequent itemsets and evaluate their strength using
confidence, keeping those above the minimum confidence threshold (a sketch of these steps is
given below).
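The following is a minimal Python sketch of the candidate-generation and pruning loop described
above, assuming min_support = 0.2 as in part (a). It only illustrates the steps; the lab itself runs
WEKA's Apriori on the .arff file, and the transaction data here is a placeholder.

from itertools import combinations

def apriori(transactions, min_support=0.2):
    total = len(transactions)

    def support(itemset):
        return sum(1 for t in transactions if itemset <= t) / total

    # Step 2: frequent 1-itemsets (L1).
    items = {item for t in transactions for item in t}
    current = {frozenset([i]) for i in items if support(frozenset([i])) >= min_support}
    frequent = {}
    k = 2
    while current:
        frequent.update({itemset: support(itemset) for itemset in current})
        # Step 3: join frequent (k-1)-itemsets to form candidate k-itemsets (Ck).
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune candidates that contain an infrequent (k-1)-subset.
        candidates = {c for c in candidates
                      if all(frozenset(sub) in current for sub in combinations(c, k - 1))}
        # Keep candidates meeting the minimum support threshold (Lk).
        current = {c for c in candidates if support(c) >= min_support}
        k += 1
    return frequent

# Step 5 can then read rules off the returned dictionary, e.g.
# confidence(X -> Y) = frequent[X | Y] / frequent[X], keeping rules with confidence >= 0.5.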
Results:
1) Open Notepad
2) Enter the dataset as follows:
Tid Itemset
T3 {cheese, yogurt}
T5 {egg, juice}
Figure 5: supermarket.arff
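The screenshot of the full dataset is not reproduced here. As a rough guide to the file format, one
common way to lay out a market-basket .arff file for WEKA is sketched below; the attribute list and
the two data rows are placeholders built from the items visible above (cheese, yogurt, egg, juice),
and the remaining transactions of the lab dataset would follow in the same way.

@relation supermarket

@attribute cheese {t, f}
@attribute yogurt {t, f}
@attribute egg    {t, f}
@attribute juice  {t, f}

@data
% T3: {cheese, yogurt}
t,t,f,f
% T5: {egg, juice}
f,f,t,t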
(b) Apply the Apriori algorithm with support = 0.2, lift = 0.5, and generate 5 frequent patterns and
rules
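The rules for part (b) are generated in WEKA from the same .arff file. Purely as an illustrative
cross-check, the sketch below uses the third-party mlxtend library (an assumption, not part of the
lab procedure) to mine frequent itemsets at support = 0.2 and list rules ranked by lift with a 0.5
threshold; the transaction list is a placeholder, not the lab data.

import pandas as pd
from mlxtend.preprocessing import TransactionEncoder
from mlxtend.frequent_patterns import apriori, association_rules

# Placeholder transactions; in the lab these come from supermarket.arff.
transactions = [
    ["cheese", "yogurt"],
    ["egg", "juice"],
    ["cheese", "yogurt", "juice"],
    ["egg", "cheese"],
    ["yogurt", "juice"],
]

te = TransactionEncoder()
onehot = pd.DataFrame(te.fit(transactions).transform(transactions), columns=te.columns_)

# Frequent itemsets with minimum support 0.2, then rules filtered and ranked by lift.
itemsets = apriori(onehot, min_support=0.2, use_colnames=True)
rules = association_rules(itemsets, metric="lift", min_threshold=0.5)
print(itemsets.head(5))
print(rules.sort_values("lift", ascending=False).head(5))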
Results:
1) Open Notepad
2) Enter the dataset as follows: