FP-Growth Algorithm

FP-Growth is an algorithm for frequent itemset mining that does not require candidate generation like the Apriori algorithm. It uses a compact data structure called an FP-tree to store transaction data. It operates in two steps: [1] building the FP-tree from the dataset in two scans, and [2] extracting frequent itemsets directly from the FP-tree using a divide-and-conquer approach extracting prefix path subtrees. It provides performance advantages over Apriori by avoiding candidate generation but may require more memory due to building the FP-tree structure.

Uploaded by

Sunitha Chetan R S

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

257 views16 pages

FP-Growth Algorithm

Uploaded by

Sunitha Chetan R S

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 16

FP-Growth algorithm

Introduction
 Apriori: uses a generate-and-test approach – generates
candidate itemsets and tests if they are frequent
– Generation of candidate itemsets is expensive(in both
space and time)
– Support counting is expensive
• Subset checking (computationally expensive)
• Multiple Database scans (I/O)
 FP-Growth: allows frequent itemset discovery without
candidate itemset generation. Two step approach:
– Step 1: Build a compact data structure called the FP-tree
• Built using 2 passes over the data-set.
– Step 2: Extracts frequent itemsets directly from the FP-
tree
Step 1: FP-Tree Construction
 FP-Tree is constructed using 2 passes over the
data-set:
Pass 1:
– Scan data and find support for each item.
– Discard infrequent items.
– Sort frequent items in decreasing order based on
their support.
Use this order when building the FP-Tree, so
common prefixes can be shared.
Step 1: FP-Tree Construction
Pass 2:
Nodes correspond to items and have a counter
1. FP-Growth reads 1 transaction at a time and maps it to a
path
2. Fixed order is used, so paths can overlap when transactions
share items (when they have the same prfix ).
– In this case, counters are incremented
3. Pointers are maintained between nodes containing the
same item, creating singly linked lists (dotted lines)
– The more paths that overlap, the higher the compression. FP-
tree may fit in memory.
4. Frequent itemsets extracted from the FP-Tree.
Step 1: FP-Tree Construction
(Example)
FP-Tree size
 The FP-Tree usually has a smaller size than the uncompressed data -
typically many transactions share items (and hence prefixes).
– Best case scenario: all transactions contain the same set of items.
• 1 path in the FP-tree
– Worst case scenario: every transaction has a unique set of items (no
items in common)
• Size of the FP-tree is at least as large as the original data.
• Storage requirements for the FP-tree are higher - need to store the pointers
between the nodes and the counters.

 The size of the FP-tree depends on how the items are ordered
 Ordering by decreasing support is typically used but it does not
always lead to the smallest tree (it's a heuristic).
Step 2: Frequent Itemset Generation
 FP-Growth extracts frequent itemsets from
the FP-tree.
 Bottom-up algorithm - from the leaves
towards the root
 Divide and conquer: first look for frequent
itemsets ending in e, then de, etc. . . then d,
then cd, etc. . .
 First, extract prefix path sub-trees ending in
an item(set).
Prefix path sub-trees (Example)
Step 2: Frequent Itemset Generation
 Each prefix path sub-tree is
processed recursively to extract the
frequent itemsets. Solutions are
then merged.
– E.g. the prefix path sub-tree for e will
be used to extract frequent itemsets
ending in e, then in de, ce, be and ae,
then in cde, bde, ade, etc.
– Divide and conquer approach
Conditional FP-Tree
 The FP-Tree that would be built if we only
consider transactions containing a particular
itemset (and then removing that itemset from
all transactions).
 I Example: FP-Tree conditional on e.
Example
Let minSup = 2 and extract all frequent itemsets
containing e.
 1. Obtain the prefix path sub-tree for e:
Example
 2. Check if e is a frequent item by adding the
counts along the linked list (dotted line). If so,
extract it.
– Yes, count =3 so {e} is extracted as a frequent
itemset.
 3. As e is frequent, find frequent itemsets
ending in e. i.e. de, ce, be and ae.
Example
 4. Use the the conditional FP-tree for e to find
frequent itemsets ending in de, ce and ae
– Note that be is not considered as b is not in the
conditional FP-tree for e.
• I For each of them (e.g. de), find the prefix
paths from the conditional tree for e, extract
frequent itemsets, generate conditional FP-
tree, etc... (recursive)
Example
• Example: e -> de -> ade ({d,e}, {a,d,e} are
found to be frequent)

•Example: e -> ce ({c,e} is found to be frequent)

Result
Frequent itemsets found (ordered by sufix and
order in which they are found):
Discusion
 Advantages of FP-Growth
– only 2 passes over data-set
– “compresses” data-set
– no candidate generation
– much faster than Apriori
 Disadvantages of FP-Growth
– FP-Tree may not fit in memory!!
– FP-Tree is expensive to build

Mining Frequent Patterns Without Candidate Generation
No ratings yet
Mining Frequent Patterns Without Candidate Generation
44 pages
Module 4.2 Association Rule Mining
No ratings yet
Module 4.2 Association Rule Mining
88 pages
15-Fp-Tree Problem-10-09-2024
No ratings yet
15-Fp-Tree Problem-10-09-2024
2 pages
Unit4 2 Association Rules FP Growth
No ratings yet
Unit4 2 Association Rules FP Growth
33 pages
18-FP-Growth algorithm-12-02-2025
No ratings yet
18-FP-Growth algorithm-12-02-2025
24 pages
FP Tree
No ratings yet
FP Tree
42 pages
Lecture_13_14_FP
No ratings yet
Lecture_13_14_FP
41 pages
Lecture 5 - FP-Growth Algorithm
No ratings yet
Lecture 5 - FP-Growth Algorithm
26 pages
FPTree-09
No ratings yet
FPTree-09
45 pages
DM Unit2_1 Association Mining 19I504
No ratings yet
DM Unit2_1 Association Mining 19I504
86 pages
Lecture 2.3.3 2.3.4
No ratings yet
Lecture 2.3.3 2.3.4
29 pages
FP Growth
No ratings yet
FP Growth
30 pages
fp-tree
No ratings yet
fp-tree
37 pages
FP Tree
No ratings yet
FP Tree
54 pages
fpgrowth
No ratings yet
fpgrowth
11 pages
An Improved Frequent Pattern Tree the Child Struct
No ratings yet
An Improved Frequent Pattern Tree the Child Struct
19 pages
Chapter 5
No ratings yet
Chapter 5
26 pages
4.1) FP Growth Algorithm
No ratings yet
4.1) FP Growth Algorithm
26 pages
Tutorial 02
No ratings yet
Tutorial 02
17 pages
Lecture 6
No ratings yet
Lecture 6
18 pages
Mining Frequent Patterns Without Candidate Generation
No ratings yet
Mining Frequent Patterns Without Candidate Generation
44 pages
Untitled Document (3)
No ratings yet
Untitled Document (3)
5 pages
Association Rule Mining: FP Growth
No ratings yet
Association Rule Mining: FP Growth
22 pages
FP GROWTH ALG
No ratings yet
FP GROWTH ALG
17 pages
Machine Learning Based FP Growth Algorithm
No ratings yet
Machine Learning Based FP Growth Algorithm
8 pages
FP Growth (Tree)
No ratings yet
FP Growth (Tree)
24 pages
FP Growth Algorithm
No ratings yet
FP Growth Algorithm
17 pages
FP Growth
No ratings yet
FP Growth
21 pages
fp-growth
No ratings yet
fp-growth
16 pages
Shihab Rahman Dolon Chanpa Department of Computer Science and Engineering, University of Dhaka
No ratings yet
Shihab Rahman Dolon Chanpa Department of Computer Science and Engineering, University of Dhaka
23 pages
Tan FP Growth
No ratings yet
Tan FP Growth
8 pages
DWM EXP10_96
No ratings yet
DWM EXP10_96
11 pages
ml4
No ratings yet
ml4
13 pages
From Introduction To Data Mining: Data Mining Association Analysis: Basic Concepts and Algorithms
No ratings yet
From Introduction To Data Mining: Data Mining Association Analysis: Basic Concepts and Algorithms
37 pages
What Is Frequent Pattern Analysis?
No ratings yet
What Is Frequent Pattern Analysis?
37 pages
Data Mining - : Dr. Mahmoud Mounir Mahmoud - Mounir@cis - Asu.edu - Eg
No ratings yet
Data Mining - : Dr. Mahmoud Mounir Mahmoud - Mounir@cis - Asu.edu - Eg
23 pages
ESE Handouts 4 - FP Growth Algorithm (Fall 2016)
No ratings yet
ESE Handouts 4 - FP Growth Algorithm (Fall 2016)
13 pages
How To Find Frequent Patterns?: Wim Pijls Walter A. Kosters
No ratings yet
How To Find Frequent Patterns?: Wim Pijls Walter A. Kosters
8 pages
FP-Growth Algorithm (1)
No ratings yet
FP-Growth Algorithm (1)
5 pages
FP-Growth Algorithm
No ratings yet
FP-Growth Algorithm
23 pages
Guide: Mr. Gautam Borkar: Group Members: Rahul Kelaskar A - 636 Anish Khale A - 638 Dhaval Doshi A - 682
No ratings yet
Guide: Mr. Gautam Borkar: Group Members: Rahul Kelaskar A - 636 Anish Khale A - 638 Dhaval Doshi A - 682
22 pages
Data Mining Unit 2 (Part 2)-1
No ratings yet
Data Mining Unit 2 (Part 2)-1
7 pages
U3 - FP Trees - 5th Sem - DS
No ratings yet
U3 - FP Trees - 5th Sem - DS
9 pages
03 Pre Processing
No ratings yet
03 Pre Processing
20 pages
Fp-Tree Growth Algorithm
No ratings yet
Fp-Tree Growth Algorithm
11 pages
FP-Tree Growth Algorithm
No ratings yet
FP-Tree Growth Algorithm
15 pages
A Frequent Pattern Mining Algorithm Based On Fp-Tree Structure Andapriori Algorithm
No ratings yet
A Frequent Pattern Mining Algorithm Based On Fp-Tree Structure Andapriori Algorithm
3 pages
An Implementation of The FP-growth Algorithm: Christian Borgelt
No ratings yet
An Implementation of The FP-growth Algorithm: Christian Borgelt
5 pages
Q) FP Growth Algorithm?: This Algorithm Works As Follows
No ratings yet
Q) FP Growth Algorithm?: This Algorithm Works As Follows
3 pages
AzqaSaleemKhan (SP22 RCS 003) FPGrowth
No ratings yet
AzqaSaleemKhan (SP22 RCS 003) FPGrowth
19 pages
Mining Frequent Patterns Without Candidate Generation
No ratings yet
Mining Frequent Patterns Without Candidate Generation
12 pages
FPgrowth
No ratings yet
FPgrowth
2 pages
FP Tree Growth: Frequent Pattern Growth Algorithm
100% (1)
FP Tree Growth: Frequent Pattern Growth Algorithm
2 pages
FP Growth Presentation v1 (Handout)
No ratings yet
FP Growth Presentation v1 (Handout)
10 pages
Splunk 6.1 SearchReference
No ratings yet
Splunk 6.1 SearchReference
454 pages
Frequent Closed Pattern Mining Algorithm Based On COFI-Tree
No ratings yet
Frequent Closed Pattern Mining Algorithm Based On COFI-Tree
2 pages
Improv Me Net
No ratings yet
Improv Me Net
7 pages
Stream Processing Lab Manual
No ratings yet
Stream Processing Lab Manual
9 pages
Lecture 5 - Monday, September 3, 2007: 2.1 Example From Paper
No ratings yet
Lecture 5 - Monday, September 3, 2007: 2.1 Example From Paper
6 pages
F P-Tree F P-Growth
No ratings yet
F P-Tree F P-Growth
7 pages
ACE501 All Questions
No ratings yet
ACE501 All Questions
40 pages
Spatial Clustering Algorithm Using R-Tree
100% (2)
Spatial Clustering Algorithm Using R-Tree
6 pages
NetWorker 8.0 Avamar Integration Guide
No ratings yet
NetWorker 8.0 Avamar Integration Guide
56 pages
Data Migration To SAP S4 HANA
No ratings yet
Data Migration To SAP S4 HANA
6 pages
Siebel Oracle Database Monitoring
100% (2)
Siebel Oracle Database Monitoring
36 pages
ITDSA2-12 Week 2 2
No ratings yet
ITDSA2-12 Week 2 2
55 pages
Join Index and Hash Index in Teradata
No ratings yet
Join Index and Hash Index in Teradata
20 pages
BD_Unit4_Summary_efde2208-1937-44c2-9c1d-e0d171eb6120
No ratings yet
BD_Unit4_Summary_efde2208-1937-44c2-9c1d-e0d171eb6120
6 pages
Big Data in Business (2)
No ratings yet
Big Data in Business (2)
13 pages
What Are Recommendation Engines ?: Item - Feature Matrix
No ratings yet
What Are Recommendation Engines ?: Item - Feature Matrix
6 pages
Concepts of Database Management Eighth Edition
No ratings yet
Concepts of Database Management Eighth Edition
47 pages
DocumentDB Data Migration Tool v1.7
No ratings yet
DocumentDB Data Migration Tool v1.7
35 pages
Questions: (April 18)
No ratings yet
Questions: (April 18)
15 pages
ROBUSTNESS
No ratings yet
ROBUSTNESS
6 pages
Atul Kumar Resume
No ratings yet
Atul Kumar Resume
1 page
Data Mining: Sunitha R S Dept of ISE, RIT
No ratings yet
Data Mining: Sunitha R S Dept of ISE, RIT
12 pages
Sai Sreelatha
No ratings yet
Sai Sreelatha
3 pages
6 Days National Level Hands-On E-Workshop On Basics of C Programming and Python For Beginners
No ratings yet
6 Days National Level Hands-On E-Workshop On Basics of C Programming and Python For Beginners
1 page
Data Mining: Sunitha R S Asst Prof Dept of ISE, RIT
No ratings yet
Data Mining: Sunitha R S Asst Prof Dept of ISE, RIT
9 pages
A Verifable Semantic Searching Scheme by Optimal
No ratings yet
A Verifable Semantic Searching Scheme by Optimal
5 pages
Subdomain Enumeration Cheat Sheet: @yamakira
No ratings yet
Subdomain Enumeration Cheat Sheet: @yamakira
1 page
What Is A Data Warehouse?: Data Warehouse Architecture From Data Warehousing To Data Mining
No ratings yet
What Is A Data Warehouse?: Data Warehouse Architecture From Data Warehousing To Data Mining
27 pages
Mapping BEx Query Elements To The SAP BusinessObjects BI 4 Query Panel PDF
No ratings yet
Mapping BEx Query Elements To The SAP BusinessObjects BI 4 Query Panel PDF
14 pages
Recommendation Systems
No ratings yet
Recommendation Systems
6 pages
XIInfo Pract S E 380
No ratings yet
XIInfo Pract S E 380
6 pages
Assignment 2 s1 2017
No ratings yet
Assignment 2 s1 2017
5 pages
Warehouse Design For Airline Reservation
No ratings yet
Warehouse Design For Airline Reservation
3 pages
Ramaiah Institute of Technology MSR NAGAR, M.S.R.I.T.POST, BANGALORE - 560054, Karnataka, India
No ratings yet
Ramaiah Institute of Technology MSR NAGAR, M.S.R.I.T.POST, BANGALORE - 560054, Karnataka, India
2 pages
Ramaiah Institute of Technology
No ratings yet
Ramaiah Institute of Technology
6 pages
Data Analytics Interview QnAs
No ratings yet
Data Analytics Interview QnAs
21 pages
Cdb101 Assignment
No ratings yet
Cdb101 Assignment
10 pages
Disk Cloning in Solaris
No ratings yet
Disk Cloning in Solaris
3 pages
ADF Course Content
No ratings yet
ADF Course Content
11 pages
M S Ramaiah Institute of Technology Department of Information Science & Engg
No ratings yet
M S Ramaiah Institute of Technology Department of Information Science & Engg
11 pages
Synopsis of T24 Java Documentations
No ratings yet
Synopsis of T24 Java Documentations
1 page
Questions From Chapter 6
No ratings yet
Questions From Chapter 6
4 pages
Mastering Python
From Everand
Mastering Python
Rick van Hattem
No ratings yet
Search Tree: Fundamentals and Applications
From Everand
Search Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet

FP-Growth Algorithm

Uploaded by

FP-Growth Algorithm

Uploaded by

FP-Growth algorithm

•Example: e -> ce ({c,e} is found to be frequent)

You might also like