0% found this document useful (0 votes)

107 views26 pages

4.1) FP Growth Algorithm

The FP-Growth algorithm uses an FP-tree to efficiently mine frequent itemsets from transactional databases. It involves two steps: (1) building an FP-tree from the database by scanning it twice, and (2) mining the FP-tree by identifying conditional patterns and constructing conditional FP-trees.

Uploaded by

Sanjana Sairama

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

107 views26 pages

4.1) FP Growth Algorithm

Uploaded by

Sanjana Sairama

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 26

Association Analysis (3)

FP-Tree/FP-Growth Algorithm
• Use a compressed representation of the database using an
FP-tree
• Once an FP-tree has been constructed, it uses a recursive
divide-and-conquer approach to mine the frequent itemsets.

Building the FP-Tree

1. Scan data to determine the support count of each item.
Infrequent items are discarded, while the frequent items are
sorted in decreasing support counts.
2. Make a second pass over the data to construct the FPtree.
As the transactions are read, before being processed, their items
are sorted according to the above order.
First scan – determine frequent 1-
itemsets, then build header
TID Items B 8
1 {A,B} A 7
2 {B,C,D}
C 7
3 {A,C,D,E}
D 5
4 {A,D,E}
E 3
5 {A,B,C}
6 {A,B,C,D}
7 {B,C}
8 {A,B,C}
9 {A,B,D}
10 {B,C,E}
FP-tree construction
null
After reading TID=1:

B:1
TID Items
1 {A,B}
2 {B,C,D} A:1
3 {A,C,D,E}
4 {A,D,E} After reading TID=2:
5 {A,B,C} null
6 {A,B,C,D}
B:2
7 {B,C}
8 {A,B,C}
C:1
9 {A,B,D} A:1
10 {B,C,E}
D:1
FP-Tree Construction
TID Items
Transaction
1 {A,B} null
2 {B,C,D} Database
3 {A,C,D,E}
4 {A,D,E}
B:8 A:2
5 {A,B,C}
6 {A,B,C,D} A:5 C:3 C:1 D:1
7 {B,C}
8 {A,B,C}
9 {A,B,D} C:3 D:1 D:1 E:1 D:1 E:1
10 {B,C,E}

Header table D:1 E:1

Item Pointer
B 8
A 7
Chain pointers help in quickly finding all the paths
C 7
of the tree containing some given item.
D 5
E 3
FP-Tree size
• The size of an FPtree is typically smaller than the size of the uncompressed
data because many transactions often share a few items in common.
• Bestcase scenario:
– All transactions have the same set of items, and the FPtree contains only a
single branch of nodes.
• Worstcase scenario:
– Every transaction has a unique set of items.
– As none of the transactions have any items in common, the size of the FP-
tree is effectively the same as the size of the original data.

• The size of an FPtree also depends on how the items are ordered.
– If the ordering scheme in the preceding example is reversed,
• i.e., from lowest to highest support item, the resulting FPtree probably is
denser (shown in next slide).
• Not always though…ordering is just a heuristic.
An FPtree representation for the data set with a different item ordering scheme.
FP-Growth (I)
• FPgrowth generates frequent itemsets from an FPtree by
exploring the tree in a bottomup fashion.

• Given the example tree, the algorithm looks for frequent

itemsets ending in E first, followed by D, C, A, and finally, B.

• Since every transaction is mapped onto a path in the FPtree, we

can derive the frequent itemsets ending with a particular item,
say, E, by examining only the paths containing node E.

• These paths can be accessed rapidly using the pointers

associated with node E.
Paths containing node E
null

B:8 A:2

A:5 C:3 C:1 D:1

C:3 D:1 D:1 E:1 D:1 E:1

null
D:1 E:1

B:3 A:2

C:3 C:1 D:1

E:1 D:1 E:1

E:1
Conditional FP-Tree for E
• We now need to build a conditional FP-Tree for E, which is the
tree of itemsets ending in E.

• It is not the tree obtained in previous slide as result of deleting

nodes from the original tree.

• Why? Because the order of the items change.

– In this example, C has a higher count than B.
Conditional FP-Tree for E
null Header table
Item Pointer
The
B:3 A:2 C 4
conditional
B 3
FP-Tree for E
A 2
C:3 C:1 D:1 D 2 null
The new
C:3 C:1 A:1
E:1 D:1 E:1 header

B:3
E:1 A:1 D:1

The set of paths containing E.

D:1
Insert each path (after truncating Adding up the counts for D we get
E) into a new tree. 2, so {E,D} is frequent itemset.

We continue recursively.
Base of recursion: When the tree
has a single path only.
FP-Tree Another Example
Transactions Freq. 1-Itemsets. Transactions with items sorted based
Supp. Count 2 on frequencies, and ignoring the
infrequent items.
ABCEFO A:8 ACEBF
ACG C:8 ACG
EI E:8 E
ACDEG G:5 ACEGD
B:2
ACEGL ACEG
D:2
EJ E
F:2
ABCEFP ACEBF
ACD ACD
ACEGM ACEG
ACEGN ACEG
FP-Tree after reading 1st transaction
ACEBF
Header null
ACG
E A:8 A:1
C:8
ACEGD
E:8 C:1
ACEG
G:5
E
B:2 E:1
ACEBF D:2
ACD F:2 B:1
ACEG
ACEG F:1
FP-Tree after reading 2nd transaction
ACEBF
Header null
ACG
E A:8 A:2
C:8
ACEGD
E:8 C:2
ACEG
G:5
E G:1
B:2 E:1
ACEBF D:2
ACD F:2 B:1
ACEG
ACEG F:1
FP-Tree after reading 3rd transaction
ACEBF
Header null
ACG
E A:8 A:2 E:1
C:8
ACEGD
E:8 C:2
ACEG
G:5
E G:1
B:2 E:1
ACEBF D:2
ACD F:2 B:1
ACEG
ACEG F:1
FP-Tree after reading 4th transaction
ACEBF
Header null
ACG
E A:8 A:3 E:1
C:8
ACEGD
E:8 C:3
ACEG
G:5
E G:1
B:2 E:2
ACEBF D:2
ACD F:2 B:1
G:1
ACEG
ACEG F:1 D:1
FP-Tree after reading 5th transaction
ACEBF
Header null
ACG
E A:8 A:4 E:1
C:8
ACEGD
E:8 C:4
ACEG
G:5
E G:1
B:2 E:3
ACEBF D:2
ACD F:2 B:1
G:2
ACEG
ACEG F:1 D:1
FP-Tree after reading 6th transaction
ACEBF
Header null
ACG
E A:8 A:4 E:2
C:8
ACEGD
E:8 C:4
ACEG
G:5
E G:1
B:2 E:3
ACEBF D:2
ACD F:2 B:1
G:2
ACEG
ACEG F:1 D:1
FP-Tree after reading 7th transaction
ACEBF
Header null
ACG
E A:8 A:5 E:2
C:8
ACEGD
E:8 C:5
ACEG
G:5
E G:1
B:2 E:4
ACEBF D:2
ACD F:2 B:2
G:2
ACEG
ACEG F:2 D:1
FP-Tree after reading 8th transaction
ACEBF
Header null
ACG
E A:8 A:6 E:2
C:8
ACEGD
E:8 C:6
ACEG
G:5
E G:1 D:1
B:2 E:4
ACEBF D:2
ACD F:2 B:2
G:2
ACEG
ACEG F:2 D:1
FP-Tree after reading 9th transaction
ACEBF
Header null
ACG
E A:8 A:7 E:2
C:8
ACEGD
E:8 C:7
ACEG
G:5
E G:1 D:1
B:2 E:5
ACEBF D:2
ACD F:2 B:2
G:3
ACEG
ACEG F:2 D:1
FP-Tree after reading 10th transaction
ACEBF
Header null
ACG
E A:8 A:8 E:2
C:8
ACEGD
E:8 C:8
ACEG
G:5
E G:1 D:1
B:2 E:6
ACEBF D:2
ACD F:2 B:2
G:4
ACEG
ACEG F:2 D:1
Conditional FP-Trees
Build the conditional FP-Tree for each of the items.
For this:

1. Find the paths containing on focus item. With those paths we

build the conditional FP-Tree for the item.

2. Read again the tree to determine the new counts of the items
along those paths. Build a new header.

3. Insert the paths in the conditional FP-Tree according to the new

order.
Conditional FP-Tree for F
Header null null
New Header

A:8 A:8 A:2 A:2

C:8 C:2
E:8 C:8 E:2 C:2
G:5 B:2
B:2 E:6 E:2
D:2
F:2 B:2 B:2

F:2

There is only a single path containing F

Recursion
• We continue recursively on the
null
conditional FP-Tree for F. New Header
• However, when the tree is just a A:6 A:2
single path it is the base case for C:6
the recursion. E:5 C:2
• So, we just produce all the subsets B:2
of the items on this path merged E:2
with F.
B:2
{F} {A,F} {C,F} {E,F} {B,F}
{A,C,F}, …,
{A,C,E,F}
Conditional FP-Tree for D
New Header null
null

A:8
A:2 A:2
C:2
C:2
C:8
The other items are
E:6 D:1 removed as infrequent.
The tree is just a single path; it is
G:4 the base case for the recursion.
So, we just produce all the
subsets of the items on this path
merged with D.
D:1
{D} {A,D} {C,D} {A,C,D}
Paths containing D after updating the counts
Exercise: Complete the example.

Abstract Algebra Structures and Applications 1st Edition Stephen Lovett - Download The Ebook Now For The Best Reading Experience
No ratings yet
Abstract Algebra Structures and Applications 1st Edition Stephen Lovett - Download The Ebook Now For The Best Reading Experience
84 pages
Relations and Functions Class 12 Notes CBSE Maths Chapter 1 (PDF)
0% (1)
Relations and Functions Class 12 Notes CBSE Maths Chapter 1 (PDF)
4 pages
Mathematical Induction
100% (2)
Mathematical Induction
25 pages
1.4 Inverse of Function
100% (1)
1.4 Inverse of Function
12 pages
Aies Unit - 2
No ratings yet
Aies Unit - 2
28 pages
DWDM Unit-3
100% (1)
DWDM Unit-3
63 pages
Type Conversions
No ratings yet
Type Conversions
6 pages
Fpgrowth
No ratings yet
Fpgrowth
11 pages
Digital Logic Design
No ratings yet
Digital Logic Design
2 pages
Sample Questions
No ratings yet
Sample Questions
5 pages
DM Unit2 - 1 Association Mining 19I504
No ratings yet
DM Unit2 - 1 Association Mining 19I504
86 pages
Computer Science Paper Xii 2024 Marking Scheme
No ratings yet
Computer Science Paper Xii 2024 Marking Scheme
21 pages
Module 4.2 Association Rule Mining
No ratings yet
Module 4.2 Association Rule Mining
88 pages
Asset v1 HKUSTx+MSBD5002x+1T2022+Type@Asset+Block@03 FP Tree
No ratings yet
Asset v1 HKUSTx+MSBD5002x+1T2022+Type@Asset+Block@03 FP Tree
50 pages
Lecture 13 14 FP
No ratings yet
Lecture 13 14 FP
41 pages
FP Tree
No ratings yet
FP Tree
54 pages
FP Tree
No ratings yet
FP Tree
42 pages
Malakand University Bscs Syllabus
No ratings yet
Malakand University Bscs Syllabus
37 pages
Pointers and Arrays in C Jensen 1.5
No ratings yet
Pointers and Arrays in C Jensen 1.5
64 pages
FP Tree
No ratings yet
FP Tree
37 pages
FP Growth
No ratings yet
FP Growth
30 pages
Lecture 2.3.3 2.3.4
No ratings yet
Lecture 2.3.3 2.3.4
29 pages
FP Growth
No ratings yet
FP Growth
32 pages
Unit4 2 Association Rules FP Growth
No ratings yet
Unit4 2 Association Rules FP Growth
33 pages
Introduction Updated
No ratings yet
Introduction Updated
95 pages
FP Growth Datamining Lect 5
No ratings yet
FP Growth Datamining Lect 5
86 pages
18-FP-Growth Algorithm-12-02-2025
No ratings yet
18-FP-Growth Algorithm-12-02-2025
24 pages
Lecture 05
No ratings yet
Lecture 05
22 pages
Pattern Matching Algo
No ratings yet
Pattern Matching Algo
21 pages
FP Growth Alg
No ratings yet
FP Growth Alg
17 pages
FP Growth
No ratings yet
FP Growth
16 pages
Lecture 6
No ratings yet
Lecture 6
18 pages
Lecture 5 - FP-Growth Algorithm
No ratings yet
Lecture 5 - FP-Growth Algorithm
26 pages
Tutorial 02
No ratings yet
Tutorial 02
17 pages
Toc QB
No ratings yet
Toc QB
8 pages
ML 4
No ratings yet
ML 4
13 pages
DWM Exp10 - 201107
No ratings yet
DWM Exp10 - 201107
13 pages
FP Growth Example 2
No ratings yet
FP Growth Example 2
21 pages
DWM Exp10 - 96
No ratings yet
DWM Exp10 - 96
11 pages
FP Growth Algorithm Example Problems
No ratings yet
FP Growth Algorithm Example Problems
12 pages
CSE 385 - Data Mining and Business Intelligence - Lecture 03 - Part 01
No ratings yet
CSE 385 - Data Mining and Business Intelligence - Lecture 03 - Part 01
31 pages
Numericals On Turing Machine
No ratings yet
Numericals On Turing Machine
6 pages
Tan FP Growth
No ratings yet
Tan FP Growth
8 pages
MA5260: Probability Theory II Homework 1 Spring 2023 Exercise 1 (I)
No ratings yet
MA5260: Probability Theory II Homework 1 Spring 2023 Exercise 1 (I)
18 pages
AzqaSaleemKhan (SP22 RCS 003) FPGrowth
No ratings yet
AzqaSaleemKhan (SP22 RCS 003) FPGrowth
19 pages
Data Mining Unit 2 (Part 2) - 1
No ratings yet
Data Mining Unit 2 (Part 2) - 1
7 pages
Data Mining - : Dr. Mahmoud Mounir Mahmoud - Mounir@cis - Asu.edu - Eg
No ratings yet
Data Mining - : Dr. Mahmoud Mounir Mahmoud - Mounir@cis - Asu.edu - Eg
23 pages
2 Stepping Stone and Modi
No ratings yet
2 Stepping Stone and Modi
13 pages
Association Rule Mining: FP Growth
No ratings yet
Association Rule Mining: FP Growth
22 pages
Mining Frequent Patterns Without Candidate Generation
No ratings yet
Mining Frequent Patterns Without Candidate Generation
44 pages
U3 - FP Trees - 5th Sem - DS
No ratings yet
U3 - FP Trees - 5th Sem - DS
9 pages
FP Tree Growth: Frequent Pattern Growth Algorithm
100% (1)
FP Tree Growth: Frequent Pattern Growth Algorithm
2 pages
Frequent Pattern Mining Without Candidate Generation: Lesson Introduction
No ratings yet
Frequent Pattern Mining Without Candidate Generation: Lesson Introduction
6 pages
Topic - 6 (Logical Agents)
No ratings yet
Topic - 6 (Logical Agents)
32 pages
AP CSA Mock 6 FRQs
No ratings yet
AP CSA Mock 6 FRQs
7 pages
FP Growth
No ratings yet
FP Growth
21 pages
FP-Growth Algorithm
No ratings yet
FP-Growth Algorithm
5 pages
FP-Growth Algorithm
No ratings yet
FP-Growth Algorithm
23 pages
Resolution Method in Ai: Assistant Proof. Dr. Emad I Abdul Kareem
No ratings yet
Resolution Method in Ai: Assistant Proof. Dr. Emad I Abdul Kareem
12 pages
Untitled Document
No ratings yet
Untitled Document
5 pages
PHI REVIEWER LESSON 8 To 10
No ratings yet
PHI REVIEWER LESSON 8 To 10
6 pages
Tail Lard 1993
No ratings yet
Tail Lard 1993
8 pages
Appc 2.8 Packet
No ratings yet
Appc 2.8 Packet
5 pages
03 Pre Processing
No ratings yet
03 Pre Processing
20 pages
Shihab Rahman Dolon Chanpa Department of Computer Science and Engineering, University of Dhaka
No ratings yet
Shihab Rahman Dolon Chanpa Department of Computer Science and Engineering, University of Dhaka
23 pages
Open Methods (Newton-Raphson Secant Methods)
100% (2)
Open Methods (Newton-Raphson Secant Methods)
4 pages
FP-Growth Algorithm
No ratings yet
FP-Growth Algorithm
16 pages
AI Lab 3
No ratings yet
AI Lab 3
8 pages
FP-Tree Growth Algorithm
No ratings yet
FP-Tree Growth Algorithm
15 pages
ESE Handouts 4 - FP Growth Algorithm (Fall 2016)
No ratings yet
ESE Handouts 4 - FP Growth Algorithm (Fall 2016)
13 pages
Q) FP Growth Algorithm?: This Algorithm Works As Follows
No ratings yet
Q) FP Growth Algorithm?: This Algorithm Works As Follows
3 pages
Estimating Frequent Patterns Using FP-Growth On A Transactional Data Stream
No ratings yet
Estimating Frequent Patterns Using FP-Growth On A Transactional Data Stream
3 pages
15-Fp-Tree Problem-10-09-2024
No ratings yet
15-Fp-Tree Problem-10-09-2024
2 pages
Fp-Tree Growth Algorithm
No ratings yet
Fp-Tree Growth Algorithm
11 pages
Module 1: Sets GED0103 Mathematics in The Modern World Examples Roster Method
No ratings yet
Module 1: Sets GED0103 Mathematics in The Modern World Examples Roster Method
6 pages
Frequent Closed Pattern Mining Algorithm Based On COFI-Tree
No ratings yet
Frequent Closed Pattern Mining Algorithm Based On COFI-Tree
2 pages
FP Growth Algorithm
No ratings yet
FP Growth Algorithm
17 pages
A Metalogical Primer: 1 Notation
No ratings yet
A Metalogical Primer: 1 Notation
12 pages
FP Tree Example
No ratings yet
FP Tree Example
11 pages
How To Find Frequent Patterns?: Wim Pijls Walter A. Kosters
No ratings yet
How To Find Frequent Patterns?: Wim Pijls Walter A. Kosters
8 pages
FP Growth Presentation v1 (Handout)
No ratings yet
FP Growth Presentation v1 (Handout)
10 pages
Mining Association Rules With Systolic Trees: Dept. of Electrical and Computer Engineering Iowa State University Email
No ratings yet
Mining Association Rules With Systolic Trees: Dept. of Electrical and Computer Engineering Iowa State University Email
6 pages
2.5 BSED-Filipino 1A
No ratings yet
2.5 BSED-Filipino 1A
3 pages
F P-Tree F P-Growth
No ratings yet
F P-Tree F P-Growth
7 pages
Lecture 5 - Monday, September 3, 2007: 2.1 Example From Paper
No ratings yet
Lecture 5 - Monday, September 3, 2007: 2.1 Example From Paper
6 pages
A New Fast Algorithm For Constructing FP - Tree: Zhenzhou Wang Jiaomin Liu Sheng Guo Lijuan Yang
No ratings yet
A New Fast Algorithm For Constructing FP - Tree: Zhenzhou Wang Jiaomin Liu Sheng Guo Lijuan Yang
4 pages
Operators in Java
No ratings yet
Operators in Java
30 pages
Data Mining of Frequent Patterns Using FP Tree
No ratings yet
Data Mining of Frequent Patterns Using FP Tree
1 page
IGNOU BCA Data and File Structure Previous Year Unsolved Papers MCS 021
From Everand
IGNOU BCA Data and File Structure Previous Year Unsolved Papers MCS 021
Manish Soni
No ratings yet
Christening Blanket
From Everand
Christening Blanket
Annie's
No ratings yet

4.1) FP Growth Algorithm

Uploaded by

4.1) FP Growth Algorithm

Uploaded by

Association Analysis (3)

Building the FP-Tree

Header table D:1 E:1

• Given the example tree, the algorithm looks for frequent

• Since every transaction is mapped onto a path in the FPtree, we

• These paths can be accessed rapidly using the pointers

A:5 C:3 C:1 D:1

C:3 D:1 D:1 E:1 D:1 E:1

C:3 C:1 D:1

E:1 D:1 E:1

• It is not the tree obtained in previous slide as result of deleting

• Why? Because the order of the items change.

The set of paths containing E.

1. Find the paths containing on focus item. With those paths we

3. Insert the paths in the conditional FP-Tree according to the new

A:8 A:8 A:2 A:2

There is only a single path containing F

You might also like