Lecture 6

FP-tree and FP-growth

Frequent itemset mining …


Is Apriori Fast Enough? — Performance
Bottlenecks

■ The core of the Apriori algorithm:
■ Use frequent (k – 1)-itemsets to generate candidate frequent
k-itemsets
■ Use database scan and pattern matching to collect counts for the
candidate itemsets
■ The bottleneck of Apriori: candidate generation
■ Huge candidate sets:
■ 10^4 frequent 1-itemsets will generate 10^7 candidate 2-itemsets
■ To discover a frequent pattern of size 100, e.g., {a1, a2, …,
a100}, one needs to generate 2^100 ≈ 10^30 candidates
■ Multiple scans of the database:
■ Needs (n + 1) scans, where n is the length of the longest pattern

2
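The candidate-generation (join) step behind this blow-up can be sketched in Python. This is a minimal sketch: `apriori_gen` is an illustrative name, itemsets are sorted tuples, and the prune step (discarding candidates with an infrequent (k – 1)-subset) is omitted for brevity.

```python
from itertools import combinations

def apriori_gen(frequent_kminus1):
    """Join step of Apriori: build candidate k-itemsets from frequent
    (k-1)-itemsets that share their first k-2 items."""
    prev = sorted(frequent_kminus1)   # each itemset is a sorted tuple
    candidates = set()
    for a, b in combinations(prev, 2):
        if a[:-1] == b[:-1]:          # share the first k-2 items
            candidates.add(a[:-1] + (min(a[-1], b[-1]), max(a[-1], b[-1])))
    return candidates

# 4 frequent 1-itemsets already yield C(4,2) = 6 candidate 2-itemsets;
# with 10^4 frequent items this grows to roughly 5 * 10^7 candidates.
L1 = [("a",), ("b",), ("c",), ("d",)]
print(len(apriori_gen(L1)))   # 6
```

Every pair of frequent 1-itemsets joins, which is why the number of candidate 2-itemsets grows quadratically in the number of frequent items.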
Mining Frequent Patterns Without
Candidate Generation

■ Compress a large database into a compact
Frequent-Pattern tree (FP-tree) structure
■ highly condensed, but complete for frequent pattern
mining
■ avoid costly database scans
■ Develop an efficient, FP-tree-based frequent pattern
mining method
■ A divide-and-conquer methodology: decompose mining
tasks into smaller ones
■ Avoid candidate generation: sub-database test only!

3
Construct FP-tree from a
Transaction DB
TID  Items bought              (ordered) frequent items
100  {f, a, c, d, g, i, m, p}  {f, c, a, m, p}
200  {a, b, c, f, l, m, o}     {f, c, a, b, m}
300  {b, f, h, j, o}           {f, b}
400  {b, c, k, s, p}           {c, b, p}
500  {a, f, c, e, l, p, m, n}  {f, c, a, m, p}

min_support = 0.5

Steps:
1. Scan DB once, find frequent 1-itemsets (single-item patterns)
2. Order frequent items in frequency-descending order
3. Scan DB again, construct the FP-tree

Header Table:
Item  frequency
f     4
c     4
a     3
b     3
m     3
p     3

Resulting FP-tree (header-table node-links omitted):

{}
├─ f:4
│  ├─ c:3
│  │  └─ a:3
│  │     ├─ m:2
│  │     │  └─ p:2
│  │     └─ b:1
│  │        └─ m:1
│  └─ b:1
└─ c:1
   └─ b:1
      └─ p:1

4
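The two-pass construction above can be sketched in Python. This is a minimal sketch: class and function names are my own, and the header table with its node-links is omitted. Ties in frequency (f and c both appear 4 times) are broken alphabetically here, so c becomes the first branch where the slide puts f first; any fixed order works as long as it is applied consistently.

```python
from collections import defaultdict

class FPNode:
    """One node of the FP-tree: an item, a count, and child links."""
    def __init__(self, item, parent):
        self.item, self.parent = item, parent
        self.count = 0
        self.children = {}   # item -> FPNode

def build_fptree(transactions, min_count):
    # Pass 1: count single items and keep the frequent ones.
    counts = defaultdict(int)
    for t in transactions:
        for item in t:
            counts[item] += 1
    frequent = {i for i, c in counts.items() if c >= min_count}
    # Pass 2: insert each transaction with its frequent items in
    # frequency-descending order, sharing prefixes along the way.
    root = FPNode(None, None)
    for t in transactions:
        ordered = sorted((i for i in t if i in frequent),
                         key=lambda i: (-counts[i], i))
        node = root
        for item in ordered:
            node = node.children.setdefault(item, FPNode(item, node))
            node.count += 1
    return root, counts

transactions = [list("facdgimp"), list("abcflmo"), list("bfhjo"),
                list("bcksp"), list("afcelpmn")]
root, counts = build_fptree(transactions, min_count=3)
print(root.children["c"].count)                 # 4
print(root.children["c"].children["f"].count)   # 3
```

Four of the five transactions share the c–f prefix, which is exactly the compression the frequency-descending order is designed to produce.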
Benefits of the FP-tree Structure

■ Completeness:
■ never breaks a long pattern of any transaction

■ preserves complete information for frequent pattern


mining
■ Compactness
■ reduce irrelevant information—infrequent items are gone

■ frequency descending ordering: more frequent items are


more likely to be shared
■ Not larger than the original database (node-links and
counts are extra space, but asymptotically the size is
upper-bounded by the DB size)
■ Example: For Connect-4 DB, compression ratio could be
over 100
6
Mining Frequent Patterns Using FP-tree

■ General idea (divide-and-conquer)
■ Recursively grow frequent pattern paths using the FP-tree
■ Method
■ For each item, construct its conditional pattern base,
and then its conditional FP-tree
■ Repeat the process on each newly created conditional
FP-tree
■ Until the resulting FP-tree is empty, or it contains only
one path (a single path generates all the combinations of its
sub-paths, each of which is a frequent pattern)

7
Major Steps to Mine FP-tree

1) Construct conditional pattern base for each node in the


FP-tree
2) Construct conditional FP-tree from each conditional
pattern-base
3) Recursively mine conditional FP-trees and grow
frequent patterns obtained so far
▪ If the conditional FP-tree contains a single path,
simply enumerate all the patterns

8
Step 1: From FP-tree to Conditional
Pattern Base
■ Start at the frequent-item header table in the FP-tree
■ Traverse the FP-tree by following the link of each frequent item
■ Accumulate all transformed prefix paths of that item to form its
conditional pattern base

Conditional pattern bases:

item  conditional pattern base
c     f:3
a     fc:3
b     fca:1, f:1, c:1
m     fca:2, fcab:1
p     fcam:2, cb:1
9
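The same conditional pattern bases can be derived directly from the ordered transactions, since each transaction contributes the prefix preceding an item as one path with count 1. This is a minimal sketch under that assumption (a real implementation follows the node-links and reads counts off the tree, which yields the same multiset):

```python
from collections import defaultdict

# Ordered frequent-item lists from the example DB (min_support = 0.5).
ordered = [list("fcamp"), list("fcabm"), list("fb"), list("cbp"), list("fcamp")]

def conditional_pattern_bases(ordered_transactions):
    """For each item, collect the prefix that precedes it in every
    ordered transaction (count 1 per transaction)."""
    bases = defaultdict(list)
    for t in ordered_transactions:
        for i, item in enumerate(t):
            if i:   # an empty prefix carries no information
                bases[item].append(tuple(t[:i]))
    return bases

bases = conditional_pattern_bases(ordered)
print(bases["m"])   # [('f', 'c', 'a'), ('f', 'c', 'a', 'b'), ('f', 'c', 'a')]
```

This reproduces the table's m entry (fca:2, fcab:1) and likewise p's entry (fcam:2, cb:1).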
Properties of FP-tree for Conditional
Pattern Base Construction

■ Node-link property
■ For any frequent item a_i, all the possible frequent
patterns that contain a_i can be obtained by following
a_i's node-links, starting from a_i's head in the FP-tree
header
■ Prefix path property
■ To calculate the frequent patterns for a node a_i in a
path P, only the prefix sub-path of a_i in P needs to be
accumulated, and its frequency count should carry the
same count as node a_i

10
Step 2: Construct Conditional FP-tree

■ For each pattern-base


■ Accumulate the count for each item in the base

■ Construct the FP-tree for the frequent items of the


pattern base

m-conditional pattern base: fca:2, fcab:1

m-conditional FP-tree (a single path):

{}
└─ f:3
   └─ c:3
      └─ a:3

All frequent patterns concerning m:
m, fm, cm, am, fcm, fam, cam, fcam
11
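The two bullets above can be sketched in Python. This is a minimal sketch with an illustrative function name: it accumulates counts over the pattern base and keeps the items frequent within it, which are exactly the items of the conditional FP-tree (the tree's path structure is omitted here).

```python
from collections import defaultdict

def conditional_fptree_items(pattern_base, min_count):
    """Accumulate counts over a conditional pattern base and keep only
    the items that are frequent within it."""
    counts = defaultdict(int)
    for prefix, count in pattern_base:
        for item in prefix:
            counts[item] += count
    return {i: c for i, c in counts.items() if c >= min_count}

# m's conditional pattern base from the slide: fca:2, fcab:1
m_base = [(("f", "c", "a"), 2), (("f", "c", "a", "b"), 1)]
print(conditional_fptree_items(m_base, min_count=3))
# {'f': 3, 'c': 3, 'a': 3}  -- b:1 falls below min_count and is dropped
```

Dropping b here is why the m-conditional FP-tree on the slide is the single path f:3 → c:3 → a:3.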
Mining Frequent Patterns by Creating
Conditional Pattern-Bases

Item  Conditional pattern-base   Conditional FP-tree
p     {(fcam:2), (cb:1)}         {(c:3)}|p
m     {(fca:2), (fcab:1)}        {(f:3, c:3, a:3)}|m
b     {(fca:1), (f:1), (c:1)}    Empty
a     {(fc:3)}                   {(f:3, c:3)}|a
c     {(f:3)}                    {(f:3)}|c
f     Empty                      Empty
12
Step 3: Recursively mine the
conditional FP-tree
Starting from the m-conditional FP-tree ({} → f:3 → c:3 → a:3):

Cond. pattern base of "am": (fc:3)
am-conditional FP-tree: {} → f:3 → c:3

Cond. pattern base of "cm": (f:3)
cm-conditional FP-tree: {} → f:3

Cond. pattern base of "cam": (f:3)
cam-conditional FP-tree: {} → f:3
13
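The whole recursion can be sketched end to end. This is a simplified sketch that mines projected (conditional) transaction lists rather than an explicit tree, so it is not the paper's implementation, but the divide-and-conquer structure mirrors FP-growth: each recursive call mines one item's conditional database with that item appended to the suffix.

```python
from collections import defaultdict

def fp_growth(transactions, min_count, suffix=()):
    """Recursive pattern growth over conditional (projected) databases.
    Transactions must already be ordered frequency-descending, so an
    item's conditional database holds only items preceding it."""
    counts = defaultdict(int)
    for t in transactions:
        for item in set(t):
            counts[item] += 1
    patterns = {}
    for item, c in counts.items():
        if c >= min_count:
            new_suffix = (item,) + suffix
            patterns[new_suffix] = c
            # Conditional database of `item`: the prefixes preceding it.
            cond = [t[:t.index(item)] for t in transactions if item in t]
            patterns.update(fp_growth(cond, min_count, new_suffix))
    return patterns

ordered = [list("fcamp"), list("fcabm"), list("fb"), list("cbp"), list("fcamp")]
result = fp_growth(ordered, min_count=3)
print(result[("f", "c", "a", "m")])   # 3
```

On the running example this yields 18 frequent patterns, including all eight patterns concerning m from the slide, and nothing longer than {b} for b, matching its empty conditional FP-tree.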
Single FP-tree Path Generation

■ Suppose an FP-tree T has a single path P


■ The complete set of frequent pattern of T can be
generated by enumeration of all the combinations of the
sub-paths of P

m-conditional FP-tree (a single path):
{} → f:3 → c:3 → a:3

All frequent patterns concerning m:
m, fm, cm, am, fcm, fam, cam, fcam
14
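The single-path enumeration can be sketched as follows (a minimal sketch with an illustrative function name; the path is given as (item, count) pairs from root to leaf, and a pattern's support is the minimum count among the nodes chosen):

```python
from itertools import combinations

def single_path_patterns(path, suffix):
    """Enumerate every nonempty subset of a single-path FP-tree,
    combined with the suffix item; support is the minimum node count."""
    patterns = {}
    for r in range(1, len(path) + 1):
        for combo in combinations(path, r):
            items = tuple(i for i, _ in combo) + (suffix,)
            patterns[items] = min(c for _, c in combo)
    return patterns

# m-conditional FP-tree from the slide: the single path f:3 -> c:3 -> a:3
pats = single_path_patterns([("f", 3), ("c", 3), ("a", 3)], "m")
print(len(pats))                       # 7 = 2^3 - 1 patterns ending in m
print(pats[("f", "c", "a", "m")])      # 3
```

Together with {m} itself, these are exactly the eight patterns concerning m listed above, obtained with no further recursion.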
Principles of Frequent Pattern
Growth
■ Pattern growth property
■ Let α be a frequent itemset in DB, B be α's
conditional pattern base, and β be an itemset in B.
Then α ∪ β is a frequent itemset in DB iff β is
frequent in B.
■ “abcdef” is a frequent pattern if and only if
■ “abcde” is a frequent pattern, and
■ “f” is frequent in the set of transactions containing
“abcde”

15
Why Is Frequent Pattern Growth
Fast?

■ Our performance study shows


■ FP-growth is an order of magnitude faster than
Apriori, and is also faster than tree-projection
■ Reasoning
■ No candidate generation, no candidate test
■ Use compact data structure
■ Eliminate repeated database scan
■ Basic operation is counting and FP-tree building

16
FP-growth vs. Apriori: Scalability With
the Support Threshold

Data set: T25I20D10K (performance chart not reproduced)

17
FP-growth vs. Tree-Projection: Scalability
with Support Threshold

Data set: T25I20D100K (performance chart not reproduced)

18
