FP Growth Algorithm
Lecture 33/15-10-09
Observations about the FP-tree
• The size of an FP-tree depends on how the items are ordered.
• In the previous example, if the items are ordered by increasing support count instead, the resulting FP-tree is different and, for this example, bushier (wider).
• The branching factor at the root node increases from 2 to 5, as shown on the next slide and in the sketch below.
• Also, ordering by decreasing support count does not always lead to the smallest tree.
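The effect of the ordering is easy to reproduce. Below is a minimal, illustrative FP-tree insertion routine; the Node class and the ten sample transactions are assumptions (the dataset is the ten-transaction example from Tan, Steinbach & Kumar, which appears to be the "previous example" these slides refer to).

class Node:
    def __init__(self):
        self.count, self.children = 0, {}

def insert(root, transaction, rank):
    # Insert one transaction, with its items sorted by the given ranking.
    node = root
    for item in sorted(transaction, key=rank.index):
        node = node.children.setdefault(item, Node())
        node.count += 1

txns = [{"a","b"}, {"b","c","d"}, {"a","c","d","e"}, {"a","d","e"},
        {"a","b","c"}, {"a","b","c","d"}, {"a"}, {"a","b","c"},
        {"a","b","d"}, {"b","c","e"}]
for label, rank in [("decreasing support", ["a","b","c","d","e"]),
                    ("increasing support", ["e","d","c","b","a"])]:
    root = Node()
    for t in txns:
        insert(root, t, rank)
    print(label, "->", len(root.children), "children at the root")
# decreasing support -> 2 children, increasing support -> 5 children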
FP-Growth Algorithm: Mining Frequent Patterns Using the FP-tree
Frequent itemset generation using the FP-growth algorithm
• The algorithm generates frequent itemsets from the FP-tree by traversing it in a bottom-up fashion.
• It extracts the frequent itemsets ending in 'e' first, then those ending in 'd', 'c', 'b' and 'a'.
• Since every transaction is mapped onto a single path in the FP-tree, the frequent itemsets ending in, say, 'e' can be found by examining only the paths that contain node 'e'.
Major Steps of the FP-Growth Algorithm
Starting the processing from the end of the ordered frequent-item list L:
Step 1:
Construct the conditional pattern base for each item in the header table.
Step 2:
Construct the conditional FP-tree from each conditional pattern base.
Step 3:
Recursively mine the conditional FP-trees and grow the frequent patterns obtained so far.
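Put together, the three steps form one short recursive procedure. The following is a minimal Python sketch, not the lecture's own notation: the Node class, build_tree and fp_growth names are assumptions, and the sample transactions are the classic illustrative dataset from Han et al. (which appears comparable to the one mined on the later slides).

from collections import defaultdict

class Node:
    # One FP-tree node (illustrative layout, not the lecture's notation).
    def __init__(self, item, parent):
        self.item, self.parent = item, parent
        self.count = 0
        self.children = {}

def build_tree(weighted_txns, min_support):
    # Build an FP-tree from (itemset, count) pairs; return (root, header),
    # where header maps each frequent item to all of its nodes (node links).
    support = defaultdict(int)
    for items, count in weighted_txns:
        for item in items:
            support[item] += count
    frequent = {i for i, s in support.items() if s >= min_support}
    root, header = Node(None, None), defaultdict(list)
    for items, count in weighted_txns:
        # Keep frequent items, ordered by decreasing support (the list L).
        path = sorted((i for i in items if i in frequent),
                      key=lambda i: (-support[i], i))
        node = root
        for item in path:
            if item not in node.children:
                node.children[item] = Node(item, node)
                header[item].append(node.children[item])
            node = node.children[item]
            node.count += count
    return root, header

def fp_growth(weighted_txns, min_support, suffix=()):
    # Yield (pattern, support) pairs by applying Steps 1-3 recursively.
    _, header = build_tree(weighted_txns, min_support)
    # Process items from the end of L: lowest support first.
    for item in sorted(header, key=lambda i: sum(n.count for n in header[i])):
        support = sum(n.count for n in header[item])
        pattern = (item,) + suffix
        yield pattern, support
        # Step 1: the conditional pattern base = the prefix path of every
        # node of 'item', each weighted by that node's count.
        base = []
        for node in header[item]:
            path, up = [], node.parent
            while up.item is not None:
                path.append(up.item)
                up = up.parent
            if path:
                base.append((tuple(reversed(path)), node.count))
        # Steps 2-3: mine the conditional FP-tree recursively with the
        # grown suffix.
        yield from fp_growth(base, min_support, pattern)

# Han et al.'s illustrative transactions, mined with minimum support 3.
txns = [({"f","c","a","m","p"}, 1), ({"f","c","a","b","m"}, 1),
        ({"f","b"}, 1), ({"c","b","p"}, 1), ({"f","c","a","m","p"}, 1)]
for pattern, s in sorted(fp_growth(txns, 3)):
    print(pattern, s)

Note that Step 2 needs no separate code path here: building a tree from a conditional pattern base is, by construction, the conditional FP-tree.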
Step 1: Construct the Conditional Pattern Base
• Start at the bottom of the frequent-item header table of the FP-tree.
• Traverse the FP-tree by following the node links of each frequent item.
• Accumulate all the transformed prefix paths of that item to form its conditional pattern base (illustrated below).
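As a concrete illustration, here is a hedged sketch of the counting step on m's conditional pattern base from the Han et al. example (the variable names are assumptions). Items whose support within the base falls below the minimum support are dropped before the conditional FP-tree is built:

from collections import Counter

# m's conditional pattern base: each transformed prefix path with the
# number of transactions it represents.
pattern_base_m = [(("f", "c", "a"), 2), (("f", "c", "a", "b"), 1)]

# Accumulate each item's support within the base.
support = Counter()
for path, count in pattern_base_m:
    for item in path:
        support[item] += count

min_support = 3
kept = {item for item, s in support.items() if s >= min_support}
print(support)  # Counter({'f': 3, 'c': 3, 'a': 3, 'b': 1})
print(kept)     # {'f', 'c', 'a'} -- 'b' (support 1 < 3) is pruned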
Principles of FP-Growth
(Why is 'b' not considered? Its support within m's conditional pattern base is only 1, below the minimum support, as computed in the sketch above.)
FP-Growth
Step 3: Recursively mine the conditional FP-tree
[Figure: recursive mining of the m-conditional FP-tree. The conditional FP-tree of "m" is the single path (f:3, c:3, a:3). Mining it yields the conditional FP-trees of "am": (f:3, c:3), "cm": (f:3) and "fm": {}, and recursing further yields "cam": (f:3), "fam": {}, "fcm": {} and finally the frequent pattern fcam.]

All frequent patterns containing m are the combinations of {f, c, a} joined with m:
m, fm, cm, am, fcm, fam, cam, fcam
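Because the m-conditional FP-tree consists of a single path, its frequent patterns can be enumerated directly as combinations of the path's items, without further tree building. A minimal sketch (variable names are illustrative):

from itertools import combinations

# Items on the single-path m-conditional FP-tree, each with support 3.
path_items = ["f", "c", "a"]

# Every combination of path items, suffixed with 'm', is frequent.
patterns = [("m",)]
for r in range(1, len(path_items) + 1):
    for combo in combinations(path_items, r):
        patterns.append(combo + ("m",))

print([",".join(p) for p in patterns])
# ['m', 'f,m', 'c,m', 'a,m', 'f,c,m', 'f,a,m', 'c,a,m', 'f,c,a,m']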
Summary of the FP-Growth Algorithm
• Mining frequent patterns can be viewed as first mining the 1-itemsets and then progressively growing each 1-itemset by recursively mining its conditional pattern base.
Evaluation of Association Patterns
• Objective interestingness measures:
– Use statistics derived from the data to determine whether a pattern is interesting or not.
– Examples are support, confidence and correlation (defined below).
• Subjective interestingness measures:
– A pattern is subjectively interesting if it reveals unexpected information about the data that can lead to profitable actions.
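For reference, the standard definitions of these measures for a rule X → Y over N transactions, writing σ(·) for the support count, can be stated as follows (lift is one common correlation-based measure):

\[
\mathrm{support}(X \to Y) = \frac{\sigma(X \cup Y)}{N}, \qquad
\mathrm{confidence}(X \to Y) = \frac{\sigma(X \cup Y)}{\sigma(X)}, \qquad
\mathrm{lift}(X \to Y) = \frac{\mathrm{confidence}(X \to Y)}{\mathrm{support}(Y)}
\]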
• Example: {Butter} → {Bread} may not be interesting because the relationship represents obvious information.
– But {Diapers} → {Beer} can be interesting, as the relationship is quite unexpected and can really help retailers cross-sell for profit.
• Determining subjective knowledge is a little difficult, as it requires prior information from domain experts.
Different Approaches for Incorporating Subjective Knowledge
• 1. Visualization:
– Domain experts interact with the data mining system by interpreting and verifying the discovered patterns.
• 2. Template-based approach:
– Instead of considering all the rules, only those rules that match user-specified templates are considered.
• 3. Subjective interestingness measures:
– A subjective measure can be defined from domain information such as a concept hierarchy, and used to filter out patterns that are obvious or not required.
Objective Measures of Interestingness
• A data-driven approach for evaluating the quality of an association rule.
• Domain-independent, and needs minimal input from the user (such as a threshold for filtering out low-quality patterns).
            Coffee   ¬Coffee   Total
Tea           15        5        20
¬Tea          75        5        80
Total         90       10       100

Association Rule: Tea → Coffee
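A quick hedged check of this rule, with the numbers read straight from the table above, shows why confidence alone can mislead:

# Contingency counts for Tea -> Coffee (from the table above).
n = 100           # total transactions
tea = 20          # transactions containing Tea
tea_coffee = 15   # transactions containing both Tea and Coffee
coffee = 90       # transactions containing Coffee

support = tea_coffee / n          # 0.15
confidence = tea_coffee / tea     # 0.75 -- looks like a strong rule
lift = confidence / (coffee / n)  # 0.75 / 0.90 ≈ 0.83 < 1

print(support, confidence, lift)

Although the confidence is a high 75%, the lift is below 1: a tea drinker is actually less likely to buy coffee than the average customer (90%), so the rule is misleading despite its high confidence.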