Frequent Item-Set Mining Methods: Prepared by Mr. Nilesh Magar
Methods
Apriori: uses prior knowledge of frequent item-set properties
Join
To find Lk, a set of candidate k-itemsets (Ck) is generated
by joining Lk-1 with itself.
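The join step can be sketched in Python as below. This is a minimal illustration, not the slides' own code: the function name apriori_join and the sample L2 are hypothetical, and it assumes the usual convention that two (k-1)-itemsets are joinable when their first k-2 items (in sorted order) agree.

```python
def apriori_join(prev_frequent):
    """Join L(k-1) with itself and return the candidate k-itemsets C(k)."""
    # De-duplicate, then sort the items inside each itemset so that
    # prefixes can be compared position by position.
    unique = {frozenset(s) for s in prev_frequent}
    prev = sorted(sorted(s) for s in unique)
    candidates = set()
    for i in range(len(prev)):
        for j in range(i + 1, len(prev)):
            a, b = prev[i], prev[j]
            # Joinable only when the first k-2 items are identical.
            if a[:-1] == b[:-1]:
                candidates.add(frozenset(a) | frozenset(b))
    return candidates


# Hypothetical L2 (frequent 2-itemsets), not taken from the slides.
L2 = [{"A", "B"}, {"A", "C"}, {"B", "C"}, {"B", "D"}]
C3 = apriori_join(L2)  # candidates {A,B,C} and {B,C,D}
```

Here {A,B} joins with {A,C}, and {B,C} joins with {B,D}; no other pair shares a first item.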
Prune
To reduce the size of Ck, the Apriori property is used:
if any (k-1)-subset of a candidate k-itemset is not in
Lk-1, then the candidate cannot be frequent either, so
it can be removed from Ck (subset testing).
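A small Python sketch of the prune step follows. The function name apriori_prune and the sample sets are hypothetical; it assumes candidates have k >= 2 items and that Lk-1 is given as a collection of sets.

```python
from itertools import combinations


def apriori_prune(candidates, prev_frequent):
    """Keep only candidates whose (k-1)-subsets are all in L(k-1)."""
    prev = {frozenset(s) for s in prev_frequent}
    kept = set()
    for cand in candidates:
        k = len(cand)
        # Subset testing: every (k-1)-subset must itself be frequent.
        if all(frozenset(sub) in prev for sub in combinations(cand, k - 1)):
            kept.add(frozenset(cand))
    return kept


# Hypothetical data, not taken from the slides.
L2 = [{"A", "B"}, {"A", "C"}, {"B", "C"}, {"B", "D"}]
C3 = [{"A", "B", "C"}, {"B", "C", "D"}]
pruned = apriori_prune(C3, L2)
```

{B,C,D} is removed because its subset {C,D} is not in L2, so only {A,B,C} still needs a support count against the database.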
Adv:
1) Uses Large item-set Property
2) Easily parallelized
3) Easy to implement
Dis-Adv:
1) Assumes transaction database is memory resident
2) Requires up to ‘m’ database scans
T1 f,c,a,m,p
T2 f,c,a,b,m
T3 f,b
T4 c,b,p
T5 f,c,a,p,m
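The transactions above can be loaded into an FP-tree with a short Python sketch. Assumptions, none of which are stated on the slides: the item order f, c, a, b, m, p is fixed by hand (several items tie on support), and the names Node and build_fp_tree are hypothetical. The header table's node-links are left out to keep it short.

```python
from collections import Counter

transactions = {
    "T1": ["f", "c", "a", "m", "p"],
    "T2": ["f", "c", "a", "b", "m"],
    "T3": ["f", "b"],
    "T4": ["c", "b", "p"],
    "T5": ["f", "c", "a", "p", "m"],
}

# Pass 1: support count of every item.
support = Counter(item for t in transactions.values() for item in set(t))


class Node:
    def __init__(self, item, parent=None):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}


def build_fp_tree(rows, item_order):
    """Pass 2: insert each transaction along a shared prefix path."""
    rank = {item: i for i, item in enumerate(item_order)}
    root = Node(None)
    for items in rows:
        node = root
        # Reorder the transaction by the global item order.
        for item in sorted((i for i in items if i in rank), key=rank.__getitem__):
            child = node.children.get(item)
            if child is None:
                child = Node(item, node)
                node.children[item] = child
            child.count += 1
            node = child
    return root


# Assumed order: descending support, ties broken by hand.
root = build_fp_tree(transactions.values(), ["f", "c", "a", "b", "m", "p"])
```

Under that assumed order, T1, T2 and T5 share the prefix f, c, a, so the tree stays much smaller than the raw transaction list.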
[Figure: header table and FP-tree rooted at {}; visible node labels include m:2, b:1, p:2, m:1]
FP-Growth Example
Item   Conditional pattern base   Conditional FP-tree
c      {(f:3)}                    {(f:3)}|c
f      Empty                      Empty
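The conditional pattern bases in these rows can be reproduced with a short Python sketch. Assumptions: it uses the five transactions T1 to T5 from the example and a hand-fixed item order f, c, a, b, m, p; instead of walking the tree's node-links it collects prefix paths directly from the reordered transactions, which yields the same paths and counts. The function name conditional_pattern_base is hypothetical.

```python
from collections import Counter

rows = [
    ["f", "c", "a", "m", "p"],  # T1
    ["f", "c", "a", "b", "m"],  # T2
    ["f", "b"],                 # T3
    ["c", "b", "p"],            # T4
    ["f", "c", "a", "p", "m"],  # T5
]
ITEM_ORDER = ["f", "c", "a", "b", "m", "p"]  # assumed, ties broken by hand


def conditional_pattern_base(rows, item_order, item):
    """Map each prefix path that ends just before `item` to its count."""
    rank = {it: i for i, it in enumerate(item_order)}
    base = Counter()
    for items in rows:
        ordered = sorted((i for i in items if i in rank), key=rank.__getitem__)
        if item in ordered:
            prefix = tuple(ordered[: ordered.index(item)])
            if prefix:  # an empty prefix contributes nothing
                base[prefix] += 1
    return dict(base)


base_c = conditional_pattern_base(rows, ITEM_ORDER, "c")
base_f = conditional_pattern_base(rows, ITEM_ORDER, "f")
```

For c this gives the single prefix path (f) with count 3, and for f the base is empty, matching the two rows of the table.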
Output: FP-Tree
Adv:
1) Only 2 Passes Over Data-set
2) No Candidate Generation
3) Typically much faster than Apriori (no candidate generation, no repeated database scans)
Dis-Adv:
1) FP-Tree may not fit in memory
2) FP-Tree is expensive to build