Association Rule Mining
{Diaper} → {Beer},
{Milk, Bread} → {Eggs, Coke},
{Beer, Bread} → {Milk},
Association Rule Mining (ARM)
• Itemset
• A collection of one or more items, e.g. {Milk, Bread, Diaper}
• k-itemset - An itemset that contains k items.
• Support count (σ)
• Frequency of occurrence of an itemset
• E.g. σ({Milk, Bread, Diaper}) = 2
• Support
• Fraction of transactions that contain an itemset
• E.g. s({Milk, Bread, Diaper}) = 2/5
• Frequent Itemset
• An itemset whose support is greater than or equal to a minsup threshold
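The definitions above can be sketched in a few lines of Python. The transaction table behind the examples is not reproduced in the text, so the five transactions below are an assumed illustration, chosen so that σ({Milk, Bread, Diaper}) = 2 and s = 2/5 as stated. The function names (support_count, support, is_frequent) are hypothetical and not taken from any library.

```python
# Assumed example data: the original transaction table is not shown in the
# text. These five baskets are illustrative and match the stated counts.
transactions = [
    {"Bread", "Milk"},
    {"Bread", "Diaper", "Beer", "Eggs"},
    {"Milk", "Diaper", "Beer", "Coke"},
    {"Bread", "Milk", "Diaper", "Beer"},
    {"Bread", "Milk", "Diaper", "Coke"},
]


def support_count(itemset, transactions):
    """Support count sigma: number of transactions containing every item."""
    itemset = set(itemset)
    # `itemset <= t` is the subset test: all items of itemset appear in t.
    return sum(1 for t in transactions if itemset <= t)


def support(itemset, transactions):
    """Support s: fraction of transactions that contain the itemset."""
    return support_count(itemset, transactions) / len(transactions)


def is_frequent(itemset, transactions, minsup):
    """An itemset is frequent if its support is at least minsup."""
    return support(itemset, transactions) >= minsup


three_itemset = {"Milk", "Bread", "Diaper"}  # a 3-itemset (k = 3)
print(support_count(three_itemset, transactions))
print(support(three_itemset, transactions))
print(is_frequent(three_itemset, transactions, minsup=0.4))
```

With this assumed data the 3-itemset has support count 2 and support 0.4, so it is frequent for minsup = 0.4 but not for minsup = 0.6.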
• Rule Evaluation Metrics
• Support (s)
• Fraction of transactions that contain both X and Y
• The support of an itemset I is defined as the fraction of the
transactions in the database T = {T₁, …, Tₙ} that contain I as a subset.
• Confidence (c)
• Measures how often items in Y appear in transactions that contain X.
• The confidence conf(X ⇒ Y) of the rule X ⇒ Y is the conditional
probability of X ∪ Y occurring in a transaction, given that the
transaction contains X. Therefore, the confidence conf(X ⇒ Y) is
defined as follows:
conf(X ⇒ Y) = s(X ∪ Y) / s(X) = σ(X ∪ Y) / σ(X)
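As a minimal sketch, both rule metrics can be computed by counting transactions, taking confidence as the ratio σ(X ∪ Y) / σ(X). The five transactions are again an assumed illustration (the original table is not shown in the text), and rule_metrics is a hypothetical helper name. Exact fractions are used so the results read the same way as the slide examples.

```python
from fractions import Fraction

# Assumed example data: illustrative baskets, not taken from the text.
transactions = [
    {"Bread", "Milk"},
    {"Bread", "Diaper", "Beer", "Eggs"},
    {"Milk", "Diaper", "Beer", "Coke"},
    {"Bread", "Milk", "Diaper", "Beer"},
    {"Bread", "Milk", "Diaper", "Coke"},
]


def support_count(itemset, transactions):
    """Number of transactions that contain every item of the itemset."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t)


def rule_metrics(x, y, transactions):
    """Return (support, confidence) of the rule X => Y as exact fractions."""
    x, y = set(x), set(y)
    n_xy = support_count(x | y, transactions)  # sigma(X u Y)
    n_x = support_count(x, transactions)       # sigma(X)
    if n_x == 0:
        # Confidence is a conditional probability given X; it is undefined
        # when no transaction contains X.
        raise ValueError("antecedent X does not occur in any transaction")
    s = Fraction(n_xy, len(transactions))  # fraction containing X and Y
    c = Fraction(n_xy, n_x)                # fraction of X-transactions with Y
    return s, c


s, c = rule_metrics({"Diaper"}, {"Beer"}, transactions)
print(f"support = {s}, confidence = {c}")
```

On this assumed data the rule {Diaper} → {Beer} has support 3/5 and confidence 3/4: Diaper appears in four transactions, and three of those also contain Beer.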