Association Rule Mining: Data Mining and Knowledge Discovery Prof. Carolina Ruiz and Weiyang Lin
Association Rule Mining: Data Mining and Knowledge Discovery Prof. Carolina Ruiz and Weiyang Lin
Association Rule Mining: Data Mining and Knowledge Discovery Prof. Carolina Ruiz and Weiyang Lin
Problem Statement
Given:
a set of transactions (each transaction is a set of items)
user-specified minimum support
user-specified minimum confidence
Find:
all association rules that have support and confidence
greater than or equal to the user-specified minimum
support and minimum confidence
Naïve Procedure to mine rules
List all the subsets of the Complexity: Let n be the number
set of items of items. The number of rules
For each subset naively considered is:
Split the subset into two n i-1
n i
parts (one for the
antecedent and one for the S[(i ) * S ( )] k
consequent of the rule i=2 k=1
Compute the support of the
rule n
n i
Compute the confidence of
the rule = S[(i ) * (2 -2) ]
IF support and confidence i=2
are no lower than user-
specified min. support and
confident THEN output the =3 –2 n (n+1)
+1
rule
The Apriori Algorithm