Answer To Assignment 3
Question 1 (50 points): Given the following transactions, a minimum support of 50%
and a minimum confidence of 80%, find the large itemsets, sequential patterns, rules and
lifts, and recommend some management decisions.
TID Brand_Item_bought
100 King’s-Crab, Sunset-Milk, Dairyland-Cheese, Best-Bread
200 Best-Cheese, Dairyland-Milk, Goldenfarm-Apple, Tasty-Pie, Wonder-Bread
300 Westcoast-Apple, Dairyland-Milk, Wonder-Bread, Tasty-Pie
400 Wonder-Bread, Sunset-Milk, Dairyland-Cheese
a) At the granularity of item without brand (e.g., “milk” and “bread”), please identify all
large itemsets using the Apriori algorithm. Be sure to include all steps in Apriori, i.e.,
Large (k-1)-itemset → Candidate k-itemset (Join, Prune) → Large k-itemset.
{Bread, Cheese, Pie} is pruned because its subset {Cheese, Pie} is not a large
2-itemset: it appears only in transaction 200, a support of 25%.
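The Join and Prune steps for part a) can be checked with a short script. This is only a minimal sketch, not the expected hand-worked answer: it assumes the item-level transactions are obtained by dropping the brand prefix from each entry in the table above, and the names `support` and `apriori` are illustrative.

```python
from itertools import combinations

# Item-level transactions: brands stripped from the table in Question 1
# (assumption: "Sunset-Milk" and "Dairyland-Milk" both count as "Milk").
transactions = {
    100: {"Crab", "Milk", "Cheese", "Bread"},
    200: {"Cheese", "Milk", "Apple", "Pie", "Bread"},
    300: {"Apple", "Milk", "Bread", "Pie"},
    400: {"Bread", "Milk", "Cheese"},
}
MIN_SUPPORT = 0.5  # 50%, i.e. at least 2 of the 4 transactions


def support(itemset):
    """Fraction of transactions that contain every item in itemset."""
    hits = sum(1 for items in transactions.values() if itemset <= items)
    return hits / len(transactions)


def apriori():
    """Return a list whose entry k-1 is the set of large k-itemsets."""
    items = sorted({i for t in transactions.values() for i in t})
    large = {frozenset([i]) for i in items
             if support(frozenset([i])) >= MIN_SUPPORT}
    levels = []
    k = 1
    while large:
        levels.append(large)
        k += 1
        # Join: unions of two large (k-1)-itemsets that have exactly k items.
        candidates = {a | b for a in large for b in large if len(a | b) == k}
        # Prune: every (k-1)-subset of a candidate must itself be large.
        candidates = {c for c in candidates
                      if all(frozenset(s) in large
                             for s in combinations(c, k - 1))}
        # Count: keep the candidates that meet the minimum support.
        large = {c for c in candidates if support(c) >= MIN_SUPPORT}
    return levels


levels = apriori()
for k, itemsets in enumerate(levels, start=1):
    print(k, sorted(sorted(s) for s in itemsets))
```

In this sketch {Bread, Cheese, Pie} is generated by the Join step but removed by the Prune step, because {Cheese, Pie} is not among the large 2-itemsets.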
c) Please list all association rules (i.e., association rules that meet minimum support and
minimum confidence requirements) derived from the itemsets you derived in b) and
their supports, confidences and lifts.
The store can place Tasty-Pie and Wonder-Bread near Dairyland-Milk to further
encourage customers to buy them together.
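The measures behind such a recommendation can be computed directly from the brand-level transactions. The sketch below uses the usual definitions: the support of X → Y is support(X ∪ Y), the confidence is support(X ∪ Y) / support(X), and the lift is the confidence divided by support(Y). The function name `rule_metrics` is illustrative.

```python
# Brand-level transactions, copied from the table in Question 1.
transactions = [
    {"King's-Crab", "Sunset-Milk", "Dairyland-Cheese", "Best-Bread"},
    {"Best-Cheese", "Dairyland-Milk", "Goldenfarm-Apple", "Tasty-Pie",
     "Wonder-Bread"},
    {"Westcoast-Apple", "Dairyland-Milk", "Wonder-Bread", "Tasty-Pie"},
    {"Wonder-Bread", "Sunset-Milk", "Dairyland-Cheese"},
]


def support(itemset):
    """Fraction of transactions that contain every item in itemset."""
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def rule_metrics(antecedent, consequent):
    """Return (support, confidence, lift) for antecedent -> consequent."""
    a, c = frozenset(antecedent), frozenset(consequent)
    sup = support(a | c)      # support of the whole rule
    conf = sup / support(a)   # how often the rule holds when a occurs
    lift = conf / support(c)  # above 1 means a and c co-occur more than chance
    return sup, conf, lift


sup, conf, lift = rule_metrics({"Dairyland-Milk"}, {"Tasty-Pie", "Wonder-Bread"})
print(f"support={sup:.0%} confidence={conf:.0%} lift={lift:.2f}")
# support=50% confidence=100% lift=2.00
```

The reverse direction is weaker: Wonder-Bread → Dairyland-Milk has a confidence of 2/3, which falls below the 80% threshold, so the rule only qualifies with Dairyland-Milk as the antecedent.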
Question 2 (25 points): Let the minimum support be 60% when you derive large
sequences from the following transaction database.
a) Please identify all large sequences using the Apriori algorithm. Be sure to include all
steps in Apriori, i.e., Large (k-1)-sequences → Candidate k-sequences (Join, Prune) →
Large k-sequences.