1 Lab Program 4 2 Vinay Sirohi 3 2139472: December 1, 2021
1 Lab Program 4 2 Vinay Sirohi 3 2139472: December 1, 2021
1 Lab Program 4 2 Vinay Sirohi 3 2139472: December 1, 2021
December 1, 2021
1 Lab Program 4
2 Vinay Sirohi
3 2139472
3.1 Generate FP Tree for a transaction dataset
[54]: #Importing mlxtend and printing the current version of it.
import mlxtend
print(mlxtend.__version__)
0.19.0
1
10 1340 24-02-2015 citrus fruit
11 2193 14-04-2015 beef
12 1997 21-07-2015 frankfurter
13 4546 03-09-2015 chicken
14 4736 21-07-2015 butter
15 1959 30-03-2015 fruit/vegetable juice
16 1974 03-05-2015 packaged fruit/vegetables
17 2421 02-09-2015 chocolate
18 1513 03-08-2015 specialty bar
19 1905 07-07-2015 other vegetables
df
2
itemDescription
0 tropical fruit,rolls/buns,candy,tropical fruit…
1 whole milk,tropical fruit,chocolate,whole milk…
2 pip fruit,other vegetables,flour,pip fruit,oth…
3 other vegetables,onions,shopping bags,other ve…
4 whole milk,other vegetables,white bread,whole …
… …
38760 whole milk,yogurt,sliced cheese,whole milk,yog…
38761 cat food,yogurt,candy,cat food,yogurt,candy,ca…
38762 sausage,whole milk,cake bar,sausage,whole milk…
38763 beef,canned beer,fruit/vegetable juice,beef,ca…
38764 ham,seasonal products,cat food,ham,seasonal pr…
[34]: #Let's now create a list of the transactions so that we can transform our data␣
,→into the correct format using TransactionEncoder.
df1=[]
for i in range(0,len(df)-1):
data = df['itemDescription'][i].split(',')
df1.append(data)
3
4 False False False False False False
… … … … … … …
38759 False False False False True False
38760 False False False False False False
38761 False False False False False False
38762 False False False False False False
38763 False False False False True False
4
7 0.023140 (onions)
8 0.028841 (white bread)
9 0.060649 (citrus fruit)
10 0.040140 (fruit/vegetable juice)
11 0.069497 (bottled water)
12 0.048344 (whipped/sour cream)
13 0.046048 (newspapers)
14 0.112269 (soda)
15 0.060623 (pastry)
16 0.101434 (yogurt)
17 0.054742 (bottled beer)
18 0.039624 (beef)
19 0.055954 (canned beer)
20 0.047983 (frankfurter)
21 0.044165 (brown bread)
22 0.033072 (chicken)
23 0.022315 (waffles)
24 0.041895 (butter)
25 0.024533 (UHT-milk)
26 0.025384 (hamburger meat)
27 0.020638 (frozen meals)
28 0.038464 (margarine)
29 0.026287 (napkins)
30 0.040914 (curd)
31 0.021128 (long life bakery product)
32 0.020586 (butter milk)
33 0.036658 (coffee)
34 0.044139 (domestic eggs)
35 0.074580 (sausage)
36 0.022289 (salty snack)
37 0.079404 (root vegetables)
38 0.043778 (pork)
39 0.033201 (frozen vegetables)
40 0.026029 (dessert)
41 0.027035 (cream cheese )
42 0.024739 (berries)
43 0.021128 (sugar)
44 0.022237 (whole milk, rolls/buns)
45 0.022340 (other vegetables, whole milk)
5
3 (rolls/buns) (whole milk) 0.127954