
Statistical Computing with R:
Masters in Data Sciences 503 (S28)
Third Batch, SMS, TU, 2024

Shital Bhandary
Associate Professor
Statistics/Bio-statistics, Demography and Public Health Informatics
Patan Academy of Health Sciences, Lalitpur, Nepal
Faculty, Data Analysis and Decision Modeling, MBA, Pokhara University, Nepal
Faculty, FAIMER Fellowship in Health Professions Education, India/USA.
Review & Preview: Unsupervised models
Review:
• Association rules learning
• Market-Basket analysis

Preview:
• Monte Carlo simulations
• Good old days!
• Class imbalance problem
  • Statistical approach
  • Data science approach
Association rules learning/mining:
https://towardsdatascience.com/association-rule-mining-in-r-ddf2d044ae50

• Association Rule Mining (also called Association Rule Learning) is a common technique used to find associations (co-occurrence) between many variables.
• It is often used by grocery stores, e-commerce websites, and anyone with large transactional databases.
• A most common example that we encounter in our daily lives: Amazon knows what else you want to buy when you order something on their site.
• The same idea extends to Spotify too: they know what song you want to listen to next.
• All of these incorporate, at some level, data mining concepts and association rule mining algorithms.
Association rules: example problem
https://www.datacamp.com/community/tutorials/market-basket-analysis-r

• You get a client who runs a retail store and gives you data for all transactions, consisting of items bought in the store by several customers over a period of time.
• Your client then asks you to use that data to help boost their business.
• Your client will use your findings not only to change/update/add items in inventory but also to change the layout of the physical store, or rather the online store.
• To find results that will help your client, you will use Market Basket Analysis (MBA), which applies Association Rule Mining to the given transaction data.
Use of association rules mining results:
https://www.datacamp.com/community/tutorials/market-basket-analysis-r

• Changing the store layout according to trends
• Customer behavior analysis
• Catalogue design
• Cross marketing on online stores
• Identifying the trending items customers buy
• Customized emails with add-on sales
• etc.
Association rule mining: If => Then analysis
https://www.datacamp.com/community/tutorials/market-basket-analysis-r

• Association Rule Mining is used when you want to find associations between different objects in a set, or to find frequent patterns in a transaction database, relational databases, or any other information repository.
• Its applications are found in marketing, basket data analysis (or Market Basket Analysis) in retailing, clustering, and classification.
• It can tell you which items customers frequently buy together by generating a set of rules called Association Rules.
• In simple words, it gives you output as rules of the form "if this, then that".
What is the apriori algorithm and what is a rule?
http://r-statistics.co/Association-Mining-With-R.html

• Association mining is usually done on transactions data from a retail market or from an online e-commerce store.
• Since most transactions data is large, the apriori algorithm makes it easier to find these patterns or rules quickly.
• A rule is a notation that represents which item(s) are frequently bought with which other item(s).
• It has an LHS and an RHS part and can be represented as follows:

  itemsetA => itemsetB

• This means the item(s) on the right were frequently purchased along with the item(s) on the left.
How to measure the strength of a rule?
http://r-statistics.co/Association-Mining-With-R.html

• The apriori algorithm generates the most relevant set of rules from given transaction data.
• It also reports the support, confidence and lift of those rules.
• These three measures can be used to decide the relative strength of the rules.
• How are they computed? Consider the rule A => B:

Support = (number of transactions with both A and B) / (total number of transactions)
        = P(A ∩ B) = frequency(A, B) / N

Confidence = (number of transactions with both A and B) / (number of transactions with A)
           = P(A ∩ B) / P(A) = frequency(A, B) / frequency(A)

Expected Confidence = (number of transactions with B) / (total number of transactions)
                    = P(B) = frequency(B) / N

Lift = Confidence / Expected Confidence
     = P(A ∩ B) / (P(A) · P(B)) = Support(A, B) / (Support(A) · Support(B))
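To make these formulas concrete, here is a minimal base-R sketch; the helper names (contains, support, confidence, lift) are illustrative, not from any package, and the baskets are the five example transactions used later in these slides:

# Five example baskets (T1 to T5, as on the later slides)
baskets <- list(
  c("bread", "milk"),
  c("bread", "diapers", "beer", "Eggs"),
  c("milk", "diapers", "beer", "cola"),
  c("bread", "milk", "diapers", "beer"),
  c("bread", "milk", "diapers", "cola")
)

# TRUE if a basket contains every item in `items`
contains <- function(basket, items) all(items %in% basket)

# Support = frequency(items) / N
support <- function(items) mean(sapply(baskets, contains, items = items))

# Confidence = P(A and B) / P(A)
confidence <- function(A, B) support(c(A, B)) / support(A)

# Lift = P(A and B) / (P(A) * P(B)) = Confidence / Expected Confidence
lift <- function(A, B) support(c(A, B)) / (support(A) * support(B))

support(c("diapers", "beer"))   # 3/5 = 0.6
confidence("diapers", "beer")   # 0.6 / 0.8 = 0.75
lift("diapers", "beer")         # 0.6 / (0.8 * 0.6) = 1.25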
Association rule: Support and confidence
• Association rules are given in the form below:

  A => B [Support, Confidence]

• The part before => is referred to as the "if" part (antecedent) and the part after => as the "then" part (consequent).
• A and B are sets of items in the transaction data, and A and B are disjoint sets.

Example: Computer => Anti-virus Software [Support = 20%, Confidence = 60%]

The above rule says:
• 20% of transactions show anti-virus software bought together with a computer (support).
• 60% of customers who purchase a computer also buy anti-virus software (confidence).
Lift:
• Lift is the factor by which the co-occurrence of A and B exceeds the expected probability of A and B co-occurring, had they been independent.
• So, the higher the lift, the higher the chance of A and B occurring together.
• lift = 1: implies no association between the items.
• lift > 1: item B is likely to be bought if item A is bought.
• lift < 1: item B is unlikely to be bought if item A is bought.
Note:
• Frequent itemsets: itemsets whose support is greater than or equal to the minimum support threshold (min_sup).
• min_sup is set by the user's choice.
• Strong rules: if a rule A => B [Support, Confidence] satisfies min_sup and min_confidence, then it is a strong rule.
• Coverage: coverage (also called cover or LHS-support) is the support of the left-hand side of the rule, i.e., supp(X) for a rule X => Y. It is a measure of how often the rule can be applied.
Example:
https://www.datacamp.com/community/tutorials/market-basket-analysis-r

Calculate the following for {Bread => Milk}, using the five transactions T1 to T5 shown on the next slide:

• Support(Bread) = f(Bread)/N = 4/5 = 0.8
• Support(Milk) = f(Milk)/N = 4/5 = 0.8
• Support(Bread, Milk) = f(Bread, Milk)/N = 3/5 = 0.6
• Confidence(Bread => Milk) = Support(Bread, Milk)/Support(Bread) = 0.6/0.8 = 0.75
• Expected Confidence(Bread => Milk) = P(Milk) = 4/5 = 0.8
• Lift(Bread => Milk) = Confidence/Expected Confidence = 0.75/0.80 = 0.9375,
  or equivalently Support(Bread, Milk)/[Support(Bread) · Support(Milk)] = 0.6/0.64 = 0.9375
• Coverage(Bread => Milk) = support(lhs) = Support(Bread) = 0.8
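Reusing the helper sketch from the measures slide, these numbers can be checked directly:

support("bread")              # 4/5 = 0.8
support("milk")               # 4/5 = 0.8
support(c("bread", "milk"))   # 3/5 = 0.6
confidence("bread", "milk")   # 0.6 / 0.8 = 0.75
lift("bread", "milk")         # 0.6 / (0.8 * 0.8) = 0.9375
support("bread")              # Coverage = support(lhs) = 0.8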
Let’s do it in R!
# create a list of baskets
market_basket <- list(
  c("bread", "milk"),
  c("bread", "diapers", "beer", "Eggs"),
  c("milk", "diapers", "beer", "cola"),
  c("bread", "milk", "diapers", "beer"),
  c("bread", "milk", "diapers", "cola")
)

# set transaction names (T1 to T5)
names(market_basket) <- paste("T", c(1:5), sep = "")
Let’s use the “arules” package and get some outputs:

library(arules)

# Transformation to transactions data
trans <- as(market_basket, "transactions")

# Dimensions
dim(trans)
[1] 5 6   # 5 transactions, 6 items

# Item labels
itemLabels(trans)
[1] "beer"    "bread"   "cola"    "diapers" "Eggs"    "milk"

# Summary
summary(trans)

# Plot
image(trans)
Let’s use the “arules” package and get some outputs:

summary(trans)

transactions as itemMatrix in sparse format with
 5 rows (elements/itemsets/transactions) and
 6 columns (items) and a density of 0.6 (non-zero cells)

most frequent items:
  bread diapers    milk    beer    cola (Other)
      4       4       4       3       2       1

element (itemset/transaction) length distribution:
sizes
 2  4   (itemset sizes)
 1  4   (number of transactions of each size)

   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
    2.0     4.0     4.0     3.6     4.0     4.0
Let’s inspect “trans”:

inspect(trans)
    items                        transactionID
[1] {bread, milk}                T1
[2] {beer, bread, diapers, Eggs} T2
[3] {beer, cola, diapers, milk}  T3
[4] {beer, bread, diapers, milk} T4
[5] {bread, cola, diapers, milk} T5
Plot of “trans”:

#Plot
image(trans)
Apriori algorithm: why?
• Frequent itemset generation is the most computationally expensive step because it requires a full database scan.
• In the example above we had only 5 transactions, but real-world retail transaction data can run to GBs and TBs, for which an optimized algorithm is needed to prune out itemsets that will not help in later steps.
• For this, the APRIORI algorithm is used to create new rules.
• Since support and confidence measure how interesting a rule is, we use them to create the rules.
• New rules are constrained by the minimum support and minimum confidence thresholds.
• The closer a rule is to these thresholds, the more the rule is of use to the client.
• These thresholds, set by the client, help to compare rule strength according to your own or the client's needs.
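As an aside, apriori() can also return the frequent itemsets themselves instead of rules; a small sketch, assuming the `trans` object built earlier:

library(arules)

# Mine frequent itemsets (instead of rules) at minimum support 0.3
itemsets <- apriori(trans,
                    parameter = list(supp = 0.3,
                                     target = "frequent itemsets"))

# Show the most frequent itemsets first
inspect(head(sort(itemsets, by = "support"), 5))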
Apriori algorithm on “trans” with minimum support of 0.3 and minimum confidence of 0.5:

# Min support 0.3, min confidence 0.5
rules <- apriori(trans,
                 parameter = list(supp = 0.3, conf = 0.5,
                                  maxlen = 10,
                                  target = "rules"))

Apriori

Parameter specification:
 confidence minval smax arem aval originalSupport maxtime support minlen maxlen target ext
        0.5    0.1    1 none FALSE           TRUE       5     0.3      1     10  rules TRUE

Algorithmic control:
 filter tree heap memopt load sort verbose
    0.1 TRUE TRUE  FALSE TRUE    2    TRUE

Note: maxlen is the maximum number of items in a rule (LHS plus RHS). We could have used maxlen = 4 here, since we know no transaction has more than 4 items, but this will not be known in real life!
Summary of the “rules”:
summary(rules) # mining info:
#summary of quality measures: • data ntransactions support confidence
support confidence coverage lift • trans 5 0.3 0.5
Min. :0.4000 Min. :0.5000 Min. :0.4000 Min. :0.8333
1st Qu.:0.4000 1st Qu.:0.6667 1st Qu.:0.6000 1st Qu.:0.8333
Median :0.4000 Median :0.7500 Median :0.6000 Median :1.0000
Mean :0.4938 Mean :0.7474 Mean :0.6813 Mean :1.0473
3rd Qu.:0.6000 3rd Qu.:0.8000 3rd Qu.:0.8000 3rd Qu.:1.2500
Max. :0.8000 Max. :1.0000 Max. :1.0000 Max. :1.6667
Inspection of the “rules” with minlen:
Inspect (rules) #Output from R:
lhs rhs support confidence coverage lift count
• [1] {} => {beer} 0.6 0.6000000 1.0 1.0000000 3
• [2] {} => {milk} 0.8 0.8000000 1.0 1.0000000 4
• [3] {} => {bread} 0.8 0.8000000 1.0 1.0000000 4
• [4] {} => {diapers} 0.8 0.8000000 1.0 1.0000000 4
• [5] {cola} => {milk} 0.4 1.0000000 0.4 1.2500000 2
• [6] {milk} => {cola} 0.4 0.5000000 0.8 1.2500000 2
• [7] {cola} => {diapers} 0.4 1.0000000 0.4 1.2500000 2
• [8] {diapers} => {cola} 0.4 0.5000000 0.8 1.2500000 2
• [9] {beer} => {milk} 0.4 0.6666667 0.6 0.8333333 2
• [10] {milk} => {beer} 0.4 0.5000000 0.8 0.8333333 2
• [11] {beer} => {bread} 0.4 0.6666667 0.6 0.8333333 2
• [12] {bread} => {beer} 0.4 0.5000000 0.8 0.8333333 2
• [13] {beer} => {diapers} 0.6 1.0000000 0.6 1.2500000 3
• [14] {diapers} => {beer} 0.6 0.7500000 0.8 1.2500000 3
• [15] {milk} => {bread} 0.6 0.7500000 0.8 0.9375000 3
• [16] {bread} => {milk} 0.6 0.7500000 0.8 0.9375000 3
• ….
• [32]
We can remove the “empty” rules
rules <- apriori(trans, • set of 28 rules
parameter = list(supp=0.3, • rule length distribution (lhs + rhs):
conf=0.5, sizes
maxlen=10, • 2 3
minlen=2, • 16 12
target= "rules")) •
lhs rhs support confidence coverage lift count
• [1] {cola} => {milk} 0.4 1.0000000 0.4 1.2500000 2
• [2] {milk} => {cola} 0.4 0.5000000 0.8 1.2500000 2
• [3] {cola} => {diapers} 0.4 1.0000000 0.4 1.2500000 2
• …
• [17] {cola, milk} => {diapers} 0.4 1.0000000 0.4 1.2500000 2
• [18] {cola, diapers} => {milk} 0.4 1.0000000 0.4 1.2500000 2
• [19] {diapers, milk} => {cola} 0.4 0.6666667 0.6 1.6666667 2
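Alternatively (a sketch using the arules accessors lhs() and size()), the empty-LHS rules could be dropped from the first rule set without re-mining:

# Keep only rules whose left-hand side contains at least one item
nonempty_rules <- rules[size(lhs(rules)) > 0]
inspect(head(nonempty_rules))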
Let’s set an RHS rule for the “trans” data:

# To analyze what items customers buy before buying {beer},
# we set rhs = "beer" and default = "lhs":
beer_rules_rhs <- apriori(trans,
                          parameter = list(supp = 0.3, conf = 0.5,
                                           maxlen = 10, minlen = 2),
                          appearance = list(default = "lhs",
                                            rhs = "beer"))

# Inspect
inspect(beer_rules_rhs)
    lhs                  rhs    support confidence coverage lift      count
[1] {bread}           => {beer} 0.4     0.5000000  0.8      0.8333333 2
[2] {milk}            => {beer} 0.4     0.5000000  0.8      0.8333333 2
[3] {diapers}         => {beer} 0.6     0.7500000  0.8      1.2500000 3
[4] {bread, diapers}  => {beer} 0.4     0.6666667  0.6      1.1111111 2
[5] {diapers, milk}   => {beer} 0.4     0.6666667  0.6      1.1111111 2
Let’s set an LHS rule for the “trans” data:

# To analyze what items customers buy together with {beer} once they buy it,
# we set lhs = "beer" and default = "rhs":
beer_rules_lhs <- apriori(trans,
                          parameter = list(supp = 0.3, conf = 0.5,
                                           maxlen = 10, minlen = 2),
                          appearance = list(lhs = "beer",
                                            default = "rhs"))

# Inspect the result:
inspect(beer_rules_lhs)
    lhs         rhs       support confidence coverage lift      count
[1] {beer}   => {bread}   0.4     0.6666667  0.6      0.8333333 2
[2] {beer}   => {milk}    0.4     0.6666667  0.6      0.8333333 2
[3] {beer}   => {diapers} 0.6     1.0000000  0.6      1.2500000 3
Product recommendation rule:

# Product recommendation rule: sort rules by confidence
rules_conf <- sort(rules, by = "confidence", decreasing = TRUE)

# Inspect the rules: show the support, confidence and lift for the top rules
inspect(head(rules_conf))
    lhs                 rhs       support confidence coverage lift count
[1] {cola}           => {milk}    0.4     1          0.4      1.25 2
[2] {cola}           => {diapers} 0.4     1          0.4      1.25 2
[3] {beer}           => {diapers} 0.6     1          0.6      1.25 3
[4] {cola, milk}     => {diapers} 0.4     1          0.4      1.25 2
[5] {cola, diapers}  => {milk}    0.4     1          0.4      1.25 2
[6] {beer, milk}     => {diapers} 0.4     1          0.4      1.25 2
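The same pattern works with any quality measure; for instance, a sketch sorting the same rules object by lift:

# Rank rules by lift instead of confidence
rules_lift <- sort(rules, by = "lift", decreasing = TRUE)
inspect(head(rules_lift))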
Plotting rules with the “arulesViz” package:

library(arulesViz)
plot(rules)

plot(rules, measure = "confidence")

plot(rules, method = "two-key plot")
Interactive plot with the “plotly” engine:

#Interactive plot
plot(rules, engine = "plotly")
Graph-based visualization:

#Graph based visualization of the top 10 rules by confidence
subrules <- head(rules, n = 10, by = "confidence")
plot(subrules, method = "graph", engine = "htmlwidget")
Parallel coordinate plot for 10 rules:

#Parallel coordinate plot
plot(subrules, method = "paracoord")
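arulesViz also ships an interactive, shiny-based explorer that combines these views in one app (assuming the shiny package is installed):

# Interactive rule explorer (opens a shiny app in the browser)
ruleExplorer(rules)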
More here:
• Like the one we did before: https://www.kirenz.com/post/2020-05-14-r-association-rule-mining/
• Real-life example: https://www.youtube.com/watch?v=91CmrpD-4Fw
Questions/queries?

• Next class:
  • Monte Carlo simulations
  • Class imbalance problem
    • Statistical approach
    • Data science approach
Thank you!
@shitalbhandary
