Mining Frequent Patterns and Associations
Mining Frequent Patterns and Associations
This is a technique that gives the careful study of purchases done by a customer in a
supermarket. This concept identifies the pattern of frequent purchase items by customers.
This analysis can help to promote deals, offers, sale by the companies, and data mining
techniques helps to achieve this analysis task. Example:
● Data mining concepts are in use for Sales and marketing to provide better
response rates.
● Risk Assessment and Fraud area also use the data-mining concept for
Market basket analysis mainly works with the ASSOCIATION RULE {IF} -> {THEN}.
the antecedent.
There are three types of Market Basket Analysis. They are as follow:
1. Descriptive market basket analysis: This sort of analysis looks for patterns and
connections in the data that exist between the components of a market basket.
what products are purchased in combination and what the most typical item
2. Predictive Market Basket Analysis: Market basket analysis that predicts future
basket analysis. Large volumes of data are analyzed using machine learning
products are most likely to be bought together in the future. Retailers may make
data-driven decisions about which products to carry, how to price them, and
how to optimize shop layouts with the use of predictive market basket research.
analyses two sets of market basket data to identify variations between them.
over time is a common usage for this kind of study. Retailers can respond to
shifting consumer behavior by modifying their marketing and sales tactics with
customer behavior, including what products they buy together and which
products they buy the most frequently. Retailers can use this information to
can determine which products are sluggish sellers and which ones are
well-informed choices about what products to stock and how to manage their
product prices and consumer behavior might help merchants develop better
pricing strategies. Using this knowledge, pricing plans that boost sales and
which products are most frequently bought together and where they should be
positioned in the store to grow sales. Retailers may boost revenue and enhance
positioning.
1. Retail: Market basket research is frequently used in the retail sector to examine
inventory management, and pricing tactics. Retailers can utilize market basket
research to identify which items are sluggish sellers and which ones are
strategy accordingly.
understand the customer buying habits and make data-driven decisions about
problem areas.
3. Finance: Market basket analysis can be used to evaluate investor behaviour and
forecast the types of investment items that investors will likely buy in the future.
business might employ market basket analysis. The usage of this data can
decisions about which products to produce and which materials to employ in the
production process, the manufacturing sector might use market basket analysis.
1.2 Frequent Item sets, Closed Item sets, and Association Rule:
Frequent itemset mining is a market basket analysis methodology that helps to find
patterns in the shopping behaviors of the users across different shopping platforms. These
relationships are represented in the form of association rules. Frequent element set or
pattern mining is widely used due to its wide applications in pattern mining, correlations, and
constraints that are based on frequent patterns, sequential patterns, and many other data
mining tasks. Talking specifically, this technique is used to find sets of products that are
frequently bought together.
Closed Item set mining is a frequent itemset that is both closed and its support is greater
An itemset is closed in a data set if there exists no superset that has the same support count
as this original itemset.
Association Rules
Applications
Association rules help to predict the occurrence of one item based on the occurrences of
other items in a set of transactions.
Examples
Finding recurrent patterns or item sets in huge datasets is the goal of frequent
pattern mining, a crucial data mining approach. It looks for groups of objects
that regularly appear together in order to expose underlying relationships and
interdependence. Market basket analysis, web usage mining, and bioinformatics
are a few areas where this method is important.
Apriori Algorithm
One of the most popular methods, the Apriori algorithm, uses a step−by−step
procedure to find frequent item sets. It starts by creating candidate itemsets of
length 1, determining their support, and eliminating any that fall below the
predetermined cutoff. The method then joins the frequent itemsets from the
previous phase to produce bigger itemsets repeatedly.
Once no more common item sets can be located, the procedure is repeated. The
Apriori approach is commonly used because of its efficiency and simplicity, but
because it requires numerous database scans for big datasets, it can be
computationally inefficient.
FP−growth Algorithm
Eclat Algorithm
Equivalence Class Clustering and bottom−up Lattice Traversal are the acronyms
for the Eclat algorithm, a well−liked frequent pattern mining method. It explores
the itemset lattice using a depth−first search approach, concentrating on the
representation of vertical data formats.
Web usage mining is examining user navigation patterns to learn more about
how people use websites. In order to personalize websites and enhance their
performance, frequent pattern mining makes it possible to identify recurrent
navigation patterns and session patterns. Businesses can change content,
layout, and navigation to improve user experience and boost engagement by
studying how consumers interact with a website.
Bioinformatics
going through all possible combinations of frequent item sets and pruning the
○ S --> (I - S )
● At the point when a uniform least help edge is utilised, the pursuit system is
rearranged.
● A similar least help edge is utilised when mining at each degree of deliberation.
(for example for mining from “PC” down to “PC”). Both “PC” and “PC” are
At the point when a uniform least help edge is used, the search methodology is
simplified. The technique is likewise basic in that clients are needed to determine
search keeps away from analyzing item sets containing anything that doesn’t
have minimum support. The uniform support approach however has some
threshold is set too high it could miss several meaningful associations occurring
at low abstraction levels. This provides the motivation for the following
approach.
2. Reduce Support –
For mining various level relationship with diminished support, there are various
● Level-by-Level independence –
A thing at the I level is inspected if and just if its parent hub at the
from a more broad one. If a hub is frequent, its kids will be examined;
● Level-by-Level independence –
A thing at the 1st level is inspected if and just if its parent hub at the
relationship from a more broad one. If a hub is frequent, its kids will
3. Group-based support –
The group-wise threshold value for support and confidence is input by the user
or expert. The group is selected based on a product price or item set because
often expert has insight as to which groups are more important than others.
Example –
For e.g. Experts are interested in purchase patterns of laptops or clothes in the
non and electronic category. Therefore low support threshold is set for this
should be continuous.
Example –
Note –
Information blocks are appropriate for mining since they make mining quicker. The
Example –:
age(X, "20..25") Λ income(X, "30K..41K")buys ( X, "Laptop
Computer")
● Get affiliation rules via looking for gatherings of groups that happen
together.