0% found this document useful (0 votes)
23 views8 pages

Dmbi Lab 7om

Uploaded by

Shubham Jha
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
23 views8 pages

Dmbi Lab 7om

Uploaded by

Shubham Jha
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

DMBI LAb 7

Name: Om Gaikwad

Class : D15A 16

Aim: To implement the Association Mining algorithm using tools like


Weka/RapidMiner.

THEORY:
Association mining, also known as association rule mining, is a data mining technique
used to discover interesting relationships, associations, or patterns within large
datasets. The primary goal is to uncover hidden correlations or dependencies between
different variables or items.

Association mining is commonly applied in various domains, including retail,


e-commerce, marketing, healthcare, and more. One of the classic examples is
market basket analysis, where the goal is to identify which items are frequently
purchased together by customers.

The process of association mining typically involves the following steps:

● Data Preparation: The dataset is preprocessed to ensure that it is in a


suitable format for analysis. This may involve tasks such as data
cleaning, filtering, and transformation.

● Frequent Itemset Generation: This step involves identifying sets of


items that frequently co-occur together in the dataset. The support
metric is often used to determine the frequency of itemsets. Frequent
itemsets are those whose support exceeds a specified threshold.

● Association Rule Generation: Once frequent itemsets are identified,


association rules are generated from them. An association rule typically
has the form A ➔ B, where A and B are sets of items. These rules
represent relationships between different sets of items in the dataset.

● Rule Evaluation: Generated rules are evaluated based on various metrics


such as support, confidence, lift, and conviction. These metrics help
assess the strength and significance of the discovered associations.
● Rule Pruning and Selection: Rules may be pruned based on predefined
criteria or selected based on their interestingness measures. This helps to
focus on the most relevant and actionable associations.
● Interpretation and Visualization: Finally, the discovered association rules
are interpreted and visualized to gain insights into the underlying
patterns or relationships in the data. Visualization techniques such as
scatter plots, heatmaps, and network graphs can be used to explore and
communicate the findings.

The Apriori algorithm is a classical algorithm in data mining and association rule
learning. It's used for discovering frequent itemsets within transaction databases, and
from those itemsets, association rules that meet specified criteria are derived. The
algorithm is based on the concept of "Apriori property," which states that if an itemset
is frequent, then all of its subsets must also be frequent.

​ Find frequent itemsets:

● Scan the database to find the support (frequency) of each item.

● Generate frequent itemsets (sets of items that appear together frequently)


by iteratively joining smaller frequent itemsets until no new frequent
itemsets can be found.

​ Generate association rules:

● From frequent itemsets, generate association rules by


partitioning itemsets into antecedents and consequents.

● Keep rules that meet a minimum confidence threshold (a measure of


how often the rule is correct).

​ Optimize and evaluate:

● Use optimizations like pruning to improve efficiency.

● Evaluate rules based on additional measures like lift or conviction.


Import Dataset -
CONCLUSION:
In RapidMiner, when no association is obtained using the Apriori method, it suggests that
there are no significant associations or frequent itemsets present in the dataset according
to the specified parameters and constraints. Association mining methods like Apriori aim
to uncover patterns or relationships among items in transactional data. However, the
absence of associations does not necessarily indicate a lack of interesting insights in the
dataset. It could be due to various reasons such as the dataset being sparse, the minimum
support or confidence thresholds being too high, or the data not containing meaningful
associations. It's essential to interpret the results in the context of the dataset and the
specific goals of the analysis. In cases where no associations are
found, alternative data mining techniques or further exploration of the data may be
warranted to extract valuable insights.

You might also like