0% found this document useful (0 votes)
4 views6 pages

Student Name: Srijan Dawn University Roll No: 11700222095 University Registration No: 221170110353 3 Year, 6 Semester Program Name

The report discusses the Apriori algorithm, a key technique in data mining for identifying frequent itemsets and generating association rules. It outlines the algorithm's working principle, including setting support and confidence thresholds, generating itemsets, and creating association rules, along with a practical example. The applications of the algorithm span various fields such as market basket analysis, fraud detection, and recommendation systems.

Uploaded by

Srijan Dawn
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views6 pages

Student Name: Srijan Dawn University Roll No: 11700222095 University Registration No: 221170110353 3 Year, 6 Semester Program Name

The report discusses the Apriori algorithm, a key technique in data mining for identifying frequent itemsets and generating association rules. It outlines the algorithm's working principle, including setting support and confidence thresholds, generating itemsets, and creating association rules, along with a practical example. The applications of the algorithm span various fields such as market basket analysis, fraud detection, and recommendation systems.

Uploaded by

Srijan Dawn
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Problem Statement for Content/ Exercise/ Assignment:

Write a Report on Working Principle of Apriori Algorithm.

Topic Genre: Apriori Algorithm


Module Name /No. : 3
Paper Name: Data Warehousing & Data Mining
Paper Code: PECIT – 602B

Student Name: Srijan Dawn


University Roll No: 11700222095
University Registration No: 221170110353

3rd Year, 6th Semester


Program Name: B. Tech. (IT)

Continuous Assessment – 2 (CA2)

Department of Information Technology

RCC Institute of Information Technology

Canal South Road, Beliaghata, Kolkata

Affiliated to Maulana Abul Kalam Azad University of Technology, W.B.


1. Abstract :
The Apriori algorithm is a fundamental technique
in data mining used for association rule learning.
It helps identify frequent itemsets in large datasets
and extract meaningful relationships between
them. This report explores the working principle
of the Apriori algorithm, provides a step-by-step
example, discusses its applications in real-world
scenarios, and highlights its significance in pattern
recognition and market basket analysis.

2. Introduction :
Data mining is a crucial field in computer science
that enables organizations to discover hidden
patterns in large datasets. The Apriori algorithm,
introduced by Rakesh Agrawal and Ramakrishnan
Srikant in 1994, is one of the most widely used
algorithms for discovering association rules. It
operates on the principle of identifying frequent
itemsets in a transactional database and generating
strong association rules. This report provides an
in-depth exploration of the Apriori algorithm’s
working mechanism and its applications.
3. Working Principle of Apriori
Algorithm :
The Apriori algorithm is based on the Apriori
property, which states that "if an itemset is
frequent, then all of its subsets must also be
frequent." The algorithm follows these steps:
i. Set Minimum Support and Confidence:
Define minimum support and confidence
thresholds to filter out less significant rules.
ii. Generate Frequent Itemsets:
a. Scan the dataset to determine the
frequency of individual items.
b. Generate candidate itemsets by combining
frequent items from the previous step.
c. Prune candidate itemsets using the
Apriori property.
d. Repeat until no more frequent itemsets are
found.
iii. Generate Association Rules:
a. Use frequent itemsets to generate
association rules.
b. Compute confidence for each rule.
c. Retain rules that meet the confidence
threshold.
4. Example :
Consider a transaction dataset –
Transaction ID Items Purchased
T1 Milk, Bread, Butter
T2 Bread, Butter
T3 Milk, Bread
T4 Milk, Butter
T5 Milk, Bread, Butter
Step 1: Generate Frequent 1-itemsets
 {Milk}: 4 occurrences
 {Bread}: 4 occurrences
 {Butter}: 3 occurrences
Step 2: Generate Frequent 2-itemsets
 {Milk, Bread}: 3 occurrences
 {Milk, Butter}: 3 occurrences
 {Bread, Butter}: 3 occurrences
Step 3: Generate Frequent 3-itemsets
 {Milk, Bread, Butter}: 2 occurrences
Step 4: Generate Association Rules
 Rule: {Milk} → {Bread} (Support = 3/5,
Confidence = 3/4)
 Rule: {Bread, Butter} → {Milk} (Support = 2/5,
Confidence = 2/3)
Only rules meeting the minimum support and
confidence thresholds are retained.

5. Application :
a) Market Basket Analysis: Identifying
frequently bought items together to improve
sales strategies.
b) Fraud Detection: Discovering unusual
patterns in financial transactions.
c) Medical Diagnosis: Finding associations
between symptoms and diseases.
d) Recommendation Systems: Providing
personalized suggestions based on user
preferences.
e) Web Usage Mining: Analyzing browsing
behavior to enhance user experience.

6. Conclusion :
The Apriori algorithm is a powerful technique in
data mining for discovering frequent itemsets and
association rules. By leveraging the Apriori
property, it efficiently reduces computational
complexity and improves pattern recognition. Its
applications extend across various domains,
including retail, healthcare, and finance, making it
an indispensable tool for extracting valuable
insights from large datasets.

7. References:
Han, J., Kamber, M., & Pei, J. (2011). Data Mining:
Concepts and Techniques. Elsevier.
https://www.geeksforgeeks.org/apriori-algorithm/

https://en.wikipedia.org/wiki/Apriori_algorithm

You might also like