HTCB Unit 3
HTCB Unit 3
Description:
Association rule mining is a technique used in data mining to find relationships between
variables in large databases. It identifies if-then rules that show how the occurrence of one
item is related to the occurrence of another item.
Application:
Retail: To find patterns like if a customer buys bread, they are likely to buy butter.
Example:
Confidence: 50% of the transactions that include diapers also include beer.
2. Mining Methods
Description:
Different algorithms and techniques are used to discover association rules, like Apriori, FP-
Growth, and Eclat.
Application:
Apriori: It starts with single items and extends them step by step.
Description:
These rules are discovered at multiple levels of abstraction, such as finding patterns at
category levels like electronics -> computers -> laptops.
Application:
Example:
Rule: If a customer buys electronics, they are likely to buy a computer, and if they buy a
computer, they might buy a laptop.
Diagram:
A tree structure where the root is a broad category (electronics) and branches to
subcategories (computers, laptops).
Description:
Association rules that involve more than one dimension or attribute, such as time, location,
or customer demographic.
Application:
Market Basket Analysis: To find patterns like customers buying certain products in certain
seasons or regions.
Example:
Rule: If a customer from New York buys a coat in December, they are likely to buy gloves.
Description:
Finding patterns where item sets have a correlation, not just co-occurrence.
Application:
Example:
Description:
Mining association rules with certain constraints to make the process efficient and
relevant, such as constraints on item sets’ size, support, or confidence.
Application:
Inventory Management: To find association rules that only involve items with high sales.
Example:
Description:
Finding association rules in databases and data warehouses that have multiple
dimensions and large volumes of data.
Application:
Business Intelligence: To analyze sales data across different dimensions like time,
geography, and product lines.
Example:
Rule: Sales of product X in the eastern region increase in the first quarter.
Diagram:
A cube representing a data warehouse with dimensions like time, location, and product.
8. Correlation Analysis
Description:
Analyzing how closely two variables are related, not just their co-occurrence.
Application:
Example:
Rule: There is a high correlation between the stock price of company A and the overall
market index.
9. Constraint-Based Association Mining (repeated, covered earlier)
Description:
Using specific criteria (constraints) to focus on the most relevant patterns in data mining.
Types of Constraints:
2. Rule Constraints: Define conditions for the rules (e.g., must include certain items).
1. Retail:
- Rule: If a customer buys gift wrap, they are likely to buy a greeting card (support > 5%,
confidence > 70%).
2. Healthcare:
- Rule: If a patient reports fatigue, they likely have anemia (confidence > 80%).
3. Finance:
- Rule: If a transaction happens at an ATM, another is likely within an hour (support > 3%,
confidence > 60%).
Application:
Example:
Diagram:
Description:
Mining not only traditional association rules but also negative association rules (items that
do not co-occur), temporal rules (time-based), etc.
Application:
Example: