DWDM Unit 3
DWDM Unit 3
Mining
5. Decision Support: The insights gained from data mining help in making
strategic business decisions.
Example:
E-commerce companies like Amazon analyze millions of customer
transactions daily to identify purchasing patterns and recommend products.
Example:
In the healthcare industry, data mining is used to identify risk factors for
diseases by analyzing patient history, genetic data, and environmental
Example:
Banks use data mining to analyze customer transactions and detect
fraudulent activities in real time.
Example:
Netflix uses data mining to analyze user viewing patterns and recommend
personalized content to its subscribers.
Example:
Example:
Retailers use data mining to predict seasonal sales trends and adjust stock
levels accordingly.
Example:
Example:
Social media platforms like Facebook and Instagram use data mining to
analyze user behavior and optimize ad placements, generating more
revenue.
Example:
Example:
KDD is not just about data mining; it is a broader process that includes data
preprocessing, transformation, and interpretation of results.
1. Data Selection
3. Data Transformation
4. Data Mining
1. Data Selection
This is the first step, where relevant data is chosen from various sources
such as databases, data warehouses, web data, or sensor logs.
Objectives:
Identify the most relevant attributes (features) required for analysis.
Example:
In a retail business, sales records from the last five years might be selected
from a database for customer purchasing behavior analysis.
Key Tasks:
Data Cleaning: Handling missing values, removing duplicate records,
and correcting errors.
Example:
Filling missing age values in a customer database using the average of
available values.
Techniques Used:
Normalization: Scaling numerical values to a common range (e.g.,
between 0 and 1).
Example:
Converting salary figures into standardized values (e.g., converting
rupees into dollars).
4. Data Mining
This is the core step of the KDD process, where data mining algorithms are
applied to extract patterns, trends, and insights.
Example:
A bank uses classification to predict whether a loan applicant is likely to
default.
Key Aspects:
Filtering out patterns that are statistically insignificant or irrelevant.
Example:
A healthcare provider analyzes mined data to identify key factors
leading to heart disease and takes preventive actions.
1. It all starts when the user puts up certain data mining requests, these
requests are then sent to data mining engines for pattern evaluation.
2. These applications try to find the solution to the query using the already
present database.
3. The metadata then extracted is sent for proper analysis to the data
mining engine which sometimes interacts with pattern evaluation
modules to determine the result.
2. Database Server: The database server contains the actual data ready to
be processed. It performs the task of handling data retrieval as per the
request of the user.
3. Data Mining Engine: It is one of the core components of the data mining
architecture that performs all kinds of data mining techniques like
association, classification, characterization, clustering, prediction, etc.
5. Graphic User Interface: Since the user cannot fully understand the
complexity of the data mining process so graphical user interface helps
the user to communicate effectively with the data mining system.
Lack of security could also put the data at huge risk, as the data may
contain private customer details.
7. group by T.cust_ID