1. Introduction to Data Mining
1. Introduction to Data Mining
Concepts and
Techniques
— Chapter 1 —
— Introduction —
Task-relevant Data
Data Cleaning
Data Integration
Databases
August 22, 2024 Data Mining: Concepts and Techniques 6
KDD Process
Step 1: Goal Identification
Defined
Goals
60% of
Data
Target
effort
Transactional Data
Database
Step 4: Data Transformation
Transformed
Flat
Data
File
Data
Model
Increasing potential
to support
business decisions End User
Decision
Making
Data Exploration
Statistical Summary, Querying, and Reporting
Database
Technology Statistics
Machine Visualization
Learning Data Mining
Pattern
Recognition Other
Algorithm Disciplines
◼ General functionality
◼ Descriptive data mining
◼ Predictive data mining
◼ Different views lead to different classifications
◼ Data view: Kinds of data to be mined
◼ Knowledge view: Kinds of knowledge to be discovered
◼ Method view: Kinds of techniques utilized
◼ Application view: Kinds of applications adapted
◼ Outlier analysis
◼ Outlier: Data object that does not comply with the general behavior
of the data
◼ Noise or exception? Useful in fraud detection, rare events analysis
◼ Periodicity analysis
◼ Similarity-based analysis
Pattern Evaluation
Knowl
Data Mining Engine edge-
Base
Database or Data
Warehouse Server