Data Knowledge
Data Knowledge
KNOWLEDGE
DATA DISCOVERY
IN INFORMATION
SYSTEM
Presented by:
Franksel P. Tindoc Jr.
Rogiefel G. Torres
Christian A. Elaurza
DATA AND KNOWLEDGE
DISCOVERY
DATA AND KNOWLEDGE
DISCOVERY
refers to the overall process of extracting valuable, non-
trivial, and actionable knowledge from large datasets. It
combines techniques from data mining, machine learning,
statistics, and database management to analyze data,
discover patterns, and generate insights.
Key characteristics:
Non-trivial extraction of implicit knowledge
Useful information from data
Potentially valuable insights
Improves decision-
making processes
Enhances competitive
WHY KDD
advantage
MATTERS? Supports data-driven
strategies
Facilitates innovation
and problem-solving
Relational databases
Data warehouses
Transactional data TYPES OF
Time-series data
Sequence data
DATA
Web data SOURCE
Social media data
Sensor data
ADVANTAGES OF USING
KDD
Improved Decision-Making
Automation and Efficiency
Enhanced Prediction and Forecasting
Cost Savings
Wide Applicability
Customizability
DISADVANTAGES OF
USING KDD
Data Quality Issues
Complexity in Technique Selection
Privacy and Security Concerns
Scalability and Performance
Interpretation of Results
Integration with Business Processes
Cost and Expertise
APPLICATIONS OF KDD
APPLICATIONS OF
KDD
Healthcare
Disease Prediction and Diagnosis
Drug Discovery
Patient Management
APPLICATIONS OF
KDD
Education
Student Performance Analysis
Adaptive Learning Systems
Curriculum Optimization
Manufacturing
Quality Control
Supply Chain Optimization
Predictive Maintenance
CONCLUSION
Industrial Models
Hybrid Models
ACADEMIC RESEARCH
MODELS
The efforts to establish a Knowledge Discovery
Process model were initiated in academia, in the
mid-1990s
There are two process models developed in 1996 &
1998:
Nine-step model by Fayyad et al
Eight-step model by Anand and Buchner
FAYYAD ET AL. NINE STEP
MODEL
1. Developing and understanding the
application domain
2. Creating a target data set
3. Data cleaning and pre-processing
4. Data reduction and projection
5. Choosing the data mining task
6. Choosing the data mining algorithm
7. Data mining
8. Interpreting mined patterns
9. Consolidating discovered knowledge
ANAND & BUCHNER EIGHT STEP
MODEL
1. Defining the Objective
2. Creating a Target Dataset
3. Data cleaning
4. Data Transformation
5. Data Reduction
6. Choosing the Appropriate Data Mining Task
7. Applying the Data Mining Algorithm
8. Interpreting and Evaluating Results
INDUSTRIAL MODEL
BUSINESS
HEALTHCARE
FINANCE
E-COMMERCE
EDUCATION