Topic 1 Etw3482
Topic 1 Etw3482
Wisdom
Knowledge
Information
Data
Erl, T., Khattak, W., & Buhler, P. (2016). Big data fundamentals: Concepts, drivers & techniques. Boston:
Prentice Hall, ServiceTech Press.
Data Mining
• A synonym for “Knowledge Discovery From Data” or KDD.
• An interdisciplinary subject of computer science and statistics.
• It contains the knowledge discovery steps.
• The process of discovering interesting pattern and knowledge
from large amounts of data.
• Consists of many analytics methods, but they can be categorised
into two broad categories:
o Pattern Discovery
o Predictive Modelling.
What types of data can be mined?
• Data that are meaningful for the application.
• Basic forms of data are database data, data warehouse
data and transactional data.
• Other forms of data, like data streams, sequence data,
graph or networked data, spatial data, text data, etc.
DIKW Pyramid in Business Architecture
Wisdom
Strategic Judgement
(CSFs) (Constraints)
Tactical Knowledge Action
(KPIs) (Adjustments)
Operational
Information Experience
(PIs/Metrics) (Results)
Data
Events
Erl, T., Khattak, W., & Buhler, P. (2016). Big data fundamentals: Concepts, drivers & techniques. Boston:
Prentice Hall, ServiceTech Press.
Lesson Summary
• DIKW pyramid
• Data mining is a knowledge discovery
process
• There are two groups of analytics
methods in data mining: pattern discovery
and predictive modelling.
Why is data mining becoming popular?
Topic 1
Introduction to Data Mining
How to better target product/service Profiling and segmentation. Customer behaviours and needs by
offers? segment.
Which product/service to recommend? Cross-sell and up-sell. Probable customer purchases.
How to grow and maintain valuable Acquisition and retention. Customer preferences and purchase
customers? patterns.
How to direct the right offer to the right Campaign management. The success of customer
person at the right time? communications.
How to minimise operational disruptions Asset maintenance The real drivers of asset or equipment
and maintenance costs? failure
How to decrease fraud losses and lower Fraud management and Unknown fraud cases and future risks.
false positives? cybersecurity
Data Mining Process
Which data are relevant?
Data Mining
“What will happen?”
Complexity
Lesson Summary
Operationalising Analytics