ANL303 - Week - 1 - Jan 2023 Includes Course Overview
• There will be hands-on exercises using IBM SPSS Modeler. Please ensure that you have installed the software.
Course Structure & Assessment
Six (6) weekly seminars of three (3) hour duration
[Figure: course assessment weightings, OCAS 50% and OES 50%]
• Course Assessment:
– OCAS
• 3 Quizzes (6%), including 1 compulsory Pre-Course Quiz
• 1 Tutor-Marked Assignment (18%)
• 1 Group-Based Assignment (20%)
• Participation (6%)
– OES
• Exam (50%)
• Attendance
– CET students are to mark attendance before each class/session via the QR code, which will be sent to CET students’ email by CCPE.
Course Assessment
• Quizzes
– Pre-Course Quiz 1 (PCOQ1) (2%)
• Course coverage: Units 1 & 2
• Must achieve 60 marks to remain in course
– Pre-Class Quiz 1 (PCQ01, 2%)
• Course coverage: Units 3 & 4
– Pre-Class Quiz 2 (PCQ02, 2%)
• Course coverage: Units 5 & 6
Assessment components
Tutor-Marked Assignment (TMA) (18%)
Group-Based Assignment (GBA) (20%)
Participation (6%)
Exam (50%)
Unit 1 Overview & Activities
Study Unit 1
Overview of Data Mining
Key Learning Objectives for this unit include:
What is Data?
[Figure: the Data, Information, Knowledge, Wisdom (DIKW) hierarchy as presented by Rowley (2007), with Data at the base, then Information, then Knowledge, and Wisdom at the top]
• Data by themselves have no meaning because they are without context and interpretation, so they do not support decision-making.
• Data mining is the transformation of data into information for decision-making.
Rowley, J. (2007). The wisdom hierarchy: Representations of the DIKW hierarchy. Journal of Information Science, 33(2), 163-180.
How to increase the sales of this book?
Descriptive Data Mining
• Focuses on what has already happened in the past
• Explores patterns and relationships that may exist in data
Association rule mining can be used.
E.g., Item A and Item B are usually purchased by customers at the same time.
(to be discussed in Study Unit 4)
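The statement that two items "are usually purchased at the same time" can be quantified with two standard measures: support (how often the items appear together) and confidence (how often the second item appears when the first one does). The following is a minimal sketch in Python, not the IBM SPSS Modeler workflow used in this course. The baskets and the helper names `count`, `support` and `confidence` are hypothetical and invented for illustration.

```python
# Hypothetical shopping baskets, invented for illustration only.
baskets = [
    {"chips", "bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"chips", "milk"},
    {"bread", "milk"},
]

def count(itemset, transactions):
    """Number of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t)

def support(itemset, transactions):
    """Fraction of all transactions that contain the whole itemset."""
    return count(itemset, transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Among transactions containing the antecedent, the fraction that
    also contain the consequent."""
    both = count(set(antecedent) | set(consequent), transactions)
    return both / count(antecedent, transactions)

print("support(bread, milk) =", support({"bread", "milk"}, baskets))
print("confidence(bread -> milk) =", confidence({"bread"}, {"milk"}, baskets))
```

In these made-up baskets, bread and milk appear together in 3 of 5 baskets, and milk appears in 3 of the 4 baskets that contain bread.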
Cluster analysis can be used.
E.g., customers can be grouped by age and income.
(to be discussed in Study Unit 4)
Descriptive Data Mining
Examples of association analysis and clustering
• Clustering can be used to group customers based on their similarities in terms of age and income
• Association analysis can be used to identify the relationship among items purchased by the customers

Customer  Age  Income  Chips  Bread  Milk  Butter
Amy            4000
Ben            2500
Cindy          1500
David     40   4500
Evan           2800
Flora          7000
Gloria    45   6000
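Grouping customers by similarity in age and income can be sketched with k-means, one common clustering algorithm. The slides do not say which algorithm is used, so treat this choice as an assumption. The customers `P1` to `P6`, their values, and the choice of initial centroids are hypothetical. The two fields are rescaled first so that income, which has much larger numbers, does not dominate age.

```python
# Hypothetical (age, monthly income) pairs, invented for illustration only.
customers = {
    "P1": (25, 2000),
    "P2": (28, 2400),
    "P3": (30, 2200),
    "P4": (45, 6000),
    "P5": (50, 6500),
    "P6": (48, 7000),
}

def min_max_scale(points):
    """Rescale each dimension to [0, 1] so income does not dominate age."""
    dims = list(zip(*points))
    lows = [min(d) for d in dims]
    highs = [max(d) for d in dims]
    return [
        tuple((v - lo) / (hi - lo) for v, lo, hi in zip(p, lows, highs))
        for p in points
    ]

def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def k_means(points, centroids, iterations=10):
    """Plain k-means: assign each point to its nearest centroid, then move
    each centroid to the mean of its assigned points, until stable."""
    labels = []
    for _ in range(iterations):
        labels = [
            min(range(len(centroids)), key=lambda c: sq_dist(p, centroids[c]))
            for p in points
        ]
        new_centroids = []
        for c in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(d) / len(members) for d in zip(*members))
                )
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return labels

names = list(customers)
scaled = min_max_scale(list(customers.values()))
# Start from the first and last customer as initial centroids (an arbitrary choice).
labels = k_means(scaled, [scaled[0], scaled[-1]])
for name, label in zip(names, labels):
    print(name, "-> cluster", label)
```

With this made-up data the younger, lower-income customers fall into one cluster and the older, higher-income customers into the other. Real data rarely separates this cleanly, and results can depend on the starting centroids.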
Predictive Data Mining
Decision trees can be used
(to be discussed in Study Unit 5)
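A decision tree predicts an outcome by asking a sequence of threshold questions about the input fields. As a rough illustration, the sketch below learns a one-level tree (a single split) by picking the threshold that misclassifies the fewest records. This is a simplification: tree algorithms in practice usually choose splits with impurity measures and build several levels. The records, the `responded` outcome, and the function names are hypothetical.

```python
# Hypothetical records: (monthly income, responded to the offer?),
# invented for illustration only.
records = [
    (1800, False),
    (2200, False),
    (3000, False),
    (3900, True),
    (5200, True),
    (6100, True),
]

def best_split(rows):
    """Try a threshold midway between each pair of adjacent values and keep
    the one with the fewest errors under the rule
    'predict True when value > threshold'."""
    values = sorted({v for v, _ in rows})
    best_threshold, best_errors = None, len(rows) + 1
    for lo, hi in zip(values, values[1:]):
        threshold = (lo + hi) / 2
        errors = sum((v > threshold) != label for v, label in rows)
        if errors < best_errors:
            best_threshold, best_errors = threshold, errors
    return best_threshold, best_errors

def predict(value, threshold):
    """Apply the learned one-level rule to a new value."""
    return value > threshold

threshold, errors = best_split(records)
print("split at income >", threshold, "with", errors, "training errors")
print("prediction for income 5000:", predict(5000, threshold))
```

The learned rule is a prediction for new, unseen customers, which is what distinguishes predictive from descriptive data mining.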
Summary
• Descriptive data mining:
– Summarisation
– Association
– Clustering
• Predictive data mining:
– Classification
– Estimation
Some data mining techniques can do one or more of these…
Data Mining Process
[Figure: the data mining process, with three stages: problem definition, data quality assessment, and data mining technique evaluation]
Problem Definition
1. Identification of a business problem
In this example, what is the business problem?
Data Quality Assessment
1. Collection of data
In this example, what data will you collect?
Data Mining Technique Evaluation
1. Identification of appropriate data mining techniques
2. Construction of models
Data Mining Applications
Examples:
1. Customer Relationship Management
2. Credit Scoring
3. Fraud Detection
4. Retailing
Customer Relationship Management
[Figure: customer value (profit or loss) over time]
• Better acquisition: Clustering, e.g., market segmentation for target marketing
• Better cross-/up-selling: Association, e.g., identify products that are usually purchased together by customers
• Better retention: Predictive modelling, e.g., predict customer churn
Credit Scoring
• Predictive modelling can be used to:
– Identify factors related to at-risk customers
– Assess the risk of granting a loan to an applicant, based on the characteristics of that applicant
Fraud Detection
• Predictive modelling can be used to:
– Identify suspicious cases that may warrant further investigation
Retailing
• Data mining can be used to:
– Analyse buying patterns of customers
Advantages and Disadvantages of Data Mining
Advantages of Data Mining
• Provides a range of powerful analytical tools for organisations to outperform their competitors
• Transforms large amounts of data into insights for better decision making
• Can be applied in many sectors such as banking and finance, manufacturing, marketing and retail
Disadvantages of Data Mining
• The quality of data mining results and applications depends on the availability and quality of data
• Data mining is not perfect and acting on wrongly discovered or “random” patterns can have consequences
• Successful data mining requires users to be knowledgeable in the business domain and data mining tools
• IT expertise is also necessary for extraction and preparation of data, as well as model deployment
Case Discussion (30 mins)
Case Discussion: Tammy, the product manager
Background
• July 2021: The telco market in Singapore is intensely competitive. With the entrance of new digital-only operators, the full-service incumbents are feeling the pressure. With average revenue per user (ARPU) falling due to competition, Tammy, the product manager of one of the incumbent telcos, is considering offering a new plan in Q4 of 2021. The idea is to bundle unlimited outgoing local calls with unlimited data and then promote it to customers to prevent them from churning.
Your task
*Please remember to write down the names of all group members in your post
End of Study Unit 1
See you next week!