ICT616 Topic 01 - Introduction
your life?
Get appropriate support for exams/coursework from Equity and Social Inclusion
“I acknowledge that Murdoch University is situated
on the lands of the Whadjuk Noongar people.
BEWARE:
- Trying to memorise the lecture slides is NOT a good
strategy in this unit.
ICT616 Data Resources Management Slide10
Organization of classes
As with other postgraduate courses, there is more
emphasis on collaboration and discussion.
-Group Assignment
-Participation component
-Workshop Vs Lecture
Week Topic
1 Introduction
2 Data Management and Governance (Policies)
3 Data Architecture Management (Framework)
4 Data Development (Modeling and Solution Design)
5 Data Security Management
6 Data Warehousing
7 Big Data
8 Data Mining Overview
9 Intro to CRISP-DM and ASUM-DM (Start Using Rapid Miner)
10 Data Mining Methods
11 Data Mining Methods
12 Unit Review
Participation is assessed.
This will often include some discussion of material
illustrating the previous week’s topic. This may
include case studies, journal papers, or other
relevant material.
Introduction to DRM
This makes the ‘data asset’ sound very well demarcated and defined.
• Relating
• Discovering/exploring
• Modelling
The thrusters on the spacecraft, which were intended to control its rate of
rotation, were controlled by a computer that underestimated the effect of the
thrusters by a factor of 4.45. The software was working in pounds force, while
the spacecraft expected figures in newtons; 1 pound force equals approximately
4.45 newtons.
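The size of that mismatch can be checked with a few lines of arithmetic. This is a minimal sketch, not the spacecraft's software: the function name `lbf_to_newtons` and the reading of 10.0 are made up for illustration, and only the conversion constant is a fixed fact.

```python
# 1 pound force expressed in newtons (exact by definition).
LBF_TO_NEWTON = 4.4482216152605


def lbf_to_newtons(force_lbf: float) -> float:
    """Convert a force from pounds force to newtons."""
    return force_lbf * LBF_TO_NEWTON


# Hypothetical thruster figure produced by software working in pounds force.
reported_lbf = 10.0

# A consumer that assumes newtons takes the number at face value.
misread_as_newtons = reported_lbf

# The force that was actually meant, once the unit is converted.
actual_newtons = lbf_to_newtons(reported_lbf)

# The ratio between the two is the factor by which the effect is underestimated.
print(round(actual_newtons / misread_as_newtons, 2))
```

The number itself was never wrong; the unit it was recorded in was simply not stored with it.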
Heterogeneous data sources
Data source: Relational, flat file, web…
Data types: Salaries stored as integer, text?
Units: Salaries stored per week, per month?
Concepts: Are retired employees still ‘employees’?
Data may not conform to fixed schema: Semi-structured information, e.g. spreadsheet
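The data type and unit issues listed above can be made concrete with a small sketch. Everything here is an assumption made for illustration: the `SalaryRecord` class, the two sample payroll records, and the choice of 52 weeks and 12 months per year are not from the unit material.

```python
from dataclasses import dataclass

WEEKS_PER_YEAR = 52    # assumption: simple 52-week year
MONTHS_PER_YEAR = 12


@dataclass
class SalaryRecord:
    value: object   # may arrive as an integer or as text such as "4,500"
    period: str     # "week" or "month"


def annual_salary(record: SalaryRecord) -> float:
    """Bring a salary from any source to one type (float) and one unit (per year)."""
    raw = record.value
    if isinstance(raw, str):
        # Text salaries may carry thousands separators or stray spaces.
        raw = raw.replace(",", "").strip()
    amount = float(raw)
    if record.period == "week":
        return amount * WEEKS_PER_YEAR
    if record.period == "month":
        return amount * MONTHS_PER_YEAR
    raise ValueError(f"unknown pay period: {record.period!r}")


# Two hypothetical sources describing salaries in different ways.
payroll_a = SalaryRecord(value=1000, period="week")       # integer, weekly
payroll_b = SalaryRecord(value="4,500", period="month")   # text, monthly

print(annual_salary(payroll_a), annual_salary(payroll_b))
```

Only after both records are in the same type and unit can they be compared; the raw values 1000 and "4,500" say nothing about which salary is higher.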
When data is saved as a flat file, the file is all there is: everything is
in the file, with no index and no relations between the data. To find
where something is, you need to read the whole file, which takes too
much time and gives very little information.
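A minimal sketch of what "read the whole file" means in practice. The file contents, the column names, and the helper `find_by_name` are invented for illustration; the flat file is held in a string so the example runs without touching the disk.

```python
import csv
import io

# Hypothetical flat file: every fact lives in the rows, with no index.
FLAT_FILE = """id,name,department
1,Alice,Finance
2,Bob,IT
3,Chandra,IT
"""


def find_by_name(flat_file_text, name):
    """Scan rows from the top until the name is found; count the rows read."""
    rows_read = 0
    for row in csv.DictReader(io.StringIO(flat_file_text)):
        rows_read += 1
        if row["name"] == name:
            return row, rows_read
    # Not found: every row had to be read to be sure.
    return None, rows_read


match, rows_read = find_by_name(FLAT_FILE, "Chandra")
print(match, rows_read)
```

With three rows the cost is invisible, but the number of rows read grows with the size of the file, and a search for a name that is not there always reads all of them.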
In a semi-structured model, every record has a unique ID and is referenced
by a pointer to its location on the disk. To search the database, you need to
go through all the pointers, which is inefficient because it takes too much
time. That is why relational databases are so popular.
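The pointer idea can be sketched as follows. This is an illustration under stated assumptions, not how any particular product stores data: the records, the in-memory `storage` buffer standing in for the disk, and the helpers `get_by_id` and `find_by_field` are all hypothetical.

```python
import io
import json

# Hypothetical semi-structured records: same ID field, differing other fields.
records = [
    {"id": "r1", "name": "Alice", "role": "CFO"},
    {"id": "r2", "name": "Bob"},
    {"id": "r3", "name": "Chandra", "role": "Manager", "site": "Perth"},
]

# An in-memory byte buffer stands in for the disk.
storage = io.BytesIO()
pointers = {}  # unique ID -> byte offset of the record in storage
for record in records:
    pointers[record["id"]] = storage.tell()
    storage.write((json.dumps(record) + "\n").encode("utf-8"))


def read_at(offset):
    """Follow one pointer: jump to the offset and decode the record there."""
    storage.seek(offset)
    return json.loads(storage.readline().decode("utf-8"))


def get_by_id(record_id):
    """Fetch a record whose ID is already known: one pointer is followed."""
    return read_at(pointers[record_id])


def find_by_field(field, value):
    """Search on content: every pointer has to be followed and checked."""
    followed = 0
    hits = []
    for offset in pointers.values():
        followed += 1
        record = read_at(offset)
        if record.get(field) == value:
            hits.append(record)
    return hits, followed


print(get_by_id("r2"))
print(find_by_field("role", "Manager"))
```

In this sketch a record is cheap to fetch when its ID is known, while a search on the contents follows every pointer, which is the cost the paragraph above describes.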
https://blog.hubspot.com/marketing/semi-structured-data
Some DRM issues to consider…
The data resource is used by different users for different
purposes
• CEO
• CFO
• End User
• Manager
• Data Manager
• Client
•…
…who will have different (and often conflicting)
requirements of the data resource
• Reused
• Repurposed
• Conservation strategies
Poorly understood as a
resource