0% found this document useful (0 votes)
126 views66 pages

ICT616 Topic 01 - Introduction

The document provides an overview of the ICT616 Data Resources Management unit at Murdoch University. It discusses the unit aims, learning objectives, topics to be covered over the semester, assignments and assessment. The unit focuses on identifying, organizing, managing and leveraging organizational data as a strategic resource. It will examine issues in data governance, architecture, development, security and analytics. Students will complete two assignments, one involving a case study and one a data mining project, and there will be a final examination.

Uploaded by

Narayan Dahal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
126 views66 pages

ICT616 Topic 01 - Introduction

The document provides an overview of the ICT616 Data Resources Management unit at Murdoch University. It discusses the unit aims, learning objectives, topics to be covered over the semester, assignments and assessment. The unit focuses on identifying, organizing, managing and leveraging organizational data as a strategic resource. It will examine issues in data governance, architecture, development, security and analytics. Students will complete two assignments, one involving a case study and one a data mining project, and there will be a final examination.

Uploaded by

Narayan Dahal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 66

Is disability/medical conditions part of

your life?
Get appropriate support for exams/coursework
from
Equity and Social Inclusion
“I acknowledge that Murdoch University is situated
on the lands of the Whadjuk Noongar people.

I pay respect to their enduring and dynamic culture


and the leadership of Noongar elders both past and
present.

The boodjar (country) on which Murdoch University


is located has for thousands of years, been a place
of learning. We at Murdoch University are proud to
continue this long tradition.”
ICT616
Data
Resources
Management
Overview of the Unit
&
Topic 1: Data resource
management and the
intelligent enterprise
Coordinator and Tutor Contact
Details – Use Them!
Unit coordinator and Tutor:
•Despina Giannakaki
•Building 245 (Science and Computing) Room 1.012

Email for questions about the unit:


[email protected]
Unit outline

Data resources management views the information available in


organisational databases as a very significant strategic resource
Data and information are the lifeblood of the 21st century economy.
Organizations rely on their data assets to make more informed
and effective decisions.
Market leaders leverage their data assess to create competitive
advantages through greater knowledge of their customers,
innovative uses of information, and operational efficiencies.
Businesses use data to provide better products and services, cut
costs and control risks.
The unit focuses on those issues involved in managing and
strategically exploiting data, including discovery of trends and
patterns and new knowledge creation

ICT616 Data Resources Management Slide5


Unit aims
The broad aims of this unit are for you to develop a
deeper understanding of the nature of organisational
data resources and, more specifically, how these
resources can be:
-Identified
-Located
-Organised
-Situated
-Contextualised
-Manipulated
-Communicated
-Managed as a dynamic component of the organisation

ICT616 Data Resources Management Slide6


Learning Objectives
On successful completion of the unit you should be
able to:
-Demonstrate an understanding of the terminology,
principles, tools and techniques of data resources
management

-Describe and explain how organisations use data


resources management as a strategy for meeting
organisational goals

ICT616 Data Resources Management Slide7


Learning Objectives
On successful completion of the unit you should be
able to:
-Identify the data management issues that arise
from current trends in the field of Information
Technology.

-Review case studies and articles describing real-


world events, and situate these in the context of Data
Management to suggest the best practices to employ.

ICT616 Data Resources Management Slide8


How to study this unit….
Much of the onus for your development in this unit is
placed on YOU.
YOU are expected to prepare for the seminars by:
-Pre-reading
- The given readings
- Other relevant material
YOU are expected to participate in the seminars by
sharing your understanding of the topic that you have
gained:
-From your preparation
-From your practical experience

ICT616 Data Resources Management Slide9


How to study this unit….
The more you participate, and the better you prepare,
the more you will gain from the unit
ADDITIONALLY…
This is a Masters-level unit. This means:
- You are expected to demonstrate a deeper
understanding of the issues in the unit than would be
expected in an undergraduate unit

BEWARE:
- Trying to memorise the lecture slides is NOT a good
strategy in this unit.
ICT616 Data Resources Management Slide10
Organization of classes
As with other postgraduate courses, there is more
emphasis on collaboration and discussion.

-Group Assignment
-Participation component
-Workshop Vs Lecture

ICT616 Data Resources Management Slide11


Draft Topic Outline
Note that this is subject to change

Week Topic
Introduction
1
Data Management and Governance (Policies)
2
Data Architecture Management (Framework)
3
Data Development (Modeling and Solution Design)
4
5 Data Security Management
Data Warehousing
6
Big Data
7
Data Mining Overview
8
9 Intro to CRISP-DM and ASUM-DM (Start Using Rapid Miner)
Data Mining Methods
10
Data Mining Methods
11
Unit Review
12

ICT616 Data Resources Management Slide12


Assignments and examination

ICT616 Data Resources Management Slide13


Assignment 1 : Case Study for
Data Management in Healthcare
• Demonstrate your knowledge on developing and
maintaining an Enterprise Data Model, and conduct
research on its specific development and
maintenance in the healthcare field.
• Includes: analysis, data modelling, development .
• As this is an advanced unit, more emphasis will be
given to the consideration and discussion of wider
implications and issues facing the client
organization

ICT616 Data Resources Management Slide14


Assignment 2 :Project
The aim of the project is for you to conduct an
analysis of a real data set, and to document your
findings.
 Students will work in pairs for this assignment. Each
pair of students must undertake a project using
unique data.
Projects should be discussed with the unit coordinator
and a project outline submitted for approval before
the project commences. The tasks will follow the
format of the processes undertaken in the data
mining lectures/workshops. Therefore it is
important that you stay up to date with these.
ICT616 Data Resources Management Slide15
Assignment 2: Project Report

The format of your report will be a technical


report which will largely focus on the data
mining tasks and your findings. Therefore it
will not include a very large written/essay
component.

ICT616 Data Resources Management Slide16


Participation

Participation is assessed.
This will often include some discussion of material
illustrating the previous week’s topic. This may
include case studies, journal papers, or other
relevant material.

ICT616 Data Resources Management Slide17


Final Examination
The final examination will be held at the end of the
semester. If there are no COVID-19 restrictions, it
will be ‘closed book’ and be of two (2) hours
duration. Otherwise, it will be an open-book online
exam with longer duration.
 The examination consists of a series of questions that
address many of the topics in data resource
management that were covered in the unit.
The answers should address the key issues in each
question, considering breadth and integration of
themes, rather than an in-depth or highly technical
treatment.
Answers should be illustrated with suitable examples
drawn from the seminars, your project work, or your
professional career.

ICT616 Data Resources Management Slide18


Topic 1

Introduction to DRM

ICT616 Data Resources Management Slide19


Objectives
At the completion of this topic you should be able to:

Explain what is meant by Data Resource Management


(DRM)

Describe the data life-cycle and outline the implications of


this for DRM

Describe the relationship between Business Intelligence


Tools and DRM

Explain the importance of planning and integration in the


development of effective DRM strategies

ICT616 Data Resources Management Slide20


Topic outline

What are the following:


Data Resource Management?

Data life cycle?

The “Intelligent Enterprise”?

Data Resource Management Objectives?

ICT616 Data Resources Management Slide21


Data
Data is the representation of facts as text, numbers,
graphics, images, sound or video.
Information is data in context. Without context, data is
meaningless. We create meaningful information by
interpreting the context around data.

Context includes the business meaning of data elements,


the format in which data is presented,
the timeframe represented by the data and
the relevance of the data to a given usage

ICT616 Data Resources Management Slide22


Knowledge
Knowledge is information in perspective, integrated into a
viewpoint based on the recognition and interpretation of
patterns, such as trends, formed with other information and
experience.
It may include assumptions and theories about causes.
Knowledge may be explicit – what an enterprise or
community accepts as true – or tacit, inside the heads of
individuals.
We gain in knowledge when we understand the significance
of information.
Data -> Information -> Knowledge

ICT616 Data Resources Management Slide23


Meta-data
Meta-data is data that provides information about data.
It is to data what data is to real-life. Data reflects real life
transactions, events, objects, relationships, etc.
Meta-data reflects data transactions, events, objects,
relationships, etc.

A good analogy for meta-data is the card catalog in a library.


The card catalog identifies what books are stored in the library and
where they are located within the building. Users can search for books
by subject area, author or title. Without the card catalog resource, finding
books in the library would be difficult, time-consuming and frustrating.

ICT616 Data Resources Management Slide24


Is data truth?
Not necessarily!
Data can be inaccurate, incomplete, out of date, and
misunderstood.
On a practical level, truth is information of the highest
quality – data that is available, relevant, complete,
accurate, consistent, timely, usable, meaningful and
understood.
Organizations that recognize the value of data can take
concrete, proactive steps to increase the quality of data and
information.

ICT616 Data Resources Management Slide25


What is DRM?
“. . . the process of managing, controlling and
protecting an organisation’s data asset, while
supporting organisational goals”
Data Management Association (DAMA)

This makes the ‘data asset’ sound very well demarcated and defined.

Is this generally the case?

ICT616 Data Resources Management Slide26


What is DRM?
Data resources exist throughout the “modern”
organisation
They are considered to be assets as they have value
How might we measure the value of data?

• Both tangible and intangible


• Both current and future

ICT616 Data Resources Management Slide27


What is DRM?
Also, the data resource may have value added to it
through processes such as

• Relating
• Discovering/exploring
• Modelling

DRM may then be seen to be about maximising the


value associated with the organisation’s data
resources

This may take many forms!

ICT616 Data Resources Management Slide28


Some DRM issues to consider…

All organisations generate data

All data is useful in some way or another

Re-purposing of data is a must, not


something to talk about

Data has many perspectives and contexts

ICT616 Data Resources Management Slide29


Some DRM issues to consider…

Everyone has a view…

• “It’s a cost and should be outsourced…”


• “We have all this data but we get nothing from it…”
• “This data is incorrect…”
• “The reports take 4 hours to generate…”

ICT616 Data Resources Management Slide30


Some DRM issues to consider…
Data exist in heterogeneous formats…
• Agreement on common formats can be
problematic
• Conversion between formats can be lossy
• But must be able to be made!

…and in multiple locations


• Data islands
• Redundant data

ICT616 Data Resources Management Slide31


An issue caused by heterogeneous data formats

NASA lost a $125 million Mars orbiter because a


Lockheed Martin engineering team used English
units of measurement while the agency's team
used metric units.

Lockheed Martin provided navigation commands for


thrusters in English units although NASA uses the
metric system.

The thrusters on the spacecraft, which were intended to control its rate of
rotation, were controlled by a computer that underestimated the effect of the
thrusters by a factor of 4.45. The software was working in pounds force, while
the spacecraft expected figures in newtons; 1 pound force equals approximately
4.45 newtons.
Heterogeneous data sources
Data source Relational, flat file, web…
Data types Salaries stored as integer,
text?
Units Salaries stored per week,
per month?
Concepts Are retired employees still
‘employees’?
Data may not conform to Semi-structured information,
fixed schema e.g. spreadsheet

ICT616 Data Resources Management Slide33


Relational database
A database, structured to recognise relations between stored items
of information, e.g., the information is presented to the user through
a number of tables and there are relations between the data.

When data is saved as a flat file, then this is more or less what it is.
Everything is in the file. There is no index, no relations between the
data, so in order to check where something is, you need to read the
whole file. And this, of course takes too much time and gives very little
information.

ICT616 Data Resources Management Slide34


Example of structured and semi-structured data

If you have a semi-structured model, then all records have a unique ID, and
are referenced with pointers to their location on the disk. So if you want to do a
search in your database, then you need to go through all the pointers and this
is not efficient because it takes too much time. That’s why relational databases
are so popular.
Slide35
https://blog.hubspot.com/marketing/semi-structured-data
Some DRM issues to consider…
The data resource is used by different users for varying
uses
• CEO
• CFO
• End User
• Manager
• Data Manager
• Client
•…
…who will have different (and often conflicting)
requirements of the data resource

ICT616 Data Resources Management Slide36


Some DRM issues to consider…
The data resource needs to be readily available

• Unavailability of the data resource may be costly


• Think about the foreign exchange dealer who cannot
access current prices and misses out on 5 minutes
trading
• Availability may be outside the organisation’s control

…it is also important to realise that availability does not


always equal fulfilment of user requirements
e.g., Relevance

ICT616 Data Resources Management Slide37


For example
McCabe v British American Tobacco Services Ltd

In December 2002, the Victorian Court of Appeal


overturned an earlier decision of the Victorian
Supreme Court where Justice Geoffrey Eames had
struck out British American Tobacco's defence because
of the creation and implementation of a document
retention and destruction policy, which was held to
have prejudiced the Claimant's right to a fair trial

Web sources: Victoria Centre for Tobacco Control


See also: http://tobaccocontrol.bmj.com/cgi/reprint/11/3/271

ICT616 Data Resources Management Slide38


Some DRM issues to consider…
Repurposing/recycling of data:

• To maximise the value of the data resource it must


be able to be:

• Reused
• Repurposed

• This has implications for data modelling and design

ICT616 Data Resources Management Slide39


Some DRM issues…
DRM becomes even more important in the
context of the modern organisation:

• IT (and the data resource) are a potential source of


competitive/strategic advantage

• The data resource needs to be managed so that


changes in user and organisational needs for data can
be accommodated

• Organisations are required to be reactive to external


forces (economic, political, social, changed legislation.
E.g. SOX).
ICT616 Data Resources Management Slide40
The data life cycle
Data exists, and must be managed
beyond its initial creation

The way data is managed will depend on


what stage of its “life” it has reached

ICT616 Data Resources Management Slide41


The Data Life Cycle...
A view of the data life cycle is below:

Plan Specify Enable


Create&Acquire Maintain&Use
Archive&Retrieve Purge

All data lifecycle stages have associated costs and


risks, but only the “use” stage adds business value

The nature, use, and management of the data will


determine its life cycle

ICT616 Data Resources Management Slide42


The System Development Life
Cycle...
A view of the system development life cycle is below:

Plan Analyze Design


Build Test
Deploy Maintain

ICT616 Data Resources Management Slide43


Data Life Cycle vs SDLC

SDLC is not the same as the data lifecycle.


The SDLC describes the stages of a project, while
the data life cycle describes the processes
performed to manage data assets.
But: the two lifecycles are closely related, because
data planning, specification and enablement
activities are integral parts of the SDLC. Other SDLC
activities are operational or supervisory in nature.

ICT616 Data Resources Management Slide44


Data disposal???
“…
West Australian government agencies are too laissez faire
with the disposal of old computers, according to a report
by the WA Auditor General.
The Auditor General's office bought 19 second-hand PCs which
looked to be ex-government. Of those, 10 proved to be so. From
four of the 10 hard drives, the team was able to retrieve
information, some of it sensitive, including tax file numbers,
salary information, superannuation information, home addresses,
dates of birth, photos, personal e-mails, letters, resumes,
performance reviews, and contact details.
…” ZDNet Australia (1st April 2008)
http://www.zdnet.com/article/wa-govt-slammed-for-bogus-data-disposal-policy/
Retrieved February 5th, 2019

ICT616 Data Resources Management Slide45


Management of the data resource
through time
Management of the data resource from
procurement/acquisition to archiving

Approach can be:


Reactive – not ‘throwing anything away’
Proactive - making sure ‘islands of data’ are charted

Be aware of/minimise potential loss when


changing platform – conversion is never
lossless

ICT616 Data Resources Management Slide46


Issues for DRM in data life cycle
The data resource will continue to grow within
the organisation
Need for increased storage
Storage Resource Management
• Storage area networks (SANs)
• Networked attached storage (NAS)
• Hierarchical storage management (HSM)
• Direct attached storage (DAS)

ICT616 Data Resources Management Slide47


This Week’s Workshop + Short
Assignment for Next Week (1/2)
During the workshop, students will form groups of 3, to
find relevant material and understand the differences of
the four storage resource management approaches
(definition, usage, advantages/disadvantages). They
will discuss their findings with other groups in the class.
Storage Resource Management
• Storage area networks (SANs)
• Networked attached storage (NAS)
• Hierarchical storage management (HSM)
• Direct attached storage (DAS)

ICT616 Data Resources Management Slide48


This Week’s Workshop + Short
Assignment for Next Week (2/2)
Students need to provide a (maximum) 1-page report with
information regarding one of the four storage resource
management approaches (definition, usage,
advantages/disadvantages). Students can choose
whichever approach they prefer, to focus their report on.
This is a task to be completed individually by each
student.
Storage Resource Management
• Storage area networks (SANs)
• Networked attached storage (NAS)
• Hierarchical storage management (HSM)
• Direct attached storage (DAS)

ICT616 Data Resources Management Slide49


Issues for DRM in data life cycle…
Need for longer term management strategy

• Conservation strategies

• Making sure that data isn’t lost through neglect,


error or throwing aside

ICT616 Data Resources Management Slide50


Impact of the life cycle on data
management
The data lifecycle must be managed to ensure
accuracy and trustworthiness of the data:
• Processes must be implemented to support this
lifecycle to ensure the continued quality of the data
resource

• Alignment between these processes and the


strategic objectives of the organisation is a must at
both tactical and operational levels

ICT616 Data Resources Management Slide51


Accuracy and trustworthiness of
the data (UTS Example)

ICAC investigation into alteration of student records

 The Independent Commission Against Corruption


(ICAC) reported on its 2001 investigation into the
unauthorised use and alteration of student records

 ICAC found that a combination of technical deficiencies


and procedural weaknesses in the UTS student records
system led to the corrupt conduct it was investigating
http://icac.nsw.gov.au/documents/about-corruption/corrupt
ion-matters-newsletter/1274-corruption-matters-issue-no-2
1-september-2002/file
Retrieved February 5th, 2019

ICT616 Data Resources Management Slide52


Accuracy and trustworthiness
of the data…
Corruption and Crime Commission of Western
Australia
Report of an Inquiry into Unauthorised Access and
Disclosure of Confidential Personal Information Held on
the Electronic Databases of Public Sector Agencies
In September 2005 it undertook an inquiry to look at
aspects of:
• the legislative and policy framework for dealing
with unauthorised access and disclosure of
confidential personal information;
• arrangements for the selection and supervision of
staff with access to personal information of a
sensitive or confidential nature; and
• the awareness of staff of their responsibilities to
safeguard
ICT616 confidential personal information
Data Resources Management Slide53
Accuracy and trustworthiness of
the data…
The Commission observed:

“The exact extent of the problem of misuse of


computer systems through unauthorised access and
disclosure is not known and it is widely suspected that
a great deal goes undetected

The anecdotal advice of those working in this


area suggests that unauthorised access and disclosure
occurs a great deal more than is ever officially
reported or acted upon”

ICT616 Data Resources Management Slide54


Accuracy and trustworthiness of
the data
ACCC Investigates IELTS Test corruption at Curtin
University

A former Curtin University employee is among a group of


nine people whom the Corruption and Crime Commission
has charged with 59 bribery offences in April 2011.
The charges relate to the manipulation of the International
English Language Testing System (IELTS) conducted at
Curtin University’s English Language Centre over a 12
month period to June 2010.
https://www.ccc.wa.gov.au/sites/default/files/IELTS%20bribe
ry%20charges.pdf
Retrieved February 5th, 2019
ICT616 Data Resources Management Slide55
Accuracy and trustworthiness of
the data (Curtin Example)

The CCC investigation will also determine whether the


Curtin English Language Centre had policies in place
to detect misconduct and examine whether the
IELTS had been compromised at other testing
centres.

''Only in knowing precisely how the manipulation of


the IELTS testing was carried out and how it was
concealed for so long will it be possible to properly
examine those systemic issues'' (CCC Counsel).

ICT616 Data Resources Management Slide56


The “human element” of DRM
Much of the organisational data resource
resides in so-called “organisational memory”
• The memory of organisation
• Source of the tacit component of knowledge
• May be mined for rules and knowledge
bases – real source of business rules
• It is the most evanescent of corporate
assets (goes when people go)
• Yet most vital – minutes, reports,
documentation

ICT616 Data Resources Management Slide57


The “Intelligent Enterprise”
The intelligent enterprise makes use of its data
resource for competitive/strategic advantage
• Business Intelligence
• Most general definition is that of strategies that
help managers in decision making by
presenting/crunching/summarising data:

• Enterprising Resource Planning suites


• Customer Relationship Management (CRM)

ICT616 Data Resources Management Slide58


Business intelligence tools
Originally Decision Support Systems (DSS) and
more recently, Executive Information Systems
(EIS) -> initially conceived for upper
management
Now no clear distinction:
• Management roles less defined – all levels
need access to corporate data
• Move to client/server and Internet platform
• Help managers make decisions about
problems that may be quickly changing

ICT616 Data Resources Management Slide59


Business intelligence (Gartner)
Gartner classify BI projects into four
different styles:
• Departmental
• Enterprise
• B2B (Business to business)
• B2C (Business to consumer)

ICT616 Data Resources Management Slide60


Business intelligence
However, it is not just a matter of buying the latest
Business Intelligence Tool (BIT) and “plugging it in”

Intelligent use of the data resource must be planned and


integrated into the overall DRM strategy
• Possible disruption to existing systems (see next)
• Costly to implement
• Often difficult to justify
• Must be able to demonstrate benefit outweighs cost

ICT616 Data Resources Management Slide61


Business intelligence
Business-driven methodology and project management
Clear vision and planning
Committed management support & sponsorship
Data management and quality
Mapping solutions to user requirements
Performance considerations of the BI system
Robust and expandable framework

(Tarun K Vodapalli (2009). "Critical Success Factors of BI Implementation)

ICT616 Data Resources Management Slide62


So with all of these
issues in mind, how
may/should the data
resources be
managed?

ICT616 Data Resources Management Slide63


Data resources management:
Must involve all parts of IT discipline
Must be reflexive & holistic
Must be continuous
Must be agile and adaptive within the strategy
of the organisation
Must involve people
Must be driven by organisational need, rather
than IT

ICT616 Data Resources Management Slide64


What about the Real World?
Badly managed

Poorly understood as a
resource

Bottom up perspective – we have all this data


now what do we do ?

Disparate data sources, different systems all


storing the same data

What is the value of the data and how do you


“sell” it at different levels within organisations?
ICT616 Data Resources Management Slide65
Summary
Data resources management is about
conservation and exploitation of all the
resources that have come into an
organisation:
• Make sure that data isn’t lost through neglect, error
or throwing aside
• Recycle and repurpose existing data where possible
– make use of the structure and order that has been
created
• Design new tasks to value-add or recontextualise
existing data
• DRM must coexist with and extend existing systems
ICT616 Data Resources Management Slide66

You might also like