Week 4
Chapter 3
5. Describe the purpose and benefits of data management.
Data management is the plan or procedure used to control, protect, and deliver data.
Benefits:
- Higher security
- Less legal liability
- Improved sales and marketing strategies
- Improved data governance

6. What is a relational database management system?
A relational database management system (RDBMS) provides access to data using a declarative language. A declarative language simplifies data access by requiring that users specify only what data they want, not how to retrieve it. Examples of declarative languages are Hypertext Markup Language (HTML) and Structured Query Language (SQL); SQL is the one used to query relational databases.

3.2 Centralized and Distributed Database Architectures

1. Describe the data life cycle.
The data life cycle is a model that illustrates the way data travel through an organization. The three principles of the data life cycle are:
- Principle of diminishing data value: the value of data diminishes as they age.
- Principle of 90/90 data use: most stored data, as much as 90 percent, is seldom accessed after 90 days.
- Principle of data context: the capability to capture, process, format, and distribute data.

3. What are the consequences of not cleaning ‘dirty data’?
- Poor judgment
- Increased legal liability
- Loss of business when sales opportunities are missed
- Inability to make proper business decisions
- Financial cost to the organization (the cost of preventing and correcting errors)

5. Discuss how data ownership and organizational policies affect the quality of an organization’s data.
Data quality suffers when:
- There is no accountability for the data.
- There are no policies and procedures.
- Data formats are inconsistent (unstructured).

3.3 Data Warehouses

5. What is an advantage of an enterprise data warehouse (EDW)?
- Refines data, eliminating redundant information and improving overall data quality
- Prevents volatility, so data are more stable
- Provides quick and easy access to data

7. What types of decisions can benefit from a data warehouse?
- Sales: calculating sales profitability
- Financial: providing real-time data for credit terms
- Marketing: measuring marketing program effectiveness
- Tactical decision making: marketing, forecasting, and sales
- Strategic decision making: forecasting and financing

3.4 Big Data Analytics and Data Discovery

1. Why are human expertise and judgment important to data analytics? Give an example.
Human expertise is needed to interpret the output of analytics. For example, in a hospital, data are interpreted by doctors.

4. How can manufacturers and health care benefit from data analytics?
1. Data analytics can be used in research and development processes.

8. What are the basic steps involved in text analytics?
1. Exploring: documents are explored, e.g., by counting words in a document collection.
2. Preprocessing: before its content is analyzed, text may need to be preprocessed to standardize it.
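
The two text analytics steps listed above can be sketched in a few lines of Python. This is only an illustration, not a prescribed method: the function names `preprocess` and `word_counts` and the sample documents are made up for the example, and "standardizing" is assumed here to mean lowercasing and stripping punctuation.

```python
import re
from collections import Counter


def preprocess(text):
    """Preprocessing step: standardize text by lowercasing it and
    keeping only alphabetic word tokens (punctuation is dropped)."""
    return re.findall(r"[a-z']+", text.lower())


def word_counts(documents):
    """Exploring step: count how often each word appears across
    a collection of documents."""
    counts = Counter()
    for doc in documents:
        counts.update(preprocess(doc))
    return counts


# Hypothetical document collection (e.g., customer comments).
docs = [
    "The drive-through was slow.",
    "Slow service at the drive-through!",
]
counts = word_counts(docs)
print(counts.most_common(3))
```

Standardizing before counting matters: without the lowercasing step, "Slow" and "slow" would be counted as two different words.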
Competitive: it consolidates reports from internal and external sources, which helps managers make sound decisions about the future of the business so that it remains competitive in the market.

Business Cases

Case 3.2 Business Case: Big Data Analytics is the “Magic Sauce” for Revitalizing McDonald’s

1. Explain McDonald’s mission and responsibilities.
- Mission: to move from being product oriented to being customer oriented
- Responsibilities: retain its loyal customers and attract new customers

2. What limitation did McDonald’s face in gaining data that was meaningful to decision making?
McDonald’s relied on average-based metrics. The limitation was solved by changing to trend analytics.

3. Describe trend analytics.
Trend analytics uses historical data collected over a period of time to predict future trends. McDonald’s has analyzed and visualized the flaws in individual stores.

4. Is McDonald’s product oriented or customer oriented?
Customer oriented, through the provision of Wi-Fi, which has helped it retain market share in the fast-food industry.

5. Why is the ability to identify patterns and relationships critical to McDonald’s operations?
It helps identify issues that have a negative impact on customers, leading, for example, to improvements in the drive-through experience and breakfast menus.
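
To make "using historical data to predict future trends" concrete, here is a minimal sketch that fits a straight line to past values and extends it one period ahead. It assumes a simple least-squares linear trend; the case does not say which method McDonald’s actually used, and the function names and sales figures are invented for illustration.

```python
def linear_trend(values):
    """Fit y = intercept + slope * t by least squares,
    where t = 0, 1, 2, ... indexes the time periods.
    Requires at least two values."""
    n = len(values)
    t_mean = (n - 1) / 2
    y_mean = sum(values) / n
    cov = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(values))
    var = sum((t - t_mean) ** 2 for t in range(n))
    slope = cov / var
    intercept = y_mean - slope * t_mean
    return slope, intercept


def forecast(values, periods_ahead=1):
    """Extend the fitted trend line past the last observed period."""
    slope, intercept = linear_trend(values)
    return intercept + slope * (len(values) - 1 + periods_ahead)


# Hypothetical weekly sales for one store (made-up numbers).
weekly_sales = [100.0, 110.0, 120.0, 130.0]
print(forecast(weekly_sales))
```

Unlike an average over all four weeks, the trend line captures the direction in which sales are moving, which is the limitation of average-based metrics noted in question 2.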
Summary
4.1 Data Management and Database Technologies
Data management manages the flow of data.
Objectives of data management include maintaining data accuracy, ensuring data security, and meeting legal requirements.
Goal of data management - transform raw data into usable information.
Benefits of data management - higher security, greater compliance, improved data governance.
Costs of data management - difficult and expensive because data are scattered, data volume keeps increasing, and data are redundant.
Data management technologies include databases, data warehouses, data marts, and business intelligence.
Databases store data.
Data warehouses - integrated collections of data.
Data marts - small-scale data warehouses that hold subsets of data.
Data centers - facilities containing components that deliver data.
Data mining - searching the data in a data warehouse.
Data dictionary - directory containing metadata.
Metadata - data that give information about other data.
Business intelligence - processes data and performs statistical analysis.
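
As a small illustration of metadata and a data dictionary, the sketch below asks a database to describe one of its own tables. It uses Python's built-in `sqlite3` module; the `customer` table and its columns are invented for the example, and a real data dictionary would normally hold more than this (descriptions, owners, allowed values).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT NOT NULL, city TEXT)"
)

# PRAGMA table_info returns one row per column:
# (cid, name, type, notnull, dflt_value, pk).
# These rows are metadata: data about the table, not the customers themselves.
data_dictionary = [
    {
        "column": name,
        "type": col_type,
        "required": bool(notnull),
        "primary_key": bool(pk),
    }
    for _cid, name, col_type, notnull, _default, pk in conn.execute(
        "PRAGMA table_info(customer)"
    )
]

for entry in data_dictionary:
    print(entry)
```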
A DBMS is software used to manage and update data.
DBMS functions include:
- data filtering: inspect data for errors
- data integrity: correct and verify the consistency of data
- data access: provide authorized access to data
- data synchronization: integrate, match, or link data
- data security: check and control data integrity
Stand-alone systems result in data redundancy, inconsistency, and isolation.
Database management systems help solve the data problems of file-based systems.
Advantages of NoSQL - high performance, greater flexibility, easy distribution of data.
Relational databases store data in rows and columns.
Relational database management systems provide access to data using a declarative language.
Online transaction processing (OLTP) systems - manage transaction data, which are volatile.
Online analytical processing (OLAP) systems - a means of organizing large business databases.
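
The idea of a declarative language can be seen in a short SQL query run through Python's built-in `sqlite3` module. This is a sketch only: the `orders` table and its rows are made up. The query states what data is wanted (the total amount per customer) and leaves it to the DBMS to decide how to scan and group the rows.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# A relational table: data stored in rows and columns.
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer, amount) VALUES (?, ?)",
    [("Ann", 20.0), ("Ben", 35.5), ("Ann", 12.5)],
)

# Declarative: the query says WHAT is wanted, not HOW to compute it.
rows = conn.execute(
    "SELECT customer, SUM(amount) FROM orders "
    "GROUP BY customer ORDER BY customer"
).fetchall()
print(rows)
```

Writing the same thing procedurally would mean looping over every row and keeping running totals by hand; with SQL that work is left to the database.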
4.2 Centralized and Distributed Database Architectures
The two types of databases are centralized and distributed.
A centralized database stores data on a single central computer.
A distributed database stores portions of data on multiple computers (Figures 3.6 and 3.7, p. 74).
Benefits of a centralized database are better control over data and better IT security.
A benefit of a distributed database is that it allows both local and remote access.
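
The difference between the two architectures can be sketched with plain Python dictionaries standing in for computers. This is a toy model under stated assumptions: the class names are hypothetical, and assigning records to nodes by a hash of the key is just one possible scheme (a real distributed database might instead place data by region or department).

```python
import zlib


class CentralizedDB:
    """All data live in one store on a single central computer."""

    def __init__(self):
        self.store = {}

    def put(self, key, value):
        self.store[key] = value

    def get(self, key):
        return self.store[key]


class DistributedDB:
    """Portions of the data live on several computers (nodes)."""

    def __init__(self, node_count):
        self.nodes = [{} for _ in range(node_count)]

    def _node_for(self, key):
        # Assumption: pick the node from a stable hash of the key.
        index = zlib.crc32(key.encode("utf-8")) % len(self.nodes)
        return self.nodes[index]

    def put(self, key, value):
        self._node_for(key)[key] = value

    def get(self, key):
        return self._node_for(key)[key]


central = CentralizedDB()
distributed = DistributedDB(node_count=3)
for customer_id in ["c1", "c2", "c3", "c4", "c5"]:
    central.put(customer_id, {"id": customer_id})
    distributed.put(customer_id, {"id": customer_id})

print(len(central.store))                       # all records in one place
print([len(node) for node in distributed.nodes])  # records spread over nodes
```

With one store, control and security are applied in a single place; with several nodes, each holds only a portion of the data, so every lookup must first work out which node to ask.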
Types of databases :
Benefits of BI
- Enhances decision making
- Competitive advantage
- Unites data
Challenges of BI
- Data quality
- Alignment of BI strategy with business strategy
Aligning BI strategy with business strategy (7 steps)
1. Start with a roadmap
2. Understand line-of-business needs
3. Establish metrics to track performance
4. Understand the data and application environment
5. Create an analytics committee
6. Allow open communication
7. Hire a chief data officer
ERM - know the benefits and disadvantages