Mis Unit 3

This document provides an overview of data management and database concepts. It discusses the need for effective data management to support business decision making and regulatory compliance. It describes best practices for data management including simplifying access to data, scrubbing data for quality, and shaping data using flexible techniques. The document also discusses database administration responsibilities, database management systems, data warehouse concepts and applications, and different data mining techniques.

Uploaded by

Mohini Giri
Management Information System
Introduction

• Data management is the process of ingesting,
storing, organizing and maintaining the data
created and collected by an organization.
Effective data management is a crucial part
of deploying the IT systems that run business
applications and provide analytical
information to help drive operational
decision-making and strategic planning by
corporate executives, business managers and
other end users.
Need for Data Management

• Regulatory compliance requirements
• Data privacy and protection laws
• Decision makers need information to make
effective decisions
• Greater financial exposure in fast-moving
markets
• Record keeping
Data Management best practices
• Simplify access to traditional and emerging
data
• Scrub data to infuse quality into existing
business processes
• Shape data using flexible manipulation
techniques
Data Independence
Data Redundancy

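• Update anomaly: a value stored in several places is changed in
some copies but not in others, leaving the data inconsistent.
• Deletion anomaly: deleting one record unintentionally removes
other facts that were stored only in that record.
• Insertion anomaly: a new fact cannot be recorded until unrelated
data that shares the same record is also available.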
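
The anomalies listed under data redundancy can be illustrated with a small sketch. The table below is a hypothetical in-memory example (the names, columns, and phone numbers are assumptions, not taken from the slides) in which a department's phone number is repeated on every employee row.

```python
# Hypothetical denormalized table: the department phone is repeated on every row.
employees = [
    {"emp_id": 1, "name": "Asha", "dept": "Sales", "dept_phone": "555-0100"},
    {"emp_id": 2, "name": "Ravi", "dept": "Sales", "dept_phone": "555-0100"},
    {"emp_id": 3, "name": "Meena", "dept": "HR", "dept_phone": "555-0200"},
]

def phones_for(dept, rows):
    """Return the set of distinct phone numbers recorded for a department."""
    return {row["dept_phone"] for row in rows if row["dept"] == dept}

# Update anomaly: the phone is changed on one row only, so the copies disagree.
employees[0]["dept_phone"] = "555-0199"
inconsistent = phones_for("Sales", employees)

# Deletion anomaly: removing the only HR employee also loses the HR phone number.
remaining = [row for row in employees if row["emp_id"] != 3]
hr_phone_lost = phones_for("HR", remaining) == set()

# Normalized alternative: each department's phone is stored exactly once.
departments = {"Sales": "555-0100", "HR": "555-0200"}
departments["Sales"] = "555-0199"  # one update, no conflicting copies
```

After the partial update, `inconsistent` holds two different numbers for the same department, which is the inconsistency that removing redundancy is meant to prevent.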
Data Consistency

Point-in-time consistency
Application Consistency
Transaction Consistency
Database Administrator

• The database administrator (DBA) is usually a dedicated role in
the IT department of a large organization. Smaller companies
that cannot afford a full-time DBA usually outsource or contract
the role to a specialized vendor, or merge it with another role in
the IT department so that both are performed by one person.
Database Administrator: Responsibilities

• Database security
• Database tuning
• Backup and recovery
• Producing reports from queries
Database Management Systems Concepts
• A Database Management System (DBMS) is software for storing and retrieving
users’ data while applying appropriate security measures.
Characteristics of DBMS
• Provides security and removes redundancy
• Self-describing nature of a database system
• Insulation between programs and data, and data abstraction
• Support of multiple views of the data
• Sharing of data and multiuser transaction processing
• DBMS allows entities and relations among them to form tables.
• It follows the ACID concept (Atomicity, Consistency, Isolation, and Durability).
• A DBMS supports a multi-user environment that allows users to access and
manipulate data in parallel.
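
The atomicity part of ACID can be sketched with Python's built-in `sqlite3` module. The `account` table, the names, and the `transfer` helper are illustrative assumptions. The point is only that a failed transfer leaves no partial update behind.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account (name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO account VALUES (?, ?)", [("alice", 100), ("bob", 50)])
conn.commit()

def transfer(conn, src, dst, amount):
    """Move amount from src to dst; both updates apply or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        return False

ok = transfer(conn, "alice", "bob", 30)       # succeeds
failed = transfer(conn, "alice", "bob", 500)  # would overdraw alice, so it is rolled back
balances = dict(conn.execute("SELECT name, balance FROM account"))
```

In the second call the credit to `bob` is executed first, but it is undone when the debit violates the `CHECK` constraint, so the balances reflect only the first transfer.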
Types of DBMS

There are four types of DBMS:
• Hierarchical model
• Network model
• Relational model
• Object-oriented model
DBMS Architecture

• Three-tier architecture
Data Warehouse

• The term “Data Warehouse” was coined by Bill Inmon in 1990. According to
Inmon, a data warehouse is a subject-oriented, integrated, time-variant, and
non-volatile collection of data. This data helps analysts make informed decisions
in an organization.
The key features of a data warehouse are discussed below:
• Subject oriented: A data warehouse is organized around subjects such as
products, customers, suppliers, sales, or revenue rather than around the
organization’s ongoing operations. It focuses on modelling and analysis of data
for decision making.
• Integrated: A data warehouse is constructed by integrating data from
heterogeneous sources such as relational databases and flat files. This
integration makes analysis of the data more effective.
• Time variant: The data collected in a data warehouse is identified with a particular
time period, so the warehouse provides information from a historical point of
view.
• Non-volatile: Previous data is not erased when new data is added. A data
warehouse is kept separate from the operational database, so frequent changes
in the operational database are not reflected in the data warehouse.
Data Warehouse Applications

• Financial services
• Banking services
• Consumer goods
• Retail sectors
• Controlled manufacturing
Types of Data Warehouse Applications
• Information processing, analytical processing, and data mining are the three types
of data warehouse applications, discussed below:
• Information processing: A data warehouse allows the data stored in it to be
processed by means of querying, basic statistical analysis, and reporting using
crosstabs, tables, charts, or graphs.
• Analytical processing: A data warehouse supports analytical processing of the
information stored in it. The data can be analyzed by means of basic OLAP
operations, including slice-and-dice, drill down, drill up, and pivoting.
• Data mining: Data mining supports knowledge discovery by finding hidden
patterns and associations, constructing analytical models, and performing
classification and prediction. The mining results can be presented using
visualization tools.
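
The basic OLAP operations named above can be sketched in plain Python. The fact rows, dimension names, and helper functions are hypothetical, and a real warehouse would use an OLAP engine or SQL rather than in-memory lists. Re-grouping by a different pair of dimensions stands in, roughly, for pivoting.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, quarter, region, product, sales amount)
facts = [
    (2023, "Q1", "North", "Laptop", 120),
    (2023, "Q1", "South", "Laptop", 80),
    (2023, "Q2", "North", "Phone", 200),
    (2023, "Q2", "South", "Phone", 150),
    (2024, "Q1", "North", "Laptop", 130),
]
DIMS = {"year": 0, "quarter": 1, "region": 2, "product": 3}

def aggregate(rows, dims):
    """Sum sales grouped by the named dimensions."""
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[DIMS[d]] for d in dims)
        totals[key] += row[4]
    return dict(totals)

def slice_rows(rows, dim, value):
    """Slice: fix one dimension to a single value."""
    return [r for r in rows if r[DIMS[dim]] == value]

def dice_rows(rows, **allowed):
    """Dice: restrict several dimensions to sets of allowed values."""
    return [r for r in rows if all(r[DIMS[d]] in vals for d, vals in allowed.items())]

by_quarter = aggregate(facts, ["year", "quarter"])  # drill down to quarter level
by_year = aggregate(facts, ["year"])                # drill up (roll up) to year level
north_only = slice_rows(facts, "region", "North")
laptops_2023 = dice_rows(facts, year={2023}, product={"Laptop"})
by_region_product = aggregate(facts, ["region", "product"])  # re-grouped view
```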
Difference between OLAP and OLTP

| Sr. No. | Data Warehouse (OLAP) | Operational Database (OLTP) |
|---|---|---|
| 1 | It involves historical processing of information. | It involves day-to-day processing. |
| 2 | OLAP systems are used by knowledge workers such as executives, managers, and analysts. | OLTP systems are used by clerks, DBAs, or database professionals. |
| 3 | It is used to analyze the business. | It is used to run the business. |
| 4 | It focuses on information out. | It focuses on data in. |
| 5 | It is based on the Star Schema, Snowflake Schema, and Fact Constellation Schema. | It is based on the Entity Relationship Model. |
| 6 | It is subject oriented. | It is application oriented. |
| 7 | It contains historical data. | It contains current data. |
| 8 | It provides summarized and consolidated data. | It provides primitive and highly detailed data. |
| 9 | It provides a summarized and multidimensional view of data. | It provides a detailed and flat relational view of data. |
| 10 | The number of users is in hundreds. | The number of users is in thousands. |
| 11 | The number of records accessed is in millions. | The number of records accessed is in tens. |
| 12 | The database size is from 100 GB to 100 TB. | The database size is from 100 MB to 100 GB. |
| 13 | It is highly flexible. | It provides high performance. |

Integrating Heterogeneous Databases

To integrate heterogeneous databases, we have
two approaches:
• Query-driven Approach
• Update-driven Approach
Data Mining

Data mining is the process of extracting useful information
and patterns from enormous amounts of data. It includes
collection, extraction, analysis and statistics of data. It is also
known as the knowledge discovery process, knowledge mining
from data, or data/pattern analysis. Data mining is a logical
process of searching through data to find useful information.
Data mining involves three steps:
• Exploration: The data is cleaned and converted
into another form. The nature of the data is also determined.
• Pattern identification: The next step is to choose the
pattern that will make the best prediction.
• Deployment: The identified patterns are used to get the
desired outcome.
Data Mining Techniques

One of the most important tasks in data mining is selecting the correct data mining technique. The
technique has to be chosen based on the type of business and the type of problem the
business faces.
• Statistics
• Clustering
• Visualization
• Decision Tree
• Association Rules
An association rule provides two main pieces of information:
• Support: how often the rule applies
• Confidence: how often the rule is correct

• Neural Networks
There are two main parts of this technique: the node and the link.
• The node, which loosely corresponds to the neuron in the human brain
• The link, which loosely corresponds to the connections between the neurons in the human brain

• Classification
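
The support and confidence measures listed under association rules can be computed as in the following minimal sketch. The market-basket transactions and the rule "bread implies milk" are hypothetical examples, and the definitions used are the standard ones: support is the fraction of transactions containing all items in the rule, and confidence is the fraction of transactions containing the antecedent that also contain the consequent.

```python
# Hypothetical market-basket transactions, one set of items per purchase.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk", "eggs"},
]

def count(itemset, transactions):
    """Number of transactions that contain every item in itemset."""
    return sum(1 for t in transactions if itemset <= t)

def support(itemset, transactions):
    """Fraction of all transactions in which the itemset appears."""
    return count(itemset, transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Among transactions with the antecedent, the fraction that also have the consequent."""
    return count(antecedent | consequent, transactions) / count(antecedent, transactions)

# Rule under test: bread -> milk
rule_support = support({"bread", "milk"}, transactions)
rule_confidence = confidence({"bread"}, {"milk"}, transactions)
```

Here bread and milk appear together in 3 of 5 transactions, and 3 of the 4 transactions containing bread also contain milk.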
Decision Tree
END
