Mis Unit 3
Mis Unit 3
System
Introduction
Updation Anomaly:
Deletion Anomaly:
Insertion Anomaly:
Data Consistency
Point-in-time consistency
Application Consistency
Transaction Consistency
Data Base Administrator
• Database Security:
• Database Tuning:
• Backup and Recovery:
• Producing Reports from Queries:
Database Management Systems
Concepts
• Database Management System (DBMS) is a software for storing and retrieving
users’ data while considering appropriate security measures.
Characteristics of DBMS
• Provides security and removes redundancy
• Self-describing nature of a database system
• Insulation between programs and data abstraction
• Support of multiple views of the data
• Sharing of data and multiuser transaction processing
• DBMS allows entities and relations among them to form tables.
• It follows the ACID concept ( Atomicity, Consistency, Isolation, and Durability).
• DBMS supports multi-user environment that allows users to access and
manipulate data in parallel.
Types of DBMS
• The term “Data Warehouse” was first coined by Bill Inmon in 1990. According to
Inmon, a data warehouse is a subject oriented, integrated, time-variant, and non-
volatile collection of data. This data helps analysts to take informed decisions in an
organization.
The key features of a data warehouse are discussed below −
• Subject Oriented: A data warehouse is subject oriented because it provides
information around a subject rather than the organization’s ongoing operations.
These subjects can be product, customers, suppliers, sales, revenue, etc. A data
warehouse does not focus on the ongoing operations, rather it focuses on
modelling and analysis of data for decision making.
• Integrated: A data warehouse is constructed by integrating data from
heterogeneous sources such as relational databases, flat files, etc. This integration
enhances the effective analysis of data.
• Time Variant: The data collected in a data warehouse is identified with a particular
time period. The data in a data warehouse provides information from the historical
point of view.
• Non-volatile: Non-volatile means the previous data is not erased when new data is
added to it. A data warehouse is kept separate from the operational database and
therefore frequent changes in operational database is not reflected in the data
warehouse.
Data Warehouse Applications
• Financial services
• Banking services
• Consumer goods
• Retail sectors
• Controlled manufacturing
Types of Data Warehouse
• Information processing, analytical processing, and data mining are the three types
of data warehouse applications that are discussed below −
• Information Processing: A data warehouse allows to process the data stored in it.
The data can be processed by means of querying, basic statistical analysis,
reporting using crosstabs, tables, charts, or graphs.
• Analytical Processing: A data warehouse supports analytical processing of the
information stored in it. The data can be analyzed by means of basic OLAP
operations, including slice-and-dice, drill down, drill up, and pivoting.
• Data Mining: Data mining supports knowledge discovery by finding hidden
patterns and associations, constructing analytical models, performing classification
and prediction. These mining results can be presented using the visualization tools.
Difference in OLAP AND OLTP
Sr.N
Data Warehouse (OLAP) Operational Database(OLTP)
o.
5 It is based on Star Schema, Snowflake Schema, and Fact Constellation Schema. It is based on Entity Relationship Model.
8 It provides summarized and consolidated data. It provides primitive and highly detailed data.
9 It provides summarized and multidimensional view of data. It provides detailed and flat relational view of data.
11 The number of records accessed is in millions. The number of records accessed is in tens.
12 The database size is from 100GB to 100 TB. The database size is from 100 MB to 100 GB.
One of the most important task in Data Mining is to select the correct data mining technique. Data
Mining technique has to be chosen based on the type of business and the type of problem your
business faces.
• Statistics
• Clustering
• Visualization
• Decision Tree
• Association Rules
Association rule offers two major information
• Support– How often is the rule applied ?
• Confidence– How often the rule is correct ?
• Neural Networks
There are two main parts of this technique – the node and the link
• The node– which freely matches to the neuron in the human brain
• The link– which freely matches to the connections between the neurons in the human brain
• Classification
Decision Tree
END