0% found this document useful (0 votes)

28 views27 pages

Unit 4

Uploaded by

jovialdarwin8

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views27 pages

Unit 4

Uploaded by

jovialdarwin8

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 27

Introduction to OLTP and OLAP

• Data drives nearly every business today; a company’s ability to harness

the value of its data is crucial to delivering customer experiences and
products/services that keep them relevant and competitive.
• There are two approaches to data processing systems: one focuses on
operations, and the other focuses on analytics for business
intelligence. Both are essential to leverage the full power of data.
• These two systems are Online Transaction Processing
(OLTP) and Online Analytical Processing (OLAP).
• Online transaction processing (OLTP) captures, stores, and processes
data from transactions in real time. Online analytical processing
(OLAP) uses complex queries to analyze aggregated historical data
from OLTP systems.
OLTP
• Online transactional processing (OLTP) is used for real-time execution of large
volumes of database transactions by large numbers of people. OLTP systems are
used for everyday transactions like ATMs, ecommerce purchases, online banking,
text messages, and account changes, among many other day-to-day transactions.
• These transactions use a relational database or SQL database to handle extensive
volumes of simple transactions, enable multi-user access to the same data, process
data quickly, provide index datasets for fast searches, and are available continually.
• An OLTP system captures and maintains transaction data in a database. Each
transaction involves individual database records made up of multiple fields or
columns. This process can be challenging without the right tools.
• In OLTP, the emphasis is on fast processing, because OLTP databases are read,
written, and updated frequently. If a transaction fails, built-in system logic ensures
data integrity.
• OLTP systems can be used to provide data for their OLAP systems, as the two work
together to optimize the value of data.
OLAP
• Data analysts and data engineers use online analytical processing (OLAP) for data
mining, analytics, and business intelligence. OLAP is used to process
multidimensional analysis on large volumes of data at very high speeds
(milliseconds). An OLTP system often processes and stores data in repositories, which
OLAP then sources for analysis. Many businesses use OLAP for financial analysis,
forecasting, budgeting, reporting, marketing and sales optimization, and decision
making.
• OLAP applies complex queries to large amounts of historical data aggregated from
OLTP databases and other sources. In OLAP, the emphasis is on response time to
these complex queries. Each query involves one or more columns of data aggregated
from many rows.
• Examples include year-over-year financial performance or marketing lead generation
trends. OLAP databases and data warehouses give analysts and decision-makers the
ability to use custom reporting tools to turn data into information. Query failure in
OLAP does not interrupt or delay transaction processing for customers, but it can
delay or impact the accuracy of business intelligence insights.
OLTP vs. OLAP: Key differences
DatawareHouse Architecture
• A data-warehouse is a heterogeneous collection of different data
sources organized under a unified schema.

• There are 2 approaches for constructing data-warehouse:

a) Top-down approach
b) Bottom-up approach

The essential components are discussed below:

Top Down Approach
External Sources :

External source is a source from where data is collected irrespective of

the type of data. Data can be structured, semi structured and
unstructured as well.

Stage Area :

Since the data, extracted from the external sources does not follow a
particular format, so there is a need to validate this data to load into
datawarehouse. For this purpose, it is recommended to use ETL tool.
• E(Extracted): Data is extracted from External data source.

• T(Transform): Data is transformed into the standard format.

• L(Load): Data is loaded into datawarehouse after transforming it into the standard
format.

Data-warehouse :

After cleansing of data, it is stored in the datawarehouse as central repository. It

actually stores the meta data and the actual data gets stored in the data marts.
• Note that datawarehouse stores the data in its purest form in this
top-down approach.
• Data Marts :
• Data mart is also a part of storage component. It stores the information of a
particular function of an organisation which is handled by single authority.
• There can be as many number of data marts in an organisation depending
upon the functions. We can also say that data mart contains subset of the data
stored in datawarehouse.

• Data Mining:

• The practice of analysing the big data present in datawarehouse is data mining.
It is used to find the hidden patterns that are present in the database or in
datawarehouse with the help of algorithm of data mining.
• This approach is defined by Inmon as – datawarehouse as a central repository
for the complete organisation and data marts are created from it after the
complete datawarehouse has been created.
Advantages of Top-Down Approach
1.Since the data marts are created from the datawarehouse, provides consistent dimensional
view of data marts.
2.Improved data consistency: The top-down approach promotes data consistency by ensuring
that all data marts are sourced from a common data warehouse. This ensures that all data is
standardized, reducing the risk of errors and inconsistencies in reporting.
3.Easier maintenance: Since all data marts are sourced from a central data warehouse, it is
easier to maintain and update the data in a top-down approach. Changes can be made to
the data warehouse, and those changes will automatically propagate to all the data marts
that rely on it.
4.Better scalability: The top-down approach is highly scalable, allowing organizations to add
new data marts as needed without disrupting the existing infrastructure. This is particularly
important for organizations that are experiencing rapid growth or have evolving business
needs.
5.Improved governance: The top-down approach facilitates better governance by enabling
centralized control of data access, security, and quality. This ensures that all data is managed
consistently and that it meets the organization’s standards for quality and compliance.
6. Reduced duplication: The top-down approach reduces data
duplication by ensuring that data is stored only once in the data
warehouse. This saves storage space and reduces the risk of data
inconsistencies.

7. Better reporting: The top-down approach enables better reporting by

providing a consistent view of data across all data marts. This makes it
easier to create accurate and timely reports, which can improve
decision-making and drive better business outcomes.
Disadvantages of Top Down Approach
1. The cost, time taken in designing and its maintenance is very high.
2. Complexity: The top-down approach can be complex to implement and maintain, particularly for
large organizations with complex data needs. The design and implementation of the data warehouse
and data marts can be time-consuming and costly.
3. Lack of flexibility: The top-down approach may not be suitable for organizations that require a high
degree of flexibility in their data reporting and analysis. Since the design of the data warehouse and
data marts is pre-determined, it may not be possible to adapt to new or changing business
requirements.
4. Limited user involvement: The top-down approach can be dominated by IT departments, which may
lead to limited user involvement in the design and implementation process. This can result in data
marts that do not meet the specific needs of business users.
5. Data latency: The top-down approach may result in data latency, particularly when data is sourced
from multiple systems. This can impact the accuracy and timeliness of reporting and analysis.
6. Data ownership: The top-down approach can create challenges around data ownership and control.
Since data is centralized in the data warehouse, it may not be clear who is responsible for
maintaining and updating the data.
Bottom-Up Approach
1.First, the data is extracted from external sources (same as happens in
top-down approach).

2.Then, the data go through the staging area (as explained above) and
loaded into data marts instead of datawarehouse. The data marts are
created first and provide reporting capability. It addresses a single
business area.

3.These data marts are then integrated into datawarehouse.

Advantages of Bottom-Up Approach
1. As the data marts are created first, so the reports are quickly generated.

2. We can accommodate more number of data marts here and in this way datawarehouse can
be extended.

3. Also, the cost and time taken in designing this model is low comparatively.
4. Incremental development: The bottom-up approach supports incremental development,
allowing for the creation of data marts one at a time. This allows for quick wins and
incremental improvements in data reporting and analysis.
5. User involvement: The bottom-up approach encourages user involvement in the design and
implementation process. Business users can provide feedback on the data marts and
reports, helping to ensure that the data marts meet their specific needs.
6. Flexibility: The bottom-up approach is more flexible than the top-down approach, as it
allows for the creation of data marts based on specific business needs. This approach can be
particularly useful for organizations that require a high degree of flexibility in their reporting
and analysis.
6. Faster time to value: The bottom-up approach can deliver faster
time to value, as the data marts can be created more quickly than a
centralized data warehouse. This can be particularly useful for smaller
organizations with limited resources.

7. Reduced risk: The bottom-up approach reduces the risk of failure, as

data marts can be tested and refined before being incorporated into a
larger data warehouse. This approach can also help to identify and
address potential data quality issues early in the process.
Disadvantage of Bottom-Up Approach
1. This model is not strong as top-down approach as dimensional view of data marts is not consistent
as it is in above approach.
2. Data silos: The bottom-up approach can lead to the creation of data silos, where different business
units create their own data marts without considering the needs of other parts of the organization.
This can lead to inconsistencies and redundancies in the data, as well as difficulties in integrating
data across the organization.
3. Integration challenges: Because the bottom-up approach relies on the integration of multiple data
marts, it can be more difficult to integrate data from different sources and ensure consistency
across the organization. This can lead to issues with data quality and accuracy.
4. Duplication of effort: In a bottom-up approach, different business units may duplicate effort by
creating their own data marts with similar or overlapping data. This can lead to inefficiencies and
higher costs in data management.
5. Lack of enterprise-wide view: The bottom-up approach can result in a lack of enterprise-wide view,
as data marts are typically designed to meet the needs of specific business units rather than the
organization as a whole. This can make it difficult to gain a comprehensive understanding of the
organization’s data and business processes.
6. Complexity: The bottom-up approach can be more complex than the top-down approach, as it
involves the integration of multiple data marts with varying levels of complexity and granularity. This
can make it more difficult to manage and maintain the data warehouse over time.
Characteristics of Datawarehouse
• Subject-oriented – A data warehouse is always a subject oriented as
it delivers information about a theme instead of organization’s
current operations. It can be achieved on specific theme.
• That means the data warehousing process is proposed to handle
with a specific theme which is more defined. These themes can be
sales, distributions, marketing etc.
A data warehouse never put emphasis only current operations.
Instead, it focuses on demonstrating and analysis of data to make
various decision.
• Integrated – It is somewhere same as subject orientation which is
made in a reliable format. Integration means founding a shared entity
to scale the all similar data from the different databases.
• The data also required to be resided into various data warehouse in
shared and generally granted manner.
A data warehouse is built by integrating data from various sources of
data such that a mainframe and a relational database. In addition, it
must have reliable naming conventions, format and codes. Integration
of data warehouse benefits in effective analysis of data.
• Time-Variant – In this data is maintained via different intervals of time
such as weekly, monthly, or annually etc. It founds various time limit
which are structured between the large datasets and are held in
online transaction process (OLTP).
• The time limits for data warehouse is wide-ranged than that of
operational systems. The data resided in data warehouse is predictable
with a specific interval of time and delivers information from the
historical perspective.
• Non-Volatile – As the name defines the data resided in data warehouse is
permanent. It also means that data is not erased or deleted when new
data is inserted. It includes the mammoth quantity of data that is inserted
into modification between the selected quantity on logical business.
• It evaluates the analysis within the technologies of warehouse. Data is
not updated, once it is stored in the data warehouse, to maintain the
historical data.
• In this, data is read-only and refreshed at particular intervals. This is
beneficial in analysing historical data and in comprehension the
functionality. It does not need transaction process, recapture and
concurrency control mechanism.
Background
• A Database Management System (DBMS) stores data in the form of
tables, uses ER model. For example, a DBMS of college has tables for
students, faculty, etc.
• A Data Warehouse is separate from DBMS, it stores a huge amount of
data, which is typically collected from multiple heterogeneous sources
like files, DBMS, etc.
• The goal is to produce statistical results that may help in decision
makings. For example, a college might want to see quick different
results, like how the placement of CS students has improved over the
last 10 years, in terms of salaries, counts, etc.
Need for Data Warehouse
• An ordinary Database can store MBs to GBs of data and that too for a
specific purpose. For storing data of TB size, the storage shifted to
Data Warehouse.
• Besides this, a transactional database doesn’t offer itself to analytics.
• To effectively perform analytics, an organization keeps a central Data
Warehouse to closely study its business by organizing, understanding,
and using its historic data for taking strategic decisions and analyzing
trends.
Benefits of Data Warehouse
• Better business analytics: Data warehouse plays an important role in every
business to store and analysis of all the past data and records of the
company. which can further increase the understanding or analysis of data
to the company.
• Faster Queries: Data warehouse is designed to handle large queries that’s
why it runs queries faster than the database.
• Improved data Quality: In the data warehouse the data you gathered from
different sources is being stored and analyzed it does not interfere with or
add data by itself so your quality of data is maintained and if you get any
issue regarding data quality then the data warehouse team will solve this.
• Historical Insight: The warehouse stores all your historical data which
contains details about the business so that one can analyze it at any time
and extract insights from it
Front Room and Back Room in
MetaData
• Data warehouse metadata systems are sometimes separated into two
sections:
• back room metadata that are used for Extract, transform, load
functions to get OLTP data into a data warehouse
• front room metadata that are used to label screens and create
reports
• Meta-data Management involves storing information about other
information. With different types of media being used references to
the location of the data can allow management of diverse
repositories.
• Back Room: "Closer to the data..."
Related to programs, data models & databases
Related to ETL
Useful to DBAs, modelers, developers, programmers etc.

• Front Room: "Closer to the user"

Descriptive & informative - includes semantic details
Related to queries and reports
Useful to anyone who writes queries & reports

DBMS SUPER 25 K-Scheme
No ratings yet
DBMS SUPER 25 K-Scheme
45 pages
Data Warehouse Architecture
100% (1)
Data Warehouse Architecture
5 pages
Modern Database Management Test Bank Chapter 7
100% (1)
Modern Database Management Test Bank Chapter 7
37 pages
OE BI Module 2
No ratings yet
OE BI Module 2
87 pages
MultiDimensional Data Model
No ratings yet
MultiDimensional Data Model
22 pages
DWM Unit 3. Data Warehousing Designing & OLAP II
100% (1)
DWM Unit 3. Data Warehousing Designing & OLAP II
21 pages
AWS Data Engineering Involves Using Amazon Web Services
No ratings yet
AWS Data Engineering Involves Using Amazon Web Services
2 pages
DW Notes
No ratings yet
DW Notes
57 pages
Answers by GRP
No ratings yet
Answers by GRP
22 pages
Fundamentals of Data Science Notes (Module - 2)
No ratings yet
Fundamentals of Data Science Notes (Module - 2)
11 pages
DWM Cheatsheet Sem 5
No ratings yet
DWM Cheatsheet Sem 5
27 pages
Qlik Interview Questions & Answers Updated
No ratings yet
Qlik Interview Questions & Answers Updated
20 pages
Unit 5 Implementing and Maintaining A Data Warehouse Environment
No ratings yet
Unit 5 Implementing and Maintaining A Data Warehouse Environment
23 pages
21IS503 UnitI LM4
No ratings yet
21IS503 UnitI LM4
26 pages
Data Warehousing and Data Mining
No ratings yet
Data Warehousing and Data Mining
36 pages
SQL Injection Attacks and Prevention Tec PDF
No ratings yet
SQL Injection Attacks and Prevention Tec PDF
4 pages
Dbms Unit 2
No ratings yet
Dbms Unit 2
138 pages
Introduction To DW
No ratings yet
Introduction To DW
28 pages
DWM 3
No ratings yet
DWM 3
15 pages
Data Warehouse Models
No ratings yet
Data Warehouse Models
3 pages
Oracle Searching Matching
No ratings yet
Oracle Searching Matching
27 pages
Acceptance Testing and ETL Process j8Mus6Ctvj
No ratings yet
Acceptance Testing and ETL Process j8Mus6Ctvj
19 pages
Bi Unit 3 Notes PDF
No ratings yet
Bi Unit 3 Notes PDF
31 pages
Supermarket Management System
No ratings yet
Supermarket Management System
17 pages
CH 3 SQL
No ratings yet
CH 3 SQL
44 pages
Session 8 9 Questions
100% (1)
Session 8 9 Questions
27 pages
COMPUTER 2 - Exam N Answers
No ratings yet
COMPUTER 2 - Exam N Answers
5 pages
Flight Booking System
No ratings yet
Flight Booking System
44 pages
2-Data Warehouse Architecture - Three-Tier Data Warehouse Architecture-16!12!2024
No ratings yet
2-Data Warehouse Architecture - Three-Tier Data Warehouse Architecture-16!12!2024
30 pages
Unit 2 Datawarehouse
No ratings yet
Unit 2 Datawarehouse
17 pages
MCS 221 Notes
No ratings yet
MCS 221 Notes
24 pages
Unit 3 Notes DWM
No ratings yet
Unit 3 Notes DWM
22 pages
Ais275 Jan18 Suggested Solutions
No ratings yet
Ais275 Jan18 Suggested Solutions
7 pages
Ignou MCSL 45
No ratings yet
Ignou MCSL 45
20 pages
Sem3 Unit1 DW
No ratings yet
Sem3 Unit1 DW
12 pages
Datawarehouse Architecture
No ratings yet
Datawarehouse Architecture
5 pages
Unit - 1 Introduction To Data Warehousing
No ratings yet
Unit - 1 Introduction To Data Warehousing
57 pages
Shaurya Sharma PS LAB WORK
No ratings yet
Shaurya Sharma PS LAB WORK
70 pages
Olap and Oltap
No ratings yet
Olap and Oltap
14 pages
Unit3 Notes
No ratings yet
Unit3 Notes
15 pages
1904401-Database Management System
No ratings yet
1904401-Database Management System
15 pages
SQL DML
No ratings yet
SQL DML
22 pages
DBMS-UNIT-1 R16 (Ref-2)
No ratings yet
DBMS-UNIT-1 R16 (Ref-2)
12 pages
Gocad Tutorial Old
No ratings yet
Gocad Tutorial Old
34 pages
Unit Ii-Ba (2) - 1
No ratings yet
Unit Ii-Ba (2) - 1
29 pages
DWM Unit 3 Data Warehousing Designing & OLAP II
No ratings yet
DWM Unit 3 Data Warehousing Designing & OLAP II
23 pages
DM Unit 2
No ratings yet
DM Unit 2
21 pages
Lobster Data-Broschure EN Web Jun2015
No ratings yet
Lobster Data-Broschure EN Web Jun2015
9 pages
CA2 Notes
No ratings yet
CA2 Notes
8 pages
DWM Unit 1
No ratings yet
DWM Unit 1
48 pages
An Improvement of DBSCAN Algorithm To Analyze Cluster For Large Dataset
No ratings yet
An Improvement of DBSCAN Algorithm To Analyze Cluster For Large Dataset
5 pages
MetaData and Its Classification f6qQfIZTfw
No ratings yet
MetaData and Its Classification f6qQfIZTfw
14 pages
Week 5 Information Access and Retrieval Tools
No ratings yet
Week 5 Information Access and Retrieval Tools
11 pages
Level of Testing Slides eMvO54ziWH
No ratings yet
Level of Testing Slides eMvO54ziWH
14 pages
DWDM Unit 1
No ratings yet
DWDM Unit 1
23 pages
Beginning MySQL
86% (37)
Beginning MySQL
865 pages
Snowflake Training Course v1.1
No ratings yet
Snowflake Training Course v1.1
10 pages
Dataware House Design and Modeling
No ratings yet
Dataware House Design and Modeling
5 pages
Data Warehousing: 5 April 2013 TCS Public
No ratings yet
Data Warehousing: 5 April 2013 TCS Public
50 pages
Spark
No ratings yet
Spark
6 pages
Unit Ii
No ratings yet
Unit Ii
59 pages
Bill User Menu: Pay Select From
No ratings yet
Bill User Menu: Pay Select From
3 pages
Data Warehouse Notes
No ratings yet
Data Warehouse Notes
21 pages
Data Warehousing and Data Mining Sample 2 PRESENTATION
No ratings yet
Data Warehousing and Data Mining Sample 2 PRESENTATION
21 pages
Unit Ii-Ba
No ratings yet
Unit Ii-Ba
16 pages
DW Concepts
No ratings yet
DW Concepts
40 pages
Unit I
No ratings yet
Unit I
18 pages
Library Management SQL Project
No ratings yet
Library Management SQL Project
4 pages
Final Interview Questions (Etl - Informatica) : Subject Oriented, Integrated, Time Variant, Non Volatile
100% (1)
Final Interview Questions (Etl - Informatica) : Subject Oriented, Integrated, Time Variant, Non Volatile
77 pages
Unit - II Data Warehouseing&OLAP
No ratings yet
Unit - II Data Warehouseing&OLAP
17 pages
Data Warehouse For Bignners
No ratings yet
Data Warehouse For Bignners
14 pages
Farooq Resume 1J23-2
No ratings yet
Farooq Resume 1J23-2
3 pages
Unit 3 Generating Functions and Recurrence Relations
No ratings yet
Unit 3 Generating Functions and Recurrence Relations
3 pages
Model Test Question Paper
No ratings yet
Model Test Question Paper
4 pages
Data Mining Unit-2 Notes
No ratings yet
Data Mining Unit-2 Notes
8 pages
DW Concepts
100% (1)
DW Concepts
40 pages
Fall 2013 Assignment Program: Bachelor of Computer Application Semester 6Th Sem Subject Code & Name Bc0058 - Data Warehousing
No ratings yet
Fall 2013 Assignment Program: Bachelor of Computer Application Semester 6Th Sem Subject Code & Name Bc0058 - Data Warehousing
9 pages
Assignment 2
No ratings yet
Assignment 2
6 pages
Module 2
No ratings yet
Module 2
43 pages
Assignment No 2
No ratings yet
Assignment No 2
26 pages
Data Warehouse Definition: - Users and System Orientation
No ratings yet
Data Warehouse Definition: - Users and System Orientation
6 pages
Data Warehouse
No ratings yet
Data Warehouse
56 pages
Data Warehouse - DWDM
No ratings yet
Data Warehouse - DWDM
54 pages
Apache NiFi and Hop Comparison
No ratings yet
Apache NiFi and Hop Comparison
1 page
Data Warehouse
No ratings yet
Data Warehouse
74 pages
What Is A Data Warehouse?
No ratings yet
What Is A Data Warehouse?
39 pages
Introduction On Data Warehouse With OLTP and OLAP: Arpit Parekh
No ratings yet
Introduction On Data Warehouse With OLTP and OLAP: Arpit Parekh
5 pages
Data Warehouse Concepts
No ratings yet
Data Warehouse Concepts
53 pages
DW Concepts
No ratings yet
DW Concepts
40 pages
Data Warehouse References
No ratings yet
Data Warehouse References
40 pages
THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE: "THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE"
From Everand
THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE: "THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE"
AJIT DASH
2/5 (2)
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Knight's Microsoft Business Intelligence 24-Hour Trainer: Leveraging Microsoft SQL Server Integration, Analysis, and Reporting Services with Excel and SharePoint
From Everand
Knight's Microsoft Business Intelligence 24-Hour Trainer: Leveraging Microsoft SQL Server Integration, Analysis, and Reporting Services with Excel and SharePoint
Brian Knight
3/5 (1)
The InfluxDB Handbook: Deploying, Optimizing, and Scaling Time Series Data
From Everand
The InfluxDB Handbook: Deploying, Optimizing, and Scaling Time Series Data
Robert Johnson
No ratings yet
Learn Data Warehousing in 24 Hours
From Everand
Learn Data Warehousing in 24 Hours
Alex Nordeen
No ratings yet

Unit 4

Uploaded by

Unit 4

Uploaded by

Introduction to OLTP and OLAP

• Data drives nearly every business today; a company’s ability to harness

• There are 2 approaches for constructing data-warehouse:

The essential components are discussed below:

External source is a source from where data is collected irrespective of

• T(Transform): Data is transformed into the standard format.

After cleansing of data, it is stored in the datawarehouse as central repository. It

7. Better reporting: The top-down approach enables better reporting by

3.These data marts are then integrated into datawarehouse.

7. Reduced risk: The bottom-up approach reduces the risk of failure, as

• Front Room: "Closer to the user"

You might also like