Uploaded by

Zakeer Zak
Copyright
© All Rights Reserved
Data Warehousing Interview Questions

Q) What is a Data Warehouse?

A Data Warehouse is a central store built by fetching the data from different OLTP
systems, making it coherent (consistent) and loading it into one place. Reports are
then generated from the Data Warehouse.

Q) What is OLTP?

OLTP stands for Online Transaction Processing. An OLTP system stores transactional
data. The operational systems are where the data is put in.

Q) What is OLAP?

OLAP stands for Online Analytical Processing. An OLAP system stores analytical data
that is used for analysis and reporting. The Data Warehouse is where we get the data out.

Q) What is the need for a Data Warehouse?

Let us assume ‘ABC’ bank operates in multiple countries, with Country1 data residing
in OLTP1, Country2 data in OLTP2 and Country3 data in OLTP3. If one day ABC Bank
requires consolidated reports across all countries, we need to go for data
warehousing.
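
The ABC bank scenario can be sketched in a few lines of Python. This is only an illustration: the three lists stand in for OLTP1, OLTP2 and OLTP3, and the account names, balances and function names are made up for the example.

```python
# Hypothetical per-country OLTP extracts; all names and figures are illustrative.
oltp1 = [{"account": "A1", "balance": 100.0}, {"account": "A2", "balance": 250.0}]
oltp2 = [{"account": "B1", "balance": 75.0}]
oltp3 = [{"account": "C1", "balance": 300.0}, {"account": "C2", "balance": 25.0}]


def consolidate(sources):
    """Merge rows from every source, tagging each row with its country."""
    warehouse = []
    for country, rows in sources.items():
        for row in rows:
            warehouse.append({"country": country, **row})
    return warehouse


def total_balance_by_country(warehouse):
    """Build a consolidated report from the single merged store."""
    totals = {}
    for row in warehouse:
        totals[row["country"]] = totals.get(row["country"], 0.0) + row["balance"]
    return totals


warehouse = consolidate({"Country1": oltp1, "Country2": oltp2, "Country3": oltp3})
report = total_balance_by_country(warehouse)
print(report)
```

Once the rows sit in one store, the consolidated report is a single pass over it and does not query each country's system separately.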

Q) Why do we never create Reports on top of OLTPs?

OLTP systems do not maintain complete history, so that transaction performance
stays high. It is therefore not possible to analyze the data over a wide time range.

Q) What are the issues we face when creating Reports on top of OLTPs?

We are fetching the data from multiple transactional systems to generate a
consolidated report, so it takes some time to get the final consolidated report.
OLTP systems are highly normalized, so to get the report output we need to join
a large number of tables. It is also not recommended to insert and retrieve data
from the same system at the same time.

Q) Benefits of OLAP

OLAP maintains complete history, so we can make better analysis using the complete
data. Reports do not have to query the transactional systems, because the complete
data from all the transactional sources is already in the Data Warehouse. The data
is completely de-normalized.

Q) ER Modeling

Entity Relationship Modeling is used to design OLTP databases. Data is highly
normalized.

Q) Dimensional Modeling

Dimensional Modeling is used to design OLAP databases. It is a particular design
methodology of data modeling wherein the goal of modeling is to improve query
performance.

Q) Star Schema

A Star Schema is a Data Warehouse database design which contains a centrally
located fact table surrounded by multiple dimension tables. In a star schema
all the dimension tables connect directly to the fact table.
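
A minimal sketch of a star schema, using Python's built-in sqlite3 module with an in-memory database. The table names (fact_sales, dim_product, dim_customer, dim_date), the columns and the sample rows are assumptions made for illustration and are not taken from any particular warehouse.

```python
import sqlite3

star = sqlite3.connect(":memory:")
star.executescript("""
-- Dimension tables: descriptive data (illustrative names).
CREATE TABLE dim_product  (product_key  INTEGER PRIMARY KEY, product_name  TEXT);
CREATE TABLE dim_customer (customer_key INTEGER PRIMARY KEY, customer_name TEXT);
CREATE TABLE dim_date     (date_key     INTEGER PRIMARY KEY, calendar_date TEXT);

-- Central fact table: every dimension is referenced directly.
CREATE TABLE fact_sales (
    product_key  INTEGER REFERENCES dim_product(product_key),
    customer_key INTEGER REFERENCES dim_customer(customer_key),
    date_key     INTEGER REFERENCES dim_date(date_key),
    sales_amount REAL
);

INSERT INTO dim_product  VALUES (1, 'Laptop'), (2, 'Phone');
INSERT INTO dim_customer VALUES (1, 'Asha'), (2, 'Ravi');
INSERT INTO dim_date     VALUES (20240101, '2024-01-01'), (20240102, '2024-01-02');
INSERT INTO fact_sales   VALUES
    (1, 1, 20240101, 900.0),
    (2, 1, 20240101, 500.0),
    (1, 2, 20240102, 950.0);
""")

# One join from the fact table to a dimension is enough to slice the metric.
star_rows = star.execute("""
    SELECT p.product_name, SUM(f.sales_amount)
    FROM fact_sales f
    JOIN dim_product p ON p.product_key = f.product_key
    GROUP BY p.product_name
    ORDER BY p.product_name
""").fetchall()
print(star_rows)
```

Because each dimension is one join away from the fact table, a report needs at most one join per dimension it uses.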

Q) Snowflake Schema

A Snowflake Schema consists of a fact table surrounded by multiple dimension
tables, which can in turn be connected to other dimension tables. In a snowflake
schema some of the dimensions do not connect directly to the fact table. When
dimension tables store a large number of rows with redundant data and space is an
issue, we can use a snowflake schema to save space.
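
A small sketch of a snowflake schema, again with sqlite3 in memory. Here the hypothetical dim_category table is split out of dim_product, so the category name is stored once instead of on every product row. All table names and rows are illustrative assumptions.

```python
import sqlite3

snow = sqlite3.connect(":memory:")
snow.executescript("""
-- Outer dimension: not connected to the fact table directly.
CREATE TABLE dim_category (category_key INTEGER PRIMARY KEY, category_name TEXT);

-- Inner dimension: references the outer dimension instead of repeating its text.
CREATE TABLE dim_product (
    product_key  INTEGER PRIMARY KEY,
    product_name TEXT,
    category_key INTEGER REFERENCES dim_category(category_key)
);

CREATE TABLE fact_sales (
    product_key  INTEGER REFERENCES dim_product(product_key),
    sales_amount REAL
);

INSERT INTO dim_category VALUES (1, 'Electronics'), (2, 'Furniture');
INSERT INTO dim_product  VALUES (1, 'Laptop', 1), (2, 'Phone', 1), (3, 'Desk', 2);
INSERT INTO fact_sales   VALUES (1, 900.0), (2, 500.0), (3, 200.0), (3, 150.0);
""")

# Reaching the category takes two joins: fact -> product -> category.
snowflake_rows = snow.execute("""
    SELECT c.category_name, SUM(f.sales_amount)
    FROM fact_sales f
    JOIN dim_product  p ON p.product_key  = f.product_key
    JOIN dim_category c ON c.category_key = p.category_key
    GROUP BY c.category_name
    ORDER BY c.category_name
""").fetchall()
print(snowflake_rows)
```

The trade-off is visible in the query: the space saved in the dimension is paid for with an extra join at report time.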

Q) Galaxy Schema

This schema is viewed as a collection of stars, hence it is called a Galaxy Schema
or Fact Constellation Schema. In a Galaxy schema a single dimension table is shared
by multiple fact tables.
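
A minimal galaxy schema sketch with sqlite3, assuming two hypothetical fact tables (fact_sales and fact_returns) that share one dimension table (dim_product). The names and rows are illustrative only.

```python
import sqlite3

galaxy = sqlite3.connect(":memory:")
galaxy.executescript("""
-- One dimension shared by both fact tables.
CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, product_name TEXT);

CREATE TABLE fact_sales (
    product_key  INTEGER REFERENCES dim_product(product_key),
    sales_amount REAL
);
CREATE TABLE fact_returns (
    product_key   INTEGER REFERENCES dim_product(product_key),
    return_amount REAL
);

INSERT INTO dim_product  VALUES (1, 'Laptop'), (2, 'Phone');
INSERT INTO fact_sales   VALUES (1, 900.0), (1, 950.0), (2, 500.0);
INSERT INTO fact_returns VALUES (1, 900.0);
""")

# Each fact table is summed separately per product, so rows from one fact
# table are never multiplied by rows from the other.
galaxy_rows = galaxy.execute("""
    SELECT p.product_name,
           (SELECT COALESCE(SUM(s.sales_amount), 0)
              FROM fact_sales s WHERE s.product_key = p.product_key),
           (SELECT COALESCE(SUM(r.return_amount), 0)
              FROM fact_returns r WHERE r.product_key = p.product_key)
    FROM dim_product p
    ORDER BY p.product_name
""").fetchall()
print(galaxy_rows)
```

The shared dimension is what lets sales and returns be compared product by product in one report.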

Q) What is Dimension?

A dimension is descriptive data which describes the key performance indicators
known as facts. E.g. Product, Customer Name, Date etc.

Q) Types of Dimensions

We have 9 types of Dimensions.

Q) What is a Fact?

A fact is something that is measurable or quantifiable. A fact is the metric that
business users would use for making business decisions.

Q) Types of Facts

Factless Fact

Detailed Fact

Additive Fact

Q) What is Granularity?

Granularity refers to the level of detail of the data stored in the tables of a data
warehouse. High granularity refers to data that is at or near the transaction level.
Data that is at the transaction level is usually referred to as atomic level data.
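
The difference in grain can be shown with a short Python sketch. The sample transactions and the roll_up_to_day helper are hypothetical; the point is only that the same data can be kept at transaction level (atomic) or summarized to one row per day and product.

```python
from collections import defaultdict

# Atomic level data: one row per transaction (timestamp, product, amount).
atomic_rows = [
    ("2024-01-01 09:15", "Laptop", 900.0),
    ("2024-01-01 14:40", "Laptop", 950.0),
    ("2024-01-01 16:05", "Phone", 500.0),
    ("2024-01-02 10:30", "Phone", 520.0),
]


def roll_up_to_day(rows):
    """Summarize transaction-level rows to a daily grain per product."""
    daily = defaultdict(float)
    for timestamp, product, amount in rows:
        day = timestamp.split(" ")[0]  # keep only the date part
        daily[(day, product)] += amount
    return dict(daily)


daily_rows = roll_up_to_day(atomic_rows)
print(len(atomic_rows), "atomic rows ->", len(daily_rows), "daily rows")
```

The daily table is smaller, but the time of each individual transaction can no longer be recovered from it, which is why the grain has to be chosen before the fact table is designed.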

Q) What is ETL?

ETL stands for Extraction, Transformation and Loading.

Q) What is Data Acquisition?

Data Acquisition is the process of Extracting, Transforming and Loading the data.
Data Acquisition consists of the following processes:
Data Extraction (E)
Data Transformation (T)
Data Loading (L)
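
The three processes can be sketched as three small Python functions. This is a toy illustration, not how any particular ETL tool works: the CSV text stands in for a transactional source, sqlite3 in memory stands in for the target, and the column names and cleaning rules are assumptions.

```python
import csv
import io
import sqlite3

# Hypothetical source extract with inconsistent spacing and letter case.
SOURCE_CSV = "customer,country,amount\n asha ,in,100.50\nRavi,IN,200\n bob,us,50\n"


def extract(text):
    """E: read raw rows from the source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """T: make the rows consistent (trim, normalize case, cast the amount)."""
    return [
        (r["customer"].strip().title(), r["country"].strip().upper(), float(r["amount"]))
        for r in rows
    ]


def load(conn, rows):
    """L: write the cleaned rows into the target table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (customer TEXT, country TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


etl_conn = sqlite3.connect(":memory:")
load(etl_conn, transform(extract(SOURCE_CSV)))
loaded = etl_conn.execute(
    "SELECT customer, country, amount FROM sales ORDER BY customer"
).fetchall()
print(loaded)
```

Keeping the three steps as separate functions mirrors the E, T and L stages, so each one can be changed or tested on its own.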

Q) What is the need for ETL software?

It is used to integrate the different data sources in one place.

Q) What is a Staging Area?

The Data Staging Area is a database that serves as an intermediate storage area
between the transactional sources and the Data Warehouse / Data Mart. It is an
integrated view of multiple transactional sources.

Q) What is an ODS (Operational Data Store)?

An ODS is a database designed to do immediate reporting with current operational
data. An ODS must be frequently refreshed so that it contains very current data. An
ODS can be updated daily, hourly or even immediately after transactions on
operational sources. It is used for real time and near real time reporting.

Q) What is a Data Mart?

A Data Mart is a subject oriented database which supports the business needs of
the business managers of a specific department.
A Data Mart is a subset of an Enterprise Data Warehouse.
A Data Mart is a single subject view, and the integration of multiple subject views
is called an Enterprise Data Warehouse.

There are two approaches to designing them:

1. Top to Bottom

First we design the Data Warehouse, and then on top of the Data Warehouse we
design the Data Marts.

2. Bottom to Top

First we design the Data Marts, and then on top of the Data Marts we design the
Data Warehouse.
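
As a rough sketch of the top to bottom approach, the snippet below builds a tiny warehouse table first and then defines a Data Mart on top of it as a SQL view restricted to one subject. The table name warehouse_facts, the view name sales_mart and the rows are all hypothetical. A real Data Mart is often a separate physical database rather than a view.

```python
import sqlite3

dw = sqlite3.connect(":memory:")
dw.executescript("""
-- Enterprise Data Warehouse: several subjects in one place (illustrative).
CREATE TABLE warehouse_facts (subject TEXT, department TEXT, metric TEXT, value REAL);
INSERT INTO warehouse_facts VALUES
    ('Sales',   'Retail', 'revenue',   1200.0),
    ('Sales',   'Online', 'revenue',    800.0),
    ('HR',      'Retail', 'headcount',   15.0),
    ('Finance', 'Retail', 'expenses',   400.0);

-- Data Mart: a single subject view defined on top of the warehouse.
CREATE VIEW sales_mart AS
    SELECT department, metric, value
    FROM warehouse_facts
    WHERE subject = 'Sales';
""")

mart_rows = dw.execute(
    "SELECT department, value FROM sales_mart ORDER BY department"
).fetchall()
print(mart_rows)
```

The Sales managers only see the Sales subject, while the warehouse underneath still holds every subject.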
