DWH Question Bank

The document is a question bank on data warehousing. It contains short two-mark questions and longer Part B and C questions covering concepts such as OLTP vs OLAP systems, the ETL process, data cubes, dimensions, star and snowflake schemas, OLAP, data mining, metadata, and data warehouse design, followed by a set of important questions with answers.

Uploaded by

bala swaroop

Data Warehouse

Question Bank
Two-Mark Questions

1 Why do many enterprises need a data warehouse?

2 What are OLTP and OLAP database systems?

3 What is an ODS and what is it used for?

5 List the major steps involved in the ETL process.

6 What is the need for a separate database for decision makers?

7 What is a data warehouse and how might it be defined?

8 What are the likely benefits of building an enterprise data warehouse?

9 List some differences between an OLTP system and a data warehouse system.

11 What is an OLTP database system?

12 What types of queries do managers need to pose to the enterprise’s database systems?

13 What is an ODS used for? How does it differ from an OLTP system?

14 Give the three most important guidelines for implementing a data warehouse for a large
enterprise.

15 Give two major components of any data warehouse system.

16 What is ETL?

17 Give two reasons why dirty data is extracted from source systems.

18 List four steps of the ETL process.

19 Define the terms star schema and snowflake schema.

20 Give a simple data cube implementation.

21 Are all data cube entries non-zero? If not, why not?

22 What are the major differences between OLTP and a data warehouse system?

23 What are the differences between roll-up and pivot?

24 What types of metadata are maintained in a data warehouse?

25 What are dimensions, members, measures, and a fact table?

26 What is OLAP?

27 List the characteristics of OLAP systems.

28 List some of the motivations for using OLAP.

29 Explain multidimensional view and a data cube.

30 What are the different implementations of a data cube?

31 What are the differences between ROLAP and MOLAP?

33 List some guidelines for implementing OLAP.

34 What OLAP software is available in the market?

35 List four types of aggregate queries that are possible with two variables.

36 What are dimensions?

37 What is a measure?

38 What are a fact and a fact table?

39 Give a simple definition of OLAP.

40 Define data cube in your own words.

41 Show what a two-dimensional data cube looks like.

PART B and C

1. Explain why ETL must deal with dirty data when extracting information from the source
systems.

2. List two major characteristics of OLAP.

3. Describe the operations roll-up, drill-down, slice and dice, and pivot.

4. Suppose that you are in the market to purchase a data mining system.

(a) Regarding the coupling of a data mining system with a database and/or data warehouse
system, what are the differences between no coupling, loose coupling, semitight coupling,
and tight coupling?

(b) What is the difference between row scalability and column scalability?

5. Discuss the major differences between the star schema and the snowflake schema.

6. Data warehousing is the only viable means to resolve the information crisis and to provide
strategic information. List four reasons to support this assertion and explain them.

7. Explain the star schema technique of modelling a data warehouse.

8. Why is it important to store multiple types of data in the data warehouse? Give examples of
some nonstructured data likely to be found in the data warehouse of a health management
organization (HMO).

9. Describe the features of a data warehouse.

10. Discuss the major design issues that need to be addressed before proceeding with the data
design.

11. Discuss the differences between OLTP and data warehouse systems, with an example.

12. You are the data design specialist on the data warehouse project team for a manufacturing
company. Design a STAR schema to track the production quantities. Production quantities
are normally analyzed along the business dimensions of product, time, parts used, production
facility, and production run. State your assumptions.

13. Discuss the three major types of metadata in a data warehouse. Briefly mention the
purpose of each type.

14. Discuss any six different methods for information delivery.

15. Explain data smoothing and how it is applicable to the data warehouse.

16. Explain data transformation and how it is applicable to the data warehouse.

17. Discuss the various types of OLTP, with an example.

18. You are the data warehouse architect for a leading national department store chain. The data
warehouse has been up and running for nearly a year. Now the management has decided to
provide the power users with OLAP facilities. How will you alter the information delivery
component of the data warehouse architecture? Make realistic assumptions and proceed.

19. Describe the operations on data cubes with a suitable example.

20. The current trends in hardware/software technology make data warehousing feasible.
Explain, with some examples, exactly how these technology trends help.

21. Describe the operations of OLTP with a suitable example.

22. Describe the composition of the primary keys for the dimension and fact tables with a
suitable example.

23. You are the data analyst on the project team building a data warehouse for an insurance
company. List the possible data sources from which you will bring the data into your data
warehouse. State your assumptions.

24. Suppose that you are in the market to purchase a data mining system. Discuss how
different forms of data mining can be used in the application.

25. Describe the types of metadata that are maintained in a data warehouse.

26. You are a Senior Analyst in the IT department of a company manufacturing automobile
parts. The marketing VP is complaining about the poor response by IT in providing strategic
information. Draft a proposal to him explaining the reasons for the problems and why a data
warehouse would be the only viable solution.

27. Explain the motivating challenges in the development of data mining.

28. Explain the data mining tasks with an example.

29. Describe the main theoretical foundations that have been proposed for data mining.
Comment on how they each satisfy (or fail to satisfy) the requirements of an ideal theoretical
framework for data mining.

30. Describe the top-down and bottom-up approaches for building a data warehouse and
discuss the merits and disadvantages of each approach.

31. In a STAR schema to track the shipments for a distribution company, the following
dimension tables are found: (1) time, (2) customer ship-to, (3) ship-from, (4) product, (5) type
of deal, and (6) mode of shipment. Review these dimensions and list the possible attributes
for each of the dimension tables. Also, designate a primary key for each table.

32. Explain the various tasks of data mining with an example for each.

33. Explain the various steps of data pre-processing in a data warehouse.

34. Explain the various steps of dimension reduction in a data warehouse with a suitable example.

35. Suppose a group of 12 sales price records has been sorted as follows:

5, 10, 11, 13, 15, 35, 50, 55, 72, 92, 204, 215
Partition them into three bins by each of the following methods:
(a) equal-frequency (equidepth) partitioning
(b) equal-width partitioning
(c) clustering
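
For methods (a) and (b), the partitioning can be sketched in a few lines of Python. This is only an illustrative sketch: the function names `equal_frequency_bins` and `equal_width_bins` are hypothetical, the equal-frequency version assumes the number of records divides evenly by the number of bins, and the equal-width version assumes the values are not all identical. Clustering (c) depends on the chosen algorithm and is not covered here.

```python
def equal_frequency_bins(values, k):
    """Split the sorted values into k bins holding the same number of items."""
    data = sorted(values)
    size = len(data) // k  # assumption: len(data) is divisible by k
    return [data[i * size:(i + 1) * size] for i in range(k)]


def equal_width_bins(values, k):
    """Split the values into k bins that each cover the same value range."""
    data = sorted(values)
    low, high = data[0], data[-1]
    width = (high - low) / k  # assumption: high > low
    bins = [[] for _ in range(k)]
    for v in data:
        # The maximum value sits on the upper edge, so clamp it into the last bin.
        index = min(int((v - low) // width), k - 1)
        bins[index].append(v)
    return bins


prices = [5, 10, 11, 13, 15, 35, 50, 55, 72, 92, 204, 215]
print(equal_frequency_bins(prices, 3))
print(equal_width_bins(prices, 3))
```

With three bins, equal-frequency partitioning puts four records in each bin, while equal-width partitioning uses a width of (215 - 5) / 3 = 70, so most records fall into the first bin.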

36. Describe four types of charts you are likely to see in the delivery of information from a
data mart supporting the finance department.

37. Describe with examples snapshot and transaction fact tables. How are they related?

38. Describe aggregate fact tables. Why are they needed? Give an example.

39. Explain the multidimensional view and a data cube with a suitable example.

40. Discuss how different forms of data mining can be used in the application. Explain with an
example.

IMPORTANT Questions and Answers

1. What is a data warehouse?

A data warehouse is a repository of multiple heterogeneous data sources organized under a
unified schema at a single site to facilitate management decision making. (Or) A data
warehouse is a subject-oriented, integrated, time-variant and non-volatile collection of data in
support of management's decision-making process.

2. What is data warehouse metadata?

Metadata are data about data. When used in a data warehouse, metadata are the data that
define warehouse objects. Metadata are created for the data names and definitions of the
given warehouse. Additional metadata are created and captured for time stamping any
extracted data, the source of the extracted data, and missing fields that have been added by
data cleaning or integration processes.

3. List the characteristics of a data warehouse.

There are four key characteristics which separate the data warehouse from other major
operational systems:

1. Subject Orientation: Data organized by subject

2. Integration: Consistency of defining parameters

3. Non-volatility: Stable data storage medium

4. Time-variance: Timeliness of data and access terms

4. What are the various sources for a data warehouse?

Handling of relational and complex types of data: Because relational databases and data
warehouses are widely used, the development of efficient and effective data mining systems
for such data is important.

Mining information from heterogeneous databases and global information systems:

Local- and wide-area computer networks (such as the Internet) connect many sources of
data, forming huge, distributed, and heterogeneous databases.

5. Compare OLTP and OLAP systems.

An on-line operational database system that is used for efficient retrieval, storage, and
management of large amounts of data is called an on-line transaction processing (OLTP)
system. Data warehouse systems serve users (or knowledge workers) in the role of data
analysis and decision making. Such systems can organize and present data in various
formats and are known as on-line analytical processing (OLAP) systems.

6. Differentiate fact table and dimension table.

A fact table contains the names of the facts (or measures) as well as keys to each of the
related dimension tables.

A dimension table is used for describing a dimension. For example, a dimension table
for item may contain the attributes item_name, brand and type.
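
The relationship between the two kinds of table can be sketched with Python's built-in `sqlite3` module. The table names (`item_dim`, `sales_fact`), the measures (`units_sold`, `dollars_sold`) and the sample rows are assumptions made up for illustration; only the item attributes item_name, brand and type come from the answer above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension table: descriptive attributes of each item.
CREATE TABLE item_dim (
    item_key  INTEGER PRIMARY KEY,
    item_name TEXT,
    brand     TEXT,
    type      TEXT
);
-- Fact table: numeric measures plus a key into the dimension table.
CREATE TABLE sales_fact (
    item_key     INTEGER REFERENCES item_dim(item_key),
    units_sold   INTEGER,
    dollars_sold REAL
);
""")
conn.executemany("INSERT INTO item_dim VALUES (?, ?, ?, ?)", [
    (1, "Laptop A", "BrandX", "computer"),  # hypothetical sample data
    (2, "Phone B", "BrandY", "phone"),
])
conn.executemany("INSERT INTO sales_fact VALUES (?, ?, ?)", [
    (1, 3, 3000.0), (1, 2, 2000.0), (2, 5, 2500.0),
])

# Analysis joins the facts to the dimension and aggregates the measures.
rows = conn.execute("""
    SELECT d.brand, SUM(f.units_sold), SUM(f.dollars_sold)
    FROM sales_fact AS f
    JOIN item_dim AS d ON d.item_key = f.item_key
    GROUP BY d.brand
    ORDER BY d.brand
""").fetchall()
print(rows)
```

The fact table stores only keys and measures, so descriptive attributes such as brand are looked up through the join rather than repeated in every fact row.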

7. Explain the differences between star and snowflake schema.

In a star schema, each dimension is represented by a single dimension table. In the
snowflake schema model, the dimension tables may be kept in normalized form to
reduce redundancies. Such tables are easy to maintain and save storage space.

8. In the context of data warehousing what is data transformation?

In data transformation, the data are transformed or consolidated into forms appropriate for
mining. Data transformation can involve the following:

Smoothing, Aggregation, Generalization, Normalization, Attribute construction
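
Two of these transformations, normalization and smoothing, can be sketched as follows. This is a minimal illustration, not a prescribed method: min-max scaling and smoothing by bin means are just one common choice for each, the function names are hypothetical, and `min_max_normalize` assumes the values are not all identical.

```python
def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Rescale values linearly so they fall in the range [new_min, new_max]."""
    low, high = min(values), max(values)
    span = high - low  # assumption: span is not zero
    return [new_min + (v - low) / span * (new_max - new_min) for v in values]


def smooth_by_bin_means(values, bin_size):
    """Sort the values, group them into bins, and replace each value by its bin mean."""
    data = sorted(values)
    smoothed = []
    for start in range(0, len(data), bin_size):
        chunk = data[start:start + bin_size]
        mean = sum(chunk) / len(chunk)
        smoothed.extend([mean] * len(chunk))
    return smoothed


print(min_max_normalize([10, 20, 30]))
print(smooth_by_bin_means([4, 8, 15, 21, 21, 24], bin_size=3))
```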

9. Define the slice and dice operations.

The slice operation performs a selection on one dimension of the cube, resulting in
a sub cube. The dice operation defines a sub cube by performing a selection on two (or) more
dimensions.
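
The two operations can be sketched on a tiny cube stored as a Python dictionary. The dimension names (time, location, item), the sample sales figures, and the helper names `select`, `slice_cube` and `dice_cube` are all assumptions chosen for illustration.

```python
DIMENSIONS = ("time", "location", "item")

# Each cell maps (time, location, item) coordinates to a sales measure.
cube = {
    ("Q1", "Chennai", "phone"): 120,
    ("Q1", "Chennai", "laptop"): 80,
    ("Q1", "Delhi", "phone"): 95,
    ("Q2", "Chennai", "phone"): 130,
    ("Q2", "Delhi", "laptop"): 60,
    ("Q2", "Delhi", "phone"): 110,
}


def select(cube, **criteria):
    """Keep the cells whose coordinate on each named dimension is in the allowed set."""
    result = {}
    for coords, measure in cube.items():
        cell = dict(zip(DIMENSIONS, coords))
        if all(cell[dim] in allowed for dim, allowed in criteria.items()):
            result[coords] = measure
    return result


def slice_cube(cube, dimension, value):
    """Slice: select a single value on exactly one dimension."""
    return select(cube, **{dimension: {value}})


def dice_cube(cube, **criteria):
    """Dice: select on two or more dimensions at once."""
    if len(criteria) < 2:
        raise ValueError("dice needs selections on at least two dimensions")
    return select(cube, **criteria)


print(slice_cube(cube, "time", "Q1"))
print(dice_cube(cube, location={"Delhi"}, item={"phone"}))
```

Both operations return a sub cube; the only difference in this sketch is how many dimensions the selection constrains.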

10. Briefly discuss the schemas for multidimensional databases.

Star schema: The most common modeling paradigm is the star schema, in which the data
warehouse contains (1) a large central table (fact table) containing the bulk of the data, with
no redundancy, and (2) a set of smaller attendant tables (dimension tables), one for each
dimension.

Snowflake schema: The snowflake schema is a variant of the star schema model, where
some dimension tables are normalized, thereby further splitting the data into additional tables.
The resulting schema graph forms a shape similar to a snowflake.

Fact constellations: Sophisticated applications may require multiple fact tables to share
dimension tables. This kind of schema can be viewed as a collection of stars, and hence is
called a galaxy schema or a fact constellation.

11. How is a data warehouse different from a database? How are they similar?

A data warehouse is a repository of multiple heterogeneous data sources, organized under a
unified schema at a single site in order to facilitate management decision-making. A
relational database is a collection of tables, each of which is assigned a unique name. Each
table consists of a set of attributes (columns or fields) and usually stores a large set of
tuples (records or rows). Each tuple in a relational table represents an object identified by a
unique key and described by a set of attribute values. Both are used to store and manipulate
data.

12. List out the functions of OLAP servers in the data warehouse architecture.

The OLAP server performs multidimensional queries of data and stores the results in its
multidimensional storage. It speeds the analysis of fact tables into cubes, stores the cubes
until needed, and then quickly returns the data to clients.
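
The idea of precomputing cubes so that later queries become lookups can be sketched as below. This is a simplified illustration rather than how any particular OLAP product works: the fact rows, the dimension names and the function `build_cuboids` are assumptions, and a real server would not necessarily materialize every combination of dimensions.

```python
from collections import defaultdict
from itertools import combinations

DIMENSIONS = ("time", "location", "item")

# Hypothetical fact rows: one sales measure per (time, location, item).
facts = [
    {"time": "Q1", "location": "Chennai", "item": "phone", "sales": 120},
    {"time": "Q1", "location": "Delhi", "item": "phone", "sales": 95},
    {"time": "Q2", "location": "Chennai", "item": "phone", "sales": 130},
    {"time": "Q2", "location": "Delhi", "item": "laptop", "sales": 60},
]


def build_cuboids(facts, dimensions, measure):
    """Precompute the summed measure for every subset of the dimensions."""
    cuboids = {}
    for size in range(len(dimensions) + 1):
        for group in combinations(dimensions, size):
            totals = defaultdict(int)
            for row in facts:
                key = tuple(row[d] for d in group)
                totals[key] += row[measure]
            cuboids[group] = dict(totals)
    return cuboids


# Built once and stored; later queries are dictionary lookups.
cuboids = build_cuboids(facts, DIMENSIONS, "sales")
print(cuboids[()][()])             # grand total over all dimensions
print(cuboids[("location",)])      # sales rolled up to location
print(cuboids[("time", "item")])   # sales by time and item
```

With three dimensions there are eight such group-by combinations, from the fully detailed cuboid down to the single grand total.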

Unit I – Part B QUESTIONS

1. Write in detail about the architecture and implementation of the data warehouse. (OR)
Diagrammatically illustrate and discuss the three-tier data warehousing architecture. (OR)
With a detailed diagram, describe the general architecture of a data warehouse. (OR)
Describe the data warehouse architecture with a neat diagram.

Business Analysis Framework

Three-Tier Data Warehouse Architecture

Data Warehouse Models

• Virtual Warehouse

• Data mart

• Enterprise Warehouse

Load Manager, Warehouse Manager, Query Manager

2. List and discuss the major features of a data warehouse.

Some data is denormalized for simplification and to improve performance. Large amounts of
historical data are used.

Queries often retrieve large amounts of data. Both planned and ad hoc queries are common.

The data load is controlled.

3. Discuss the various types of warehouse schema with suitable example. (Nov/Dec’09) (OR)

What do you understand about database schemas? Explain. (Nov/Dec 2011)

Star Schema

Snowflake Schema

Fact Constellation Schema

4. Explain the types of OLAP server in detail.

• Relational OLAP (ROLAP)

• Multidimensional OLAP (MOLAP)

• Hybrid OLAP (HOLAP)

• Specialized SQL Servers

5. Enumerate the building blocks of a data warehouse. Explain the importance of metadata in
a data warehouse environment. What are the challenges in metadata management?

Review formal definitions of a data warehouse

Discuss the defining features

Distinguish between data warehouses and data marts

Review the evolved architectural types

Study each component or building block that makes up a data warehouse

Introduce metadata and highlight its significance
