0% found this document useful (0 votes)
54 views3 pages

Time Allowed: Three Hours 5 January 2018, 9AM-12PM: Instructions To Candidates

This document contains instructions and questions for an exam on data warehousing and mining. It is divided into 5 questions. Question 1 asks about data warehouses, slowly changing dimensions, and comparing databases and data warehouses. Question 2 covers the ETL process and data marts. Question 3 involves drawing a star schema based on a business scenario. Question 4 includes short notes on fact table granularity, junk dimensions, OLAP types, and snowflake vs star schemas. Question 5 covers data mining concepts like processes, importance, algorithms.

Uploaded by

muthu rangi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
54 views3 pages

Time Allowed: Three Hours 5 January 2018, 9AM-12PM: Instructions To Candidates

This document contains instructions and questions for an exam on data warehousing and mining. It is divided into 5 questions. Question 1 asks about data warehouses, slowly changing dimensions, and comparing databases and data warehouses. Question 2 covers the ETL process and data marts. Question 3 involves drawing a star schema based on a business scenario. Question 4 includes short notes on fact table granularity, junk dimensions, OLAP types, and snowflake vs star schemas. Question 5 covers data mining concepts like processes, importance, algorithms.

Uploaded by

muthu rangi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

NATIONAL INSTITUTE OF BUSINESS MANAGEMENT

Higher Diploma in Computer Based Information Systems – 17.1- HNDCBIS -3-3-09


Higher Diploma in Software Engineering – 17.1- HNDSE -3-3-09
DATA WAREHOUSING AND MINING

Time allowed: Three hours 5th January 2018, 9AM-12PM

INSTRUCTIONS TO CANDIDATES

 This paper contains of 5 questions on 2 pages(Page 2 -Page 3).


 The total marks obtainable for this examination is 100.
 Marks for each question is indicated.
 All questions are mandatory and need to attempt

ADDITIONAL MATERIALS

 None

Page 1 of 3
1.
a) What is a Data Warehouse? (5 Marks)
b) What are the properties of a Data Warehouse? Explain each briefly. (5 Marks)
c) What are the different types of SCD's used in data warehousing? (5 Marks)
d) Compare Database & Data Warehouse. (5 Marks)

2.
a) What is ETL process in Data Warehousing? Explain each step in ETL process briefly.
(5 Marks)
b) What is a Data Mart? Give reasons for designing a Data Mart. (10 Marks)
c) What are the three types of fact tables? Explain them briefly. (5 Marks)

3.
Identify dimensions and fact table according to the below business scenario and draw the
stars schema for solve some of the notable questions given below. Name the attributes of
the each dimensions and fact table.

An electronic manufacturer has recently acquired a competitor to extend its existing product
line. The acquisition has provided them with an established customer base in a new
geographic region that was considered for development. The manufacture plans future
acquisitions to strength the current portfolio and enable growth. While acquisition presents
new opportunities for expansion and growth, competition across existing product, lines and
higher raw material cost have limited their ability to expand. In addition, the manufacture has
encountered several internal challenges that have limited growth.

This manufacture has set aside a budget to construct a sales reporting solution to help solve
many internal reporting challenges. The proposed solution will be used by sales, finance and
manufacturing to answer the most significant queries. Some of the most notable questions to
be answered include:

 Sales analysis:
 What are the sales by quarter, sales representative and geography?
 How are sales trending in industry forecasts?
 How do sales compare in the each province?

Page 2 of 3
 Product Profitability:
 Which product lines are the highest revenue procedures this year?
 Which product and product lines are the most profitable this quarter?
 Which product lines are above seasonal forecasts?

 Sales representative analysis:


 Who are the top five sales representatives by sales volume?
 Who are the most productive sales representatives in divisions, region and
territories?
 Which sales divisions, region and territories generate the highest revenues and
margins?

 Customer analysis:
 Who are the best customers?
 Who are the most profitable customer?
 What percentage of sales is generated from the top five customers?
 Which customer purchase the most products by product line?
 Which industry has experienced the fastest growth over last year?

(20 Marks)
4. Write short notes on followings.

a. What is level of granularity of a fact table? (5 Marks)


b. What is junk dimension? (5 Marks)
c. Which one is faster, Multidimensional OLAP or Relational OLAP? (5 Marks)
d. What is the difference between Snowflake and Star Schema? What are situations
where Snowflake Schema is better than Star Schema when the opposite is true?
(5 Marks)

5.
a. What is Data Mining? (4 Marks)
b. Why is data mining important? You can use proper examples for your explanation.
(8 Marks)
c. Explain the steps of Knowledge Discovery Process with a suitable example.
(4 Marks)
d. Explain clustering and association algorithm in Data mining. (4 Marks)

Page 3 of 3

You might also like