0% found this document useful (0 votes)
580 views2 pages

Cs 1004 - Data Warehousing and Mining

This document contains questions for an exam on data warehousing and mining. It asks students to compare OLTP and OLAP systems, define terms like data warehouse metadata and dimensionality reduction, describe concepts like iceberg queries and classification, discuss techniques such as cluster analysis and web usage mining, and summarize differences between data cube implementations, statistical outlier detection approaches, and types of data usage like analytical processing. Students are asked to provide examples and algorithms for mining association rules, Bayes classifiers, regression, outliers, multidimensional analysis in spatial and multimedia databases, and time series analysis.

Uploaded by

subhaz
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
580 views2 pages

Cs 1004 - Data Warehousing and Mining

This document contains questions for an exam on data warehousing and mining. It asks students to compare OLTP and OLAP systems, define terms like data warehouse metadata and dimensionality reduction, describe concepts like iceberg queries and classification, discuss techniques such as cluster analysis and web usage mining, and summarize differences between data cube implementations, statistical outlier detection approaches, and types of data usage like analytical processing. Students are asked to provide examples and algorithms for mining association rules, Bayes classifiers, regression, outliers, multidimensional analysis in spatial and multimedia databases, and time series analysis.

Uploaded by

subhaz
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

B.E. / B.

Tech DEGREE EXAMINATION, APRIL / MAY 2008

Seventh Semester

Information Technology

CS 1004 – DATA WAREHOUSING AND MINING

(Regulation 2004)

Time: Three hours Maximum: 100 marks

Answer ALL questions.

PART A – (10 x 2 = 20 marks)

1. Compare OLTP and OLAP systems.


2. What is Data Warehouse Metadata?
3. What is Dimensionality Reduction?
4. What is Concept Description?
5. List two interesting measures for association rules.
6. What are Iceberg queries?
7. What is classification?
8. What is cluster analysis?
9. What is Web Usage Mining?
10. What is Visual Data Mining?

PART B – (5 x 16 = 80 marks)

11. (a) Briefly compare the following concepts. Explain your points with an example
(i) Snowflake schema, fact constellation, star net query model [Marks 5]
(ii) Data cleaning, data transformation, refresh [Marks 5]
(iii) Discovery-driven cube, multifeature cube, virtual warehouse [Marks 6]
(b) What are the difference between three main types of data usage: information
processing, analytical processing and data mining? Discuss the motivation behind
OLAP mining. [Marks 16]

12. (a) For class characterization, what are the main differences between a data
cube based
implementation and a relational implementation such as attribute-oriented
induction. Discuss which
method is most efficient and under what condition this is so. [Marks 16]

Or
(b) (i) List and discuss the various data mining primitives. [Marks 8]
(ii) With relevant examples discuss the role of statistics in data mining. [Marks 8]

13. (a) Explain with an algorithm, how to mine single dimensional Boolean
Association Rules from
transactional database. Give relevant example. [Marks 16]
Or
(b) With an algorithm explain constraint-based association mining. Give relevant
example. [Marks 16]

14. (a) What are Bayesian classifiers? Explain in detail about:


(i) Naïve Bayesian classification [Marks 8]
(ii) Linear and multiple regression. [Marks 8]
Or
(b) Why is outline mining important? Briefly describe the different approaches
behind
statistical based outlier detection, distance-based outlier detection and deviation-
based
outlier detection. [Marks 16]

15. (a) (i) What is multidimensional analysis? Discuss the same with an example.
[Marks 6]
(ii) Discuss how data mining is done is spatial databases. [Marks 10]
Or
(b) (i) Discuss data mining in multimedia databases. [Marks 10]
(ii) What is time series analysis? Discuss the same with an example. [Marks 6]

You might also like