
MCS 221

The document is a test paper for the subject MCS-221: Data Warehousing and Data Mining. It contains 5 questions: question 1 has 4 subparts, questions 2 to 4 have 2 subparts each, and question 5 asks for short notes on any four of five topics. The questions cover characteristics of data warehouses, OLAP architectures, data cleaning, decision trees, text mining techniques, mining multimedia data, data lakes, the star schema, metadata, ETL processes, data marts, association rules, the Apriori algorithm, clustering and Bayes' theorem.

No. of Printed Pages : 3    MCS-221

MASTER OF COMPUTER APPLICATIONS (NEW)
(MCA-NEW)
Term-End Examination
December, 2021

MCS-221 : DATA WAREHOUSING AND DATA MINING

Time : 3 hours    Maximum Marks : 100
(Weightage : 70%)

Note : Question no. 1 is compulsory. Answer any three questions from the rest.

1. (a) Define a Data Warehouse. List and explain the four characteristics of a Data Warehouse. 10

(b) Explain the following OLAP architectures and draw their architectural diagrams : 10

(i) Multidimensional Online Analytical Processing (MOLAP)

(ii) Hybrid Online Analytical Processing (HOLAP)

(c) Define “data cleaning”, which is a data preprocessing technique. In this context, explain the concept of noisy data cleaning, along with some suitable examples. 10

(d) What is a Decision Tree ? How is it useful in classification ? With the help of an example, explain the process of construction of a decision tree and its representation. 10

2. (a) What is Text Mining ? Where is it used ? Explain any two text mining techniques with the help of a suitable example for each. 10

(b) In the context of mining multimedia data on the web, explain the following terms : 10

(i) Page Rank

(ii) HITS

(iii) Page Layout Analysis

(iv) Vision Page Segmentation

3. (a) Define a Data Lake. Explain the step-by-step process of creating a data lake. 10

(b) With the help of an example, explain the star schema dimensional model. 10

4. (a) What is Metadata ? What are its contents ? Justify how metadata can be an important component in Data Warehousing. Also mention its types. 10

(b) Explain the three components of ETL. Also mention how to improve the ETL performance. 10

5. Write short notes on any four of the following : 4×5=20

(a) Data Marts

(b) Association Rule Generation

(c) Apriori Algorithm

(d) Clustering and its Methods

(e) Bayes’ Theorem
