The document is a test paper for the subject MCS-221: Data Warehousing and Data Mining. It contains 5 questions, with question 1 having 4 subparts and the other questions having 2 subparts each. The questions cover topics like characteristics of data warehouses, OLAP architectures, data cleaning, decision trees, text mining techniques, mining multimedia data, data lakes, star schema, metadata, ETL processes and concepts like data marts, association rules, clustering and Bayes' theorem.
MCS 221
No. of Printed Pages : 3    MCS-221
MASTER OF COMPUTER APPLICATIONS (NEW)
(MCA-NEW)
Term-End Examination
December, 2021
MCS-221 : DATA WAREHOUSING AND DATA MINING
Time : 3 hours Maximum Marks : 100
(Weightage : 70%)
Note : Question no. 1 is compulsory. Answer any
three questions from the rest.
1. (a) Define a Data Warehouse. List and explain
the four characteristics of a Data Warehouse. 10
(b) Explain the following OLAP architectures
and draw their architectural diagram : 10
(i) Multidimensional Online Analytical
Processing (MOLAP)
(ii) Hybrid Online Analytical Processing
(HOLAP)
(c) Define “data cleaning”, which is a data preprocessing technique. In this context, explain the concept of Noisy data cleaning along with some suitable examples. 10
(d) What is a Decision Tree ? How is it useful in
classification ? With the help of an example, explain the process of construction of a decision tree and its representation. 10
2. (a) What is Text Mining ? Where is it used ?
Explain any two text mining techniques with the help of a suitable example for each. 10
(b) In the context of mining multimedia data on
the web, explain the following terms : 10
(i) Page Rank
(ii) HITS
(iii) Page Layout Analysis
(iv) Vision Page Segmentation
3. (a) Define a Data Lake. Explain the
step-by-step process of creating a data lake. 10
(b) With the help of an example, explain the
star schema dimensional model. 10
4. (a) What is Metadata ? What are its contents ? Justify how metadata can be an important component in Data Warehousing ? Also mention its types. 10