DS Midsem
Instructions:
1. All questions are compulsory.
2. Figures to the right indicate full marks.
3. Assume suitable data wherever required.
Q1) a) Explain the data analytics life cycle with the help of a diagram. [10]
b) List the different phases in the data analytics life cycle and explain the
Model Building phase in detail. [8]
OR
Q2) a) What are the different phases in the data analytics life cycle? Explain
the Operationalize phase in detail. [10]
b) Explain the Model Building phase and its challenges. [8]
Q5) a) What is clustering? With a suitable example, explain the steps involved
in the k-means algorithm. [7]
b) Discuss the Holdout method and the Random Sampling method. [6]
c) Write short notes on: [4]
i) Confusion matrix
ii) AUC-ROC curve
OR
Q6) a) What do you mean by text analysis? Why does text analysis need to be
done? Explain the following text analysis steps with suitable examples: [11]
i) Part of speech (POS) tagging
ii) Lemmatization
iii) Stemming
b) Write short notes on: [6]
i) Time series analysis
ii) TF-IDF
Q7) a) What is data visualization? What are the different methods of data
visualization? Explain in detail. [6]
b) Explain the Hadoop ecosystem in detail with a suitable diagram. [11]
OR
Q8) a) Describe the data visualization tool “Tableau”. Explain its applications
in brief. [6]
b) With a suitable example, explain and draw a box plot and describe its
uses. [6]
c) With a suitable example, explain a histogram and describe its uses. [5]