Bda Simp-23
Bda Simp-23
Disclaimer: These questions are prepared by the TIE review team teachers/mentors by
referring to various question banks and Internal question papers from more than 10
colleges. The sole purpose of this is to give a thorough idea about the Questions in the
final assessment paper(sem-end exams).
Module-1
1. Explain why Big Data is Needed in the Modern World , Mention its types? Explain
its evolution and also Mention its characteristics(4V’s)
2. With a neat Sketch, Discuss the five layers in Big Data Architecture Design
3. Discuss various case studies and applications of Big data
4. Differentiate between the following - 5M each
(i) Distributed computing v/s Grid computing v/s Cluster computing
(ii)Horizontal Scalability vs Vertical Scalability
(iii)Structured v/s Unstructured v/s Semi Structured Data
5. Mention any 6 techniques used for Data Preprocessing, also mention the
advantages of BDAS by understanding its future scope in the field of Big Data
6. Define:(i)Hadoop (ii) Mesos (iii)SQL and NoSQL(with features) (iv)DDBMS
(v)In-memory column and row format data
Module-2
Module-3
Module-5
1. Explain in detail web content mining and diff phases for web usage mining
2. Difference between (i)Linear and non-linear relationship (ii)Standard deviation
and standard error
3. Explain apriori algorithm for frequent itemsets and association rule mining
4. Explain Social Network as graph and its analytics
5. Describe the regression analysis using linear and non linear models, explain KNN in
detail
6. Explain the following (i) Probability Distributions, and Correlations (ii)Page rank
(iii)Web Usage Analytics
Module-4
1. Describe the MapReduce execution steps when a client submits a job with neat
diagram
2. What is Hive in Big data? List the features of Hive? Also, explain Hive architecture
with relevant diagrams
3. Write a short note on Pig Data Model(pig architecture) along with its features,
also list out commands used in Pig Data Model by explaining its Data types -12M
4. What are MapReduce Tasks?Explain with examples
5. Write a note on HiveQL, and its Queries.
6. Explain how hive interacts with Hadoop(VBQ)