BDAV Question Bank

Bank question

Uploaded by

pratikshukla1107
Copyright
© All Rights Reserved

BDAV Question Bank:

Unit 1, 2 & 3
1. Explain the 5 V's of Big Data.
2. Applications of Big Data.
3. Differentiate between traditional data and Big Data.
4. Explain the types of Big Data and give examples.
5. What are the differences between the NameNode and the Standby NameNode?
6. Draw and explain the Secondary NameNode and the checkpointing mechanism.
7. What is Rack Awareness in Hadoop? Define the Rack Awareness policies.
8. Explain speculative execution. How can a MapReduce job be optimized using speculative execution?
9. Describe the MapReduce algorithm for matrix and vector multiplication.
10. What is shuffling and sorting in MapReduce?
11. What is InputFormat? Write the differences between an HDFS Block and an InputSplit.
12. Illustrate the main components of the Hadoop system.
13. What is the MapReduce Partitioner? What is the need for a Partitioner? How many partitioners are there in Hadoop?
14. Explain in detail the shared-nothing architecture.
15. Explain computing selection and projection by MapReduce.
16. Explain computing grouping and aggregation by MapReduce.
17. Write a short note on sorting and natural joins.
18. Explain the Hadoop ecosystem with its core components. Explain its architecture.
19. What are the different frameworks that run under YARN? Discuss the various YARN daemons.
20. What is the MapReduce Combiner? Write the advantages and disadvantages of the MapReduce Combiner.
21. What is the role of the RecordReader in Hadoop?
22. Write a short note on master-slave vs. peer-to-peer.
23. Write a short note on the mapper task and the reducer task.
24. List and explain the types of NoSQL databases with examples.
25. What is the CAP theorem?
26. Explain the BASE properties of NoSQL databases.
27. Discuss the different architecture patterns of NoSQL.
28. Explain the master-slave and peer-to-peer distribution models with the help of a diagram.
29. What are the various applications of NoSQL in industry?
30. What are the benefits of HBase over other NoSQL databases?
31. Draw and explain the HBase architecture: read and write mechanism.
32. Draw and explain the HBase architecture: compaction and region split.
33. Write a short note on the Region Server.
34. What are the major components of the HBase data model? Explain each one in brief.
35. What important roles do the Region Server and ZooKeeper play in the HBase architecture?
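
Study aid for the matrix and vector multiplication question above: a minimal sketch in plain Python that imitates the map, shuffle, and reduce phases in a single process. It is not Hadoop code; the function names (`map_phase`, `shuffle`, `reduce_phase`, `matvec`) and the sample data are hypothetical, and the vector is assumed to be small enough to be available to every mapper.

```python
from collections import defaultdict


def map_phase(matrix_entries, vector):
    """Map: for each matrix entry (i, j, m_ij), emit the pair (i, m_ij * v_j)."""
    for i, j, m_ij in matrix_entries:
        yield i, m_ij * vector[j]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key (the row index)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: sum the partial products of each row to get one output element."""
    return {row: sum(values) for row, values in groups.items()}


def matvec(matrix_entries, vector):
    """Run the three phases in order and return {row index: result element}."""
    return reduce_phase(shuffle(map_phase(matrix_entries, vector)))


# Hypothetical 2x2 matrix [[1, 2], [3, 4]] stored as (row, column, value) triples.
entries = [(0, 0, 1), (0, 1, 2), (1, 0, 3), (1, 1, 4)]
vector = [5, 6]
result = matvec(entries, vector)
print(result)
```

Keying the map output by row index is what lets each reducer compute one element of the result independently of the others.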
Unit 4
1. Write five built-in Pig functions.
2. Explain the main components and the working of Apache Pig. Discuss the Load() and Store() commands in the Pig framework.
3. What is the significance of Apache Pig in the Hadoop context? Explain the main components and working of Apache Pig with the help of a diagram.
4. Explain the architecture of Apache Hive in the Hadoop ecosystem. List the Hive built-in functions.
5. What is Hive? Explain its architecture.
6. Write a short note on the warehouse directory and the metastore.
7. What is the Hive Query Language? Explain the built-in functions in Hive.
8. How is data sorted and aggregated in HiveQL?
9. What is Pig? Explain its architecture in detail.

Unit 5
1. What is an RDD, and how is data partitioned in an RDD?
2. What is Apache Kafka? Explain its architecture in detail with a proper diagram.
3. Explain Apache Spark with a suitable diagram. Explain aggregating data with pair RDDs.
4. What are the advantages of Apache Spark over MapReduce?
5. Explain Bulk Synchronous Parallel (BSP) processing and graph processing with respect to Apache Spark.
6. Discuss the Apache Kafka fundamentals. Explain the Kafka cluster architecture with a suitable diagram.
7. What is Apache Kafka?
8. What are the benefits of Kafka?
9. Explain each component of Kafka.
10. Briefly explain the publish-subscribe messaging system.
11. Explain the cluster architecture.
12. What is the role of ZooKeeper?
13. Explain the cluster architecture components.
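
Study aid for the pair RDD aggregation question above: a minimal sketch in plain Python that emulates the idea behind a reduce-by-key aggregation (in PySpark, the corresponding call is `rdd.reduceByKey(func)`). It does not use Spark; the function `reduce_by_key`, the list-of-lists stand-in for partitions, and the sample data are hypothetical.

```python
def reduce_by_key(partitions, func):
    """Aggregate (key, value) pairs per key with an associative function."""
    # Stage 1: combine values locally inside each partition,
    # so less data would need to move between nodes.
    partials = []
    for partition in partitions:
        local = {}
        for key, value in partition:
            local[key] = func(local[key], value) if key in local else value
        partials.append(local)

    # Stage 2: merge the per-partition results into the final value per key.
    merged = {}
    for local in partials:
        for key, value in local.items():
            merged[key] = func(merged[key], value) if key in merged else value
    return merged


# Hypothetical word-count style data split across two partitions.
partitions = [
    [("spark", 1), ("kafka", 1), ("spark", 1)],
    [("kafka", 1), ("hive", 1)],
]
counts = reduce_by_key(partitions, lambda a, b: a + b)
print(counts)
```

The two-stage structure only gives correct results when the function is associative and commutative, such as addition or maximum.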

Unit 6
1. Write a short note on visualization tools.
2. Explain data visualization. Describe the use of dashboards in Big Data visualization.
3. Write a short note on “D3 and Big Data”.
4. Explain the main features of Tableau.
