Sapthagiri College of Engineering: Department of Information Science and Engineering Big Data Analytics Question Bank
Module 2
1. Explain Apache Pig along with commands.
2. Explain Apache Hive along with commands.
3. Explain Apache Sqoop, including its import and export methods.
4. Describe the Apache Flume agent components with a neat sketch. [Include
the pipeline and the consolidation network.]
5. Explain Apache Oozie in detail, with a workflow DAG.
6. Explain HBase in detail.
7. Explain the structure of YARN applications.
8. Explain the following:
Apache Tez, Apache Giraph, Apache Storm, Apache Spark, Apache Flink
9. Explain the YARN architecture with two clients, using a neat diagram.
Module 3
1. How can BI be used to make better decisions?
2. Explain BI tools in detail.
3. Explain any 2 BI applications in detail.
4. List three business intelligence applications in healthcare and wellness.
5. List three business intelligence applications in education.
6. List three business intelligence applications in customer relationship management.
7. What are the design considerations for a data warehouse?
13. What is data mining? What are supervised and unsupervised learning techniques?
19. What are the major mistakes to avoid when doing data mining?
1. What is clustering? Explain the applications of clustering. Write the generic pseudocode for
clustering.
4. Explain the construction of a decision tree and write the pseudocode for building one.
5. Draw an architectural diagram for text mining and explain it. What are the applications of text
mining?
6. Explain web content mining, web structure mining, and web usage mining in detail.