21cs71BDA Question Bank

Uploaded by

someshgowda7975

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

152 views4 pages

21cs71BDA Question Bank

Uploaded by

someshgowda7975

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Big Data Analytics Question Bank 21CS71 :[based on previous years papers & Model

question papers]
Module1: Introduction to Big Data Analytics.
1. Define Big Data. Explain the Evolution of Big Data and their characteristics
2. What is grid computing? List and explain the features, drawbacks of grid computing
3. Discuss the functions of each of the five layers in Big Data architecture design
4. Illustrate the various phases involved in Big Data Analytics with neat diagram.
5. Discuss the evolution of BigData
6. Explain the characteristics of BigData
7. Write a neat block diagram,Explain data architecture design.
8. Write a notes on Analytical scalability to big data and Massive Parallel Processing
Platforms.
9. Highlight Big Data Analytics with one case study?
10. Define BigData. Explain the classification of bigdata?
11. Define Scalability and its types along with the examples.
12. Explain the functions of each layer in Big data architecture design with a diagram.
13. Define data preprocessing. Explain in brief the needs of preprocessing?
14. Explain the following terms. i. Scalability & Parallel Processing ii. Grid & Cluster
Computing.
15. What is Cloud Computing? Explain different services of Cloud.
16. Explain any two Big Data different Applications.
17. How does Berkeley data analytics stack help in analytics take?

Module:2 Introduction to Hadoop (T1), Hadoop Distributed File System Basics (T2), Essential
Hadoop Tools (T2).
1. Illustrate the Hadoop core components with neat diagram
2. Discuss the Hadoop system and ecosystem components in four layers
3. Illustrate YARN based execution model and its functions With a neat diagram
4. Discuss the Apache sqoop import and export methods with neat diagram.
5. What are the core components of Hadoop? Explain in brief its each of its components?
6. Explain Hadoop Distributed File System?
7. Define MapReduce Framework and its functions?
8. Write down the steps on the request to MapReduce and the types of process in
MapReduce.
9. Write short noted on Flume Hadoop Tool.
10. What is HDFS? Highlight the important design features of the HDFS
11. Bring out the concepts of the HDFS block replication with an example
12. Explain Apache sqoop import and export method with neat diagram
13. Demonstrate any six HBase commands with output?
14. Write short note on Apache hive.
15. Explain Apache Oozie with neat diagram.
16. Explain YARN application framework.

Module : 3 NoSQL Big Data Management, MongoDB and Cassandra:

1.Discuss the NoSQL data stores and their characteristic features
2. Illustrate the key value pairs in data architectural patterns with an example
3. Discuss the functions of MongoDB query language and database commands
4. Illustrate the CQL commands and their functionality.
5. Define key-value store with example. What are the advantages of key-value store?
6.Write down the steps to provide client to read and write values using key-value store?What
are the typical uses of keyValue store?
7.Discuss the characteristics of NoSQL data store along with the features in NOSQL
transactions?
8.With neat diagrams,explain the following Shared-Nothing Architecture for
BigDataTasks,Explain the following distribution model? (i) Single server model
(ii)Sharding very large databases (iii)Master Slave distribution model (iv) Peer to peer
distribution model.
9.Explain about NOSQL datastore and its characteristics.
10.Describe the principle of working of the CAP theorem
11.Demonstrate the working of key-value store with an example.
12.Describe the principle of working of the CAP theorem.
13.Demonstrate the working of key-value store with an example
14.Describe the features of MongoDB, and its industrial application
15. Explain NOSQL Data Architecture Patterns.
16. Explain MONGO DATABASE.[10m]

Module 4: Map Reduce, Hive and PIG

1.Describe the MapReduce execution steps with neat diagram.
2. Explain Key Value pairing in Map Reduce.
3. Discuss the functions of Group By, partitioning and combining using one example for each
4. Illustrate main features and Architecture of Hive with neat diagram.
5. Discuss the pig Latin data types and examples.
6.With a neat diagram, Explain the process in MapReduce when client submitting a Job?
7.Explain Hive Integration and workflow steps involved with a diagram?
8.Using HiveQL for the following:
a. Create a table with partition
b. Add, rename and drop a partition to a table
9.What is Pig in Big Data? Explain the features of PIG?
10.Describe the Map tasks, Reduce tasks and Map reduce Execution process
11.Describe the Hive architecture and its characteristics.
12.Demonstrate the pig architecture for scripts dataflow and processing
13.Differentiate between pig and Map reduce give industrial application for each.

Module:5 Machine Learning Algorithms for Big Data Analytics &

Text, Web Content, Link, and Social Network Analytics.
1. Discuss Analysis of Variances(ANOVA) and correlation indicators of linear relationship
2. Describe the regression analysis predict the value of the dependent variable in case
of linear regression
3. Illustrate the various phases in text mining process pipeline
4. Describe the web content mining and three phases for web usage mining
5. In Machine Learning explain linear and non-linear relationship with essential graphs?
6. Write the block diagram of text mining process and explain its phases?
7. Define multiple regressions. Write down the examples involved in forecasting and
optimization in regression.
8. Explain the parameters in social graph network topological analysis using centralities
and PageRank?
9. Explain the simple linear regression analysis?
10. Demonstrate frequent item set mining and association rule mining.
11. Explain the purpose of web usage analytics and the significance of web graphs
12. What is Machine Learning? Explain different types of Regression Analysis.
13. Explain with neat diagram K-means clustering.
14. Explain Naïve Bayes Theorem with example.
Reference books:
1. Raj Kamal and Preeti Saxena, “Big Data Analytics Introduction to Hadoop, Spark, and
Machine Learning”, McGraw Hill Education, 2018 ISBN: 9789353164966, 9353164966
2. Douglas Eadline,[refer for module1,2(half),3,4,5]
2. "Hadoop 2 Quick-Start Guide: Learn the Essentials of Big Data Computing in the
Apache Hadoop 2 Ecosystem", 1 stEdition, Pearson Education, 2016. ISBN13: 978
9332570351 [module 2 only]

cp5293 Big Data Analytics Question Bank
0% (1)
cp5293 Big Data Analytics Question Bank
13 pages
20IT503 - Big Data Analytics - Unit4
No ratings yet
20IT503 - Big Data Analytics - Unit4
73 pages
The Stress Analysis - FEM
No ratings yet
The Stress Analysis - FEM
41 pages
Question Bank
No ratings yet
Question Bank
3 pages
18CS72-BDA Question Bank of First Internal Syllabus
No ratings yet
18CS72-BDA Question Bank of First Internal Syllabus
1 page
Big Data SV Publication
No ratings yet
Big Data SV Publication
142 pages
Bda Unitwise QB
No ratings yet
Bda Unitwise QB
3 pages
BAD601 Important Question
No ratings yet
BAD601 Important Question
2 pages
BDA Syllabus - Sem VII - Mumbai University
No ratings yet
BDA Syllabus - Sem VII - Mumbai University
3 pages
QB Ia1 Bda
No ratings yet
QB Ia1 Bda
1 page
Syllabus
No ratings yet
Syllabus
3 pages
BDA Simp Tie
No ratings yet
BDA Simp Tie
2 pages
Big Data Analytics Digital Notes
No ratings yet
Big Data Analytics Digital Notes
119 pages
Bda Imp Questions Sem 7
No ratings yet
Bda Imp Questions Sem 7
7 pages
Super Important Questions For BDA-18CS72: Module-1
No ratings yet
Super Important Questions For BDA-18CS72: Module-1
2 pages
MCAD2232 (PRESS) BIG DATA and Its Applications
No ratings yet
MCAD2232 (PRESS) BIG DATA and Its Applications
140 pages
1) Introduction To Big Data
No ratings yet
1) Introduction To Big Data
6 pages
Imp For Exam
No ratings yet
Imp For Exam
2 pages
Bda Simp-23
No ratings yet
Bda Simp-23
2 pages
Big Data Analytics (R18a0529)
No ratings yet
Big Data Analytics (R18a0529)
134 pages
Big Data Analytics Comp Syllabus Sem7
No ratings yet
Big Data Analytics Comp Syllabus Sem7
4 pages
CS8091 Big Data Analytics
No ratings yet
CS8091 Big Data Analytics
28 pages
Big Data Analytics
No ratings yet
Big Data Analytics
61 pages
Introduction of Subject
No ratings yet
Introduction of Subject
28 pages
BDA - AIDS Syllabus
No ratings yet
BDA - AIDS Syllabus
2 pages
SEM VII BDA Syllabus Theory
No ratings yet
SEM VII BDA Syllabus Theory
4 pages
No SQL Database in Bda
No ratings yet
No SQL Database in Bda
84 pages
Mrcet R20 Iv 1 QB
No ratings yet
Mrcet R20 Iv 1 QB
79 pages
Important Big Data Questions AKTU
No ratings yet
Important Big Data Questions AKTU
3 pages
BgiData QB
100% (1)
BgiData QB
3 pages
Gujarat Technological University: Sr. No. Content Total Hrs % Weightage 1 13
No ratings yet
Gujarat Technological University: Sr. No. Content Total Hrs % Weightage 1 13
3 pages
Sapthagiri College of Engineering: Department of Information Science and Engineering Big Data Analytics Question Bank
No ratings yet
Sapthagiri College of Engineering: Department of Information Science and Engineering Big Data Analytics Question Bank
3 pages
TIE - 21CS71 SIMP With Key Answers
No ratings yet
TIE - 21CS71 SIMP With Key Answers
19 pages
BD Imp Ques 1
No ratings yet
BD Imp Ques 1
22 pages
20ai402 Data Analytics Unit-2
No ratings yet
20ai402 Data Analytics Unit-2
72 pages
MCA - BigData Notes
No ratings yet
MCA - BigData Notes
136 pages
Big Data 2023
No ratings yet
Big Data 2023
18 pages
CS8091 BDA Unit1
No ratings yet
CS8091 BDA Unit1
63 pages
Big Data Analytics
No ratings yet
Big Data Analytics
31 pages
Big Data
No ratings yet
Big Data
22 pages
Big Data Analytics Course Outline (Fall 2020) : Dr. Tariq Mahmood 830 Am - 11 Am (Monday) Scope
No ratings yet
Big Data Analytics Course Outline (Fall 2020) : Dr. Tariq Mahmood 830 Am - 11 Am (Monday) Scope
3 pages
B.Tech. CS - CE and CSE Syllabus 3rd Year 2024-25
No ratings yet
B.Tech. CS - CE and CSE Syllabus 3rd Year 2024-25
2 pages
Cp5293 Big Data Analytics Question Bank
0% (1)
Cp5293 Big Data Analytics Question Bank
13 pages
Big Data Syllabus
No ratings yet
Big Data Syllabus
1 page
BDA Assignm-1
No ratings yet
BDA Assignm-1
2 pages
It - (R20) - 4-1 - Big Data Analytics - Digital Notes
No ratings yet
It - (R20) - 4-1 - Big Data Analytics - Digital Notes
117 pages
J. B. Institute of Engineering and Technology
No ratings yet
J. B. Institute of Engineering and Technology
1 page
BIG Data - Unit - 1
No ratings yet
BIG Data - Unit - 1
24 pages
General Question Bank
No ratings yet
General Question Bank
5 pages
Bda Sem 7 Book
No ratings yet
Bda Sem 7 Book
188 pages
Big Data Analytics - Sem 7 CVMU
No ratings yet
Big Data Analytics - Sem 7 CVMU
4 pages
Big Data Analytics-Digital Notes
No ratings yet
Big Data Analytics-Digital Notes
86 pages
IOT Analytics - AI361
No ratings yet
IOT Analytics - AI361
3 pages
Part B Questions
No ratings yet
Part B Questions
3 pages
Big Data Analytics
No ratings yet
Big Data Analytics
19 pages
Big Data Notes Pdf3
No ratings yet
Big Data Notes Pdf3
114 pages
Big Data Lec4
No ratings yet
Big Data Lec4
38 pages
ITS Module 2
No ratings yet
ITS Module 2
9 pages
AIML Mod-5
No ratings yet
AIML Mod-5
18 pages
AIML Mod-2
No ratings yet
AIML Mod-2
57 pages
AIML Mod-1
No ratings yet
AIML Mod-1
40 pages
FRP Vs HDPE
No ratings yet
FRP Vs HDPE
5 pages
Lab Session: 4: Demonstrate The Behavior of A Silicon Diode in Full Wave Rectifier
No ratings yet
Lab Session: 4: Demonstrate The Behavior of A Silicon Diode in Full Wave Rectifier
10 pages
M1 Learning Activity
No ratings yet
M1 Learning Activity
8 pages
FTI Sampling Testing Cleaning Ade
No ratings yet
FTI Sampling Testing Cleaning Ade
2 pages
Connected Particles PPQ
No ratings yet
Connected Particles PPQ
21 pages
Hydrogen-Tour - SiemensEnergyGoerlitz - Stefanie Randig & Karolin GrÃ SCHL
No ratings yet
Hydrogen-Tour - SiemensEnergyGoerlitz - Stefanie Randig & Karolin GrÃ SCHL
13 pages
Saurabh Thesis M.SC
No ratings yet
Saurabh Thesis M.SC
34 pages
2 - Leopold's Maneuver
No ratings yet
2 - Leopold's Maneuver
24 pages
11 Sensor Leads
No ratings yet
11 Sensor Leads
4 pages
Ajeet Singh Iffco PDF
No ratings yet
Ajeet Singh Iffco PDF
38 pages
Manual Samart Watch
No ratings yet
Manual Samart Watch
11 pages
2022 LASC Rules & Requirements Document R00
No ratings yet
2022 LASC Rules & Requirements Document R00
31 pages
"Ready For The New Epidemic?" - Prof. David Russell - May 2007
No ratings yet
"Ready For The New Epidemic?" - Prof. David Russell - May 2007
1 page
Serpentine PDF
No ratings yet
Serpentine PDF
1 page
Hewan Invertebrata Kupu
No ratings yet
Hewan Invertebrata Kupu
13 pages
Elijah Carigon - QBA HW 2
No ratings yet
Elijah Carigon - QBA HW 2
2 pages
Neuroscience ABCs
100% (5)
Neuroscience ABCs
261 pages
Medieval Whales and Whaling
No ratings yet
Medieval Whales and Whaling
6 pages
MW My First Dictionary
100% (1)
MW My First Dictionary
6 pages
Chest Physician in Pune - Dr. Sharadchandra Yadav
No ratings yet
Chest Physician in Pune - Dr. Sharadchandra Yadav
8 pages
Histology A Text and Atlas With Correlat
No ratings yet
Histology A Text and Atlas With Correlat
14 pages
Origen Volkerpsychologie
No ratings yet
Origen Volkerpsychologie
11 pages
Biology Lab: DNA Extraction From Wheat Germ: Purpose: Background
No ratings yet
Biology Lab: DNA Extraction From Wheat Germ: Purpose: Background
2 pages
Different Types of Housing Systems of Livestock and Poultry, (By Krushna Keshab Purohit)
No ratings yet
Different Types of Housing Systems of Livestock and Poultry, (By Krushna Keshab Purohit)
22 pages
Electronics and Communication Engineering
No ratings yet
Electronics and Communication Engineering
9 pages
National Taiwan University Department of Civil Engineering
No ratings yet
National Taiwan University Department of Civil Engineering
2 pages
BDP PDF
No ratings yet
BDP PDF
28 pages
CE 200A-102 - Surveying Lab
No ratings yet
CE 200A-102 - Surveying Lab
6 pages
Chapter 16 Interviewing The Wright Brothers
No ratings yet
Chapter 16 Interviewing The Wright Brothers
14 pages

21cs71BDA Question Bank

Uploaded by

21cs71BDA Question Bank

Uploaded by

Big Data Analytics Question Bank 21CS71 :[based on previous years papers & Model

Module : 3 NoSQL Big Data Management, MongoDB and Cassandra:

Module 4: Map Reduce, Hive and PIG

Module:5 Machine Learning Algorithms for Big Data Analytics &

You might also like