Bda End Sem

Question Bank of BDA

Uploaded by

Kaustubh Desale


BDA END SEM

Module : 1
1. Mention four characteristics of big data and explain in detail.(5 marks/24)
2. Write three important characteristics of big data and explain any one with real life
example. (5 marks/24)
3. Explain how big data problems are handled by the Hadoop system. (5 marks/24)
4. Explain Hadoop ecosystem components - Hive and Pig.(5 marks/24)
5. Hadoop advantages and limitations(5 marks/24)
6. Mention four characteristics of big data. Elaborate these characteristics with respect to
social media websites.(5 marks/24)
7. What is the basic difference between traditional RDBMS and Hadoop?(5/23)
8. What are the 3 V’s of big data? Give two big data case studies indicating respective
V’s with justification. (5/23)
9. Why is HDFS more suited for applications having large datasets and not when there
are small files? Elaborate. (5/22)
10. What are the Core Hadoop components? Explain in detail.(10/22)
11. Give a brief overview of Hadoop core components and Hadoop ecosystem
components. (5/22)
12. Mention the 4 characteristics of big data. Elaborate these characteristics with respect to
social media websites. (5/20)
13. List at least 4 different sources of big data from different domains and justify
how they can be considered big data applications. (5/20)
14. When it comes to big data, how does NoSQL score over RDBMS? (5/19)
15. Give the differences between the traditional data management and analytics approach
and the big data approach. (5/19)
16. What is Hadoop? Describe HDFS architecture with a diagram. (10/19)
17.

Module : 2
1. Write a map reduce pseudo code for word count problem. Illustrate with an example
showing all the steps.(10 marks/24)
2. Explain selection and projection relational algebraic operation using MapReduce. (10
marks/24)
3. Explain MapReduce programming model in detail. (5 marks/24)
4. Discuss 1-step Matrix-Matrix Multiplication MapReduce algorithm and apply to the
following problem (10 marks/24)

5. Illustrate different Relational Algebra operations using MapReduce(10 marks/24)
6. Distinguish between Name node and Data node. (5 marks/23)
7. Write a map reduce pseudo code to multiply two matrices. Apply map reduce working
to perform following matrix multiplication. (10/23)

8. Explain natural join and grouping and aggregation relational algebraic operation using
MapReduce. (10/23)
9. Explain how node failure is handled in Hadoop. (10/23)
10. Write a map reduce pseudo code to multiply two matrices. Apply map reduce working
to perform following matrix multiplication. (10/23)

11. Explain Map Reduce execution pipeline with a suitable example. (10/23)
12. What is the function of Map Tasks in the Map Reduce framework? Explain with the help
of an example. (5/22)
13. Write a map reduce pseudo code for word count problem. Apply map reduce working
on the following document: “This is an apple. Apple is red in color”. (10/22)
14. What are the properties and limitations of Hadoop? (5/22)
15. Explain how Hadoop's mapper and reducer work, with an example of performing any
relational algebra operation using Map Reduce. (10/22)
16. Explain the Map Reduce working and apply the working on the following document.
“I like an apple and a banana. He likes an apple and a melon. I also like a melon.”
(5/20)
17. Apply Map Reduce Vector Multiplication algorithm to perform the following matrix
vector multiplication. (5/20)

18. What is the role of JobTracker and TaskTracker in MapReduce? Illustrate the Map
Reduce execution pipeline with the word count example. (10/19)
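
The word count questions above can be practised against a small single-machine sketch of the map, shuffle and reduce steps. This is only an illustration, not Hadoop code: the function names `map_phase`, `shuffle_phase` and `reduce_phase` are made up for this example, and lower-casing the words (so that "Apple" and "apple" count as the same key) is an assumption that the question does not state.

```python
import re
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in the document."""
    # Assumption: words are lower-cased and punctuation is ignored.
    words = re.findall(r"[a-z]+", document.lower())
    return [(word, 1) for word in words]


def shuffle_phase(pairs):
    """Shuffle/sort: group all values that share the same key."""
    groups = defaultdict(list)
    for word, count in pairs:
        groups[word].append(count)
    return dict(sorted(groups.items()))


def reduce_phase(groups):
    """Reduce: sum the list of counts collected for each word."""
    return {word: sum(counts) for word, counts in groups.items()}


document = "This is an apple. Apple is red in color"
mapped = map_phase(document)        # [('this', 1), ('is', 1), ('an', 1), ...]
grouped = shuffle_phase(mapped)     # {'an': [1], 'apple': [1, 1], ...}
word_counts = reduce_phase(grouped)
print(word_counts)
```

Under the lower-casing assumption, "apple" and "is" each end up with a count of 2 and every other word with 1. In a real Hadoop job the three steps run on different nodes, and the framework performs the shuffle between mappers and reducers.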

Module : 3
1. List and explain the core business drivers behind the NoSQL movement. (5 marks/24)
2. Differentiate between SQL and NoSQL system.(5 marks/24)
3. Recall all NoSQL design patterns with examples. Justify the CAP property. (10 marks/24)
4. List and explain the core business drivers behind the NoSQL movement. (5
marks/23)
5. What is a key-value store? What are the benefits of using a key-value store? (10/23)
6. Describe the four ways by which big data problems are handled by NoSQL.(10/23)
7. Demonstrate how business problems have been successfully solved faster, cheaper
and more effectively considering NoSQL Google’s MapReduce case study. Also
illustrate the business drivers and the findings in it.(5/22)
8. Name the three ways that resources can be shared between computer systems. Name
the architecture used in big data solutions and describe it in detail.(10/22)
9. Compare a key-value NoSQL datastore with a document-based NoSQL datastore.
(5/22)
10. Explain in detail any two Big data Applications based on NoSQL.(5/22)
11. List all variations of NoSQL databases with two features of each and two examples of
each. (5/20)
12. Explain the CAP theorem for NoSQL databases. Since a NoSQL database is not able to adopt
ACID properties, can we adopt NoSQL for a traditional banking application? (5/20)
13. Explain different ways by which big data problems are handled by NoSQL. (10/19)

Module : 4
1. Explain the concept of bloom filter with an example.(5/24)
2. Suppose the stream is S = {4, 2, 5, 9, 1, 6, 3, 7}. Let the hash function be
h(x) = (3x + 7) mod 32, that is, h(x) = (ax + b) mod 32 with a = 3 and b = 7; treat the
result as a 5-bit binary integer. Show how the Flajolet-Martin algorithm will estimate
the number of distinct elements in this stream. (10/24)
3. Explain DGIM algorithm for counting ones in a stream with example(10/24)
4. Explain DGIM algorithm for counting ones in a stream with example.(10/24)
5. FM algorithm(5/24)
6. List and explain the different issues and challenges in data stream query
processing.(5/23)
7. Suppose the stream is S = {2, 1, 6, 1, 5, 9, 2, 3, 5}. Let the hash function be of the form
h(x) = (ax + b) mod 16 for some a and b; treat the result as a 4-bit binary integer. Show
how the Flajolet-Martin algorithm will estimate the number of distinct elements for
h(x) = (4x + 1) mod 16. (10/23)
8. With a neat sketch, explain the architecture of the data-stream management
system.(10/23)
9. List down all six constraints that must be satisfied for representing a stream by
buckets using DGIM algorithm with examples.(5/23)
10. Suppose the stream is S = {4, 2, 5, 9, 1, 6, 3, 7}. Let the hash function be
h(x) = (x + 6) mod 32, that is, h(x) = (ax + b) mod 32 with a = 1 and b = 6; treat the
result as a 5-bit binary integer. Show how the Flajolet-Martin algorithm will estimate
the number of distinct elements in this stream. (10/23)
11. Explain DGIM algorithm for counting ones in a stream with example.(10/23)
12. Explain the concept of bloom filter with an example(5/22)
13. Suppose the stream is 1, 3, 2, 1, 2, 3, 4, 3, 1, 2, 3, 1. Let h(x) = (6x + 1) mod 5. Show
how the Flajolet-Martin algorithm will estimate the number of distinct elements in
this stream. (10/22)
14. With a neat sketch, explain the architecture of the data-stream management
system(10/22)
15. Why is it difficult to work with stream data?(5/21)
16. Explain the architecture of Data Stream Management Systems. How is it different
from DBMS?(10/21)
17. Investigate problems in Flajolet-Martin (FM) algorithm to count distinct elements
in a stream.(5/21)
18. Explain the DGIM algorithm and solve the following problem :
Consider the data stream shown below with N=14. 10011010101011101
i) Show one way of how the above initial stream will be divided into buckets and
count distinct 1’s.
ii) The following bits enter the window one at a time: 10101. What is the
bucket configuration in the window after this sequence of bits has been
processed by DGIM and count distinct 1’s.(10/21)
19. Consider the stock market stream data. Justify the data stream features and draw the
model of data stream management for the mentioned system. Give two examples of a
one-time query and a continuous query from the stock market stream. (10/20)
20. Explain with a block diagram the architecture of a Data Stream Management System. (10/19)
21. What do you mean by counting distinct elements in a stream? Illustrate with an
example the working of the Flajolet-Martin algorithm used to count the number of distinct
elements. (10/19)
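
For the Flajolet-Martin questions above, a hand-worked answer can be checked with a short sketch like the one below. It uses the stream and the hash function h(x) = (3x + 7) mod 32 from the 2024 question, and it assumes the basic single-hash version of the algorithm: take R as the largest number of trailing zero bits seen in any hash value and estimate the distinct count as 2^R. The helper names and the choice to treat a hash value of 0 as having no trailing zeros are assumptions made for this illustration.

```python
def trailing_zeros(value):
    """Count trailing zero bits; 0 is treated as having none (an assumed convention)."""
    if value == 0:
        return 0
    count = 0
    while value % 2 == 0:
        value //= 2
        count += 1
    return count


def flajolet_martin(stream, hash_fn):
    """Return (estimate, R), where R is the maximum tail length and estimate is 2**R."""
    max_r = 0
    for x in stream:
        max_r = max(max_r, trailing_zeros(hash_fn(x)))
    return 2 ** max_r, max_r


def h(x):
    """Hash function from the question, read as (3x + 7) mod 32."""
    return (3 * x + 7) % 32


stream = [4, 2, 5, 9, 1, 6, 3, 7]

# Print the working table: element, hash value, 5-bit binary form, tail length.
for x in stream:
    print(x, h(x), format(h(x), "05b"), trailing_zeros(h(x)))

estimate, max_r = flajolet_martin(stream, h)
print("R =", max_r, "estimate =", estimate)
```

For this stream the longest tail comes from x = 3, where h(3) = 16 = 10000 in binary, so R = 4 and the estimate is 2^4 = 16, against 8 truly distinct elements. A single hash function can overestimate in this way, which is why practical variants combine the estimates from many hash functions.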

Module : 5
1. What is graph store? Give an example where a graph store can be used to
effectively solve a particular business problem.(10/24)
2. Determine communities for the given social network graph using Girvan-
Newman algorithm. (10/24)

3. Describe collaborative filtering in recommendation system. (10/24)

4. Investigate to find all communities in the graph given below using CPM
method.(10/24)

5. Comment on the usefulness of different types of Recommendation System in real life
with examples. (10/24)
6. Find PAGERANK of each page in the following figure after 3rd iteration.(10/24)
7. Determine communities for the given social network graph using Girvan-
Newman algorithm.(10)

8. Define collaborative filtering. Using an example of an e-commerce site like
Flipkart or Amazon, describe how it can be used to provide recommendations to
users. (10)
9. Determine communities for the given social network graph using Girvan-
Newman algorithm.(10/23)

10. How is recommendation done based on the properties of the product? Explain with
the help of an example. (10/23)
11. Determine communities for the given social network graph using Girvan-
Newman algorithm.(10/22)

12. How is recommendation done based on the properties of the product? Elaborate with a
suitable example. (10/22)
13. Describe any one Community detection algorithm for social media with an
example.(10/21)
14. Compare Content based recommendation system with collaborative
recommendation. Give an example of Utility Matrix for the most popular movie
recommendation system for the user profile and the item profile and mention the
methods by which you can find the similar users.(10/21)
15. Write the algorithm for Clique Percolation Method. Apply the same to find the
communities on the following graph. (Show the stepwise execution of the
algorithm).(10/21)
16. Write the algorithm for Clique Percolation Method. Apply the same to find the
communities on the following graph. (Show the stepwise execution of the
algorithm). (10/21)

17. What is the use of a Recommender System? How is a classification algorithm used in
a recommendation system? (10/19)

18.
(10/19)
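
For the collaborative filtering questions above that ask how similar users can be found from a utility matrix, cosine similarity is one commonly used measure. The sketch below is only an illustration: the user names, the four-movie utility matrix and the convention that 0 means "not rated" are all hypothetical and are not taken from any question paper.

```python
import math

# Hypothetical utility matrix: rows are users, columns are four movies,
# and 0 means the user has not rated that movie.
utility = {
    "user_a": [5, 4, 0, 1],
    "user_b": [4, 5, 1, 0],
    "user_c": [1, 0, 5, 4],
}


def cosine_similarity(u, v):
    """Cosine of the angle between two rating vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def most_similar_user(target, matrix):
    """Return the other user whose rating vector is closest to the target's."""
    others = [name for name in matrix if name != target]
    return max(others, key=lambda name: cosine_similarity(matrix[target], matrix[name]))


nearest = most_similar_user("user_a", utility)
print(nearest, cosine_similarity(utility["user_a"], utility[nearest]))
```

With these made-up ratings, user_a is closest to user_b (similarity 40/42, about 0.95) and far from user_c (9/42, about 0.21), so a user-based recommender would suggest to user_a the items that user_b rated highly. Jaccard similarity and Pearson correlation are other measures often discussed for the same step.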
