Midsem Regular QP

This document contains a mid-semester test for the course "Stream Processing & Analytics". It consists of 5 questions related to streaming data systems, architectures, and Apache Kafka. The test is closed book, worth 30% of the student's grade, and lasts 2 hours. It provides instructions to follow and notes that answers should start on a new page with any assumptions clearly stated.

Birla Institute of Technology & Science, Pilani

Work Integrated Learning Programmes Division


Second Semester 2019-2020

Mid-Semester Test
(EC-2 Regular)

Course No. : DSECLZG556


Course Title : Stream Processing & Analytics
Nature of Exam : Closed Book
Weightage : 30%
No. of Questions : 05
Duration : 2 Hours
Date of Exam :
Note to Students:
1. Please follow all the Instructions to Candidates given on the cover page of the answer book.
2. All parts of a question should be answered consecutively. Each answer should start from a fresh page.
3. Assumptions made if any, should be stated clearly at the beginning of your answer.

Q.1. What are streaming data systems? Explain the Generalized Streaming Data architecture and
its various components. [6]

Q.2. A system has been developed that interested parties can use to receive vote-counting
updates for parliamentary elections. Each constituency is divided into six blocks, and each
block has several voting centers. Counting is done center-wise and takes approximately 30
minutes per center. Once counting is complete for all the centers in a block, the central
system is notified of the latest count of votes received by the various candidates. Giving
three reasons, justify whether the system described above is a case of streaming data. [6]

Q.3. Compare the different streaming data delivery protocols with respect to the following points:
I. Message frequency
II. Communication direction
III. Message latency
IV. Efficiency [6]

Q.4. Consider an international airline that operates in both the passenger and cargo
segments. For every flight in the air, the airline captures a large amount of data in real time,
which can be used for live tracking of flight status, modelling flight schedules, scheduling
preventive maintenance, and so on. At the same time, the same data is used for various analytical
purposes aimed at improving airline operations, predicting passenger and cargo loads in the
near future, and devising marketing strategies around them.
Identify the appropriate data processing architecture that can help in achieving these use cases.
With the help of an architectural diagram, represent the proposed system architecture. [6]

Q.5. A producer produces messages that are fed to a Kafka topic with three partitions.
Another producer produces messages that are fed to the same Kafka topic as well as to a
different Kafka topic with two partitions. There are 6 Kafka brokers in the system and
3 consumers, of which the first two listen to the partitions of the first topic, whereas
the last one listens to the partitions of the second topic. For each topic partition, 2
replicas are maintained in the cluster. Draw a suitable Kafka cluster architectural diagram
fulfilling the above-mentioned requirements. [6]
