Test2 1819

This document contains questions for a second exam on stream processing. The questions cover topics like: why MapReduce is not suitable for real-time stream processing; time-based windows in stream processing; Spark APIs; buffering in Flink; Kafka partitions; time-series data compression; energy saving in sensor networks; and programming exercises to analyze a stream of sensor data using different stream processing systems and queries.

DI/FCT/UNL

Mestrado Integrado em Engenharia Informática

Processamento de Streams
2nd Semester, 2018/2019

Second Test (14/June/2019)

Part 1 – closed book

Question 1
Discuss why the MapReduce model (and its Hadoop implementation) is not appropriate for
real-time stream processing.

Question 2
When executing computations over a stream of events, it is possible to define windows based
on the timestamps of the events or based on the time at which each event arrives at the event
processing system.
a) Discuss the implications of adopting each approach for the result of the computations
(give examples when appropriate).
b) Discuss the implications of each approach for the event processing system.

Question 3
The Spark system has two APIs for expressing computations: the Base API and DataFrames/SQL.
Compare the two APIs, presenting the advantages of each.

Question 4
The Apache Flink operation env.setBufferTimeout(timeoutMillis) is used to force transmission
after some time. Explain why this is necessary in Apache Flink (discussing how events are
processed).

Question 5
Apache Kafka topics can be broken up into partitions.
a) Explain what partitions are in Kafka and why they are an interesting mechanism.
b) Discuss what guarantees are provided when consuming events from a topic that has
multiple partitions.

Question 6
Time-series databases include sophisticated mechanisms for compressing information.
a) Explain why data compression is a key feature in these systems.
b) What makes compression particularly efficient in time-series databases?

Question 7
In sensor networks (and in IoT-based sensing systems), nodes are often organized in a tree,
where a sensor node only communicates with its parent and children. Briefly present two
mechanisms used by these systems to save energy when processing a stream of events from
sensors.

Part 2 – open book

Consider a stream of events from sensors with the following format (the type of each value is
presented in parentheses):

timestamp (date), coord x (double), coord y (double), sensor id (long), sensor type (int),
value (double)

The sensor id is a unique identifier of the sensor.

The sensor type identifies the type of sensor (temperature, light, etc.).
Coord x and coord y are the coordinates of the position where the sensor is placed.
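
The event format above can be sketched as a small Python record type. This is only an illustration: the test does not specify how events are serialized, so the comma-separated text line, the ISO-8601 timestamp, and the names SensorEvent and parse_event are all assumptions made for this example.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class SensorEvent:
    """One sensor reading, with the six fields listed in the event format."""
    timestamp: datetime   # timestamp (date)
    coord_x: float        # coord x (double)
    coord_y: float        # coord y (double)
    sensor_id: int        # sensor id (long), unique per sensor
    sensor_type: int      # sensor type (int), e.g. temperature or light
    value: float          # value (double)


def parse_event(line: str) -> SensorEvent:
    """Parse one event from a comma-separated line.

    ASSUMPTION: the wire format (CSV with an ISO-8601 timestamp) is
    hypothetical; the test only gives the field order and types.
    """
    ts, x, y, sensor_id, sensor_type, value = (f.strip() for f in line.split(","))
    return SensorEvent(
        timestamp=datetime.fromisoformat(ts),
        coord_x=float(x),
        coord_y=float(y),
        sensor_id=int(sensor_id),
        sensor_type=int(sensor_type),
        value=float(value),
    )


if __name__ == "__main__":
    print(parse_event("2019-06-14T10:00:00, 1.2341, 2.0004, 42, 47, 103.5"))
```

A line with a different number of fields raises ValueError during unpacking, which may be a reasonable way to reject malformed input in a sketch like this.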
An area is a square identified by a pair (x,y), such that two points are in the same area if the
value of the area, area(coord x, coord y), is the same, with area(x,y) = (round(x*1000),
round(y*1000)).
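
As a minimal sketch, the area function can be written directly in Python. Note one assumption: the test does not say how round treats values exactly halfway between two integers. Python's built-in round uses round-half-to-even, and floating-point multiplication can shift borderline values, so results on area boundaries may differ from other languages.

```python
from typing import Tuple


def area(x: float, y: float) -> Tuple[int, int]:
    """Return the area identifier (round(x*1000), round(y*1000)).

    ASSUMPTION: Python's built-in round() is used; its tie-breaking rule
    (round-half-to-even) is not specified by the test itself.
    """
    return (round(x * 1000), round(y * 1000))


if __name__ == "__main__":
    # Two nearby points fall in the same area...
    print(area(1.2341, 2.0004), area(1.2343, 2.0001))
    # ...while a point slightly further away falls in a neighbouring area.
    print(area(1.2349, 2.0))
```

Because the result is a tuple of integers, it is hashable and can be used directly as a grouping key, for example as a dictionary key when aggregating values per area.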

Question 8
For your favorite event processing system, write a program that reads the above stream of
events from Kafka topic “IoT”, and continuously outputs for each area the average value for
each sensor type.

Question 9
For your favorite event processing system, write a program that reads the above stream of
events from Kafka topic “IoT”, and outputs alarms when the readings from sensors with
sensor type = 47 are larger than ten times the average value over the last 10 days – for
each area, you should output a single alarm every 5 seconds.
NOTE: if you cannot answer this question, write a program that outputs alarms when the
readings from sensors with sensor type = 47 are larger than 100 – for each area, you should
output a single alarm every 5 seconds if at least two sensors read values larger than 100.

Answer one of the following questions.

Question 10
Present the pseudo-code that should run in each node of a TinyDB deployment for executing
the computation of question 9. Express the code in function exec(local evt, rcv evts): out evts,
where local evt is the value read from the local sensor, expressed using the format presented
previously; rcv evts is a list of messages received from children nodes; and out evts is a list of
messages to send to the parent node.

Question 11
Some time-series databases use LSM-trees to store events. Databases that use LSM-trees often
keep in memory Bloom filters that summarize the keys that are present in a given tree – this
allows efficient access to the data.
Discuss whether this approach is efficient for executing queries for a time interval. If so,
present, in pseudo-code, the algorithm used for executing a query. If not, propose which
additional information should be maintained and present, in pseudo-code, the algorithm used
for executing a query.
