
New Text Document

The document contains a series of questions and answers related to data processing architectures and technologies, focusing on concepts like Lambda Architecture, Apache Spark, and stream processing. Key points include the identification of core components of modern data platforms and the role of different processing engines. It also highlights data delivery semantics and the function of Spark's Standalone Cluster Manager.

Uploaded by

sindhusri1205

1. Which of the following is an example of a hybrid architecture combining both batch and streaming processing?
a) Lambda Architecture
b) Monolithic Architecture
c) Microservices Architecture
d) Event-Driven Architecture

Answer: a) Lambda Architecture
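
To make the answer concrete, here is a minimal sketch of the Lambda idea: a batch layer recomputes a view from historical data, a speed layer updates a view incrementally from recent events, and a serving step merges the two at query time. The event data, the `SpeedLayer` class, and the function names are hypothetical and chosen only for illustration; a real deployment would use a batch engine and a stream processor rather than in-process counters.

```python
from collections import Counter

# Hypothetical event log: (user, clicks) pairs. Names and data are illustrative only.
historical_events = [("alice", 3), ("bob", 1), ("alice", 2)]
recent_events = [("bob", 4), ("carol", 1)]


def build_batch_view(events):
    """Batch layer: recompute totals from the full historical dataset."""
    view = Counter()
    for user, clicks in events:
        view[user] += clicks
    return view


class SpeedLayer:
    """Speed layer: update totals incrementally as each new event arrives."""

    def __init__(self):
        self.view = Counter()

    def on_event(self, user, clicks):
        self.view[user] += clicks


def query(batch_view, speed_view, user):
    """Serving layer: merge the batch and real-time views at query time."""
    return batch_view[user] + speed_view[user]


batch_view = build_batch_view(historical_events)
speed = SpeedLayer()
for user, clicks in recent_events:
    speed.on_event(user, clicks)

totals = {u: query(batch_view, speed.view, u) for u in ("alice", "bob", "carol")}
print(totals)
```

The merge in `query` is what makes the design hybrid: neither layer alone sees all of the data.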

2. Which of the following is NOT a core component of a modern data platform?
a) Data Ingestion
b) Data Processing
c) Data Deletion
d) Data Storage

Answer: c) Data Deletion


3. What is Apache Spark primarily used for in stream processing?
A) Managing relational databases
B) Performing high-speed, real-time data processing
C) Storing unstructured data permanently
D) Visualizing data

Answer: B) Performing high-speed, real-time data processing


4. How does Apache Spark handle memory usage efficiently?
A) By storing all data on disk to avoid memory overflow
B) By leveraging disk for execution optimization
C) By using in-memory computation and caching frequently accessed data
D) By processing data using batch jobs only

Answer: C) By using in-memory computation and caching frequently accessed data

5. In a data platform, which component is responsible for real-time data processing?
a) Batch Processing Engine
b) Stream Processing Engine
c) Data Warehouse
d) File System

Answer: b) Stream Processing Engine

6. What is the goal of throughput-oriented processing in stream processing systems?
A) To prioritize high-speed data ingestion and processing
B) To reduce data loss in batch jobs
C) To minimize real-time data streams
D) To increase processing delays

Answer: A) To prioritize high-speed data ingestion and processing

7. What is event time in a streaming application?
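
The source gives no answer for this question. As commonly defined, event time is the time at which an event actually occurred at its source, carried as a timestamp inside the record, as opposed to processing time, which is when the system handles the record. The sketch below groups records into tumbling windows by their own timestamps, so a record that arrives late still lands in the window where it occurred. The window size, the sample records, and the helper names are assumptions made for illustration.

```python
from collections import defaultdict

WINDOW_SECONDS = 10  # assumed window size for illustration

# Hypothetical records: (event_time_seconds, value), listed in arrival order.
# The third record arrives late: it occurred at t=4 but shows up after t=12.
arrivals = [(1, "a"), (12, "b"), (4, "c"), (19, "d"), (23, "e")]


def window_start(event_time, size=WINDOW_SECONDS):
    """Return the start of the tumbling window that contains event_time."""
    return event_time - (event_time % size)


def group_by_event_time(records):
    """Assign each record to a window using its own timestamp, not arrival order."""
    windows = defaultdict(list)
    for event_time, value in records:
        windows[window_start(event_time)].append(value)
    return dict(windows)


windows = group_by_event_time(arrivals)
print(windows)
```

Grouping by arrival order instead would have placed the late record "c" alongside "b", which is the distinction the question is probing.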

8. Which of the following data delivery semantics ensures that messages are delivered at least once?
A) At-most-once
B) At-least-once
C) Exactly-once
D) None of the above

Answer: B) At-least-once
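
A minimal sketch of how at-least-once delivery is typically achieved: the sender retries until it sees an acknowledgement, so a lost acknowledgement causes a duplicate rather than a lost message. The `FlakyChannel` class is a hypothetical stand-in for a network or broker that drops the first acknowledgement of every message; it is not any real library's API.

```python
class FlakyChannel:
    """Hypothetical channel that delivers each message but loses the first ack."""

    def __init__(self):
        self.received = []          # every delivery the consumer saw
        self._acked_once = set()    # messages whose first ack was already dropped

    def send(self, msg_id, payload):
        self.received.append((msg_id, payload))
        if msg_id not in self._acked_once:
            self._acked_once.add(msg_id)
            return False  # ack lost: the producer cannot tell delivery succeeded
        return True


def send_at_least_once(channel, msg_id, payload, max_attempts=5):
    """Retry until an ack is seen, so the message arrives one or more times."""
    for attempt in range(1, max_attempts + 1):
        if channel.send(msg_id, payload):
            return attempt
    raise RuntimeError(f"no ack for message {msg_id} after {max_attempts} attempts")


channel = FlakyChannel()
attempts = send_at_least_once(channel, 1, "order-created")

# A consumer can deduplicate by message id to make processing idempotent.
unique = {msg_id: payload for msg_id, payload in channel.received}
print(attempts, len(channel.received), unique)
```

Here the message is delivered twice, which is why at-least-once consumers are usually written to tolerate duplicates.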

9. What is the primary function of Spark’s Standalone Cluster Manager?
A) To manage distributed file systems
B) To allocate resources and schedule Spark jobs
C) To replace HDFS
D) To execute SQL queries

Answer: B) To allocate resources and schedule Spark jobs


10. Which of the following is NOT a cluster manager supported by Apache Spark?
A) Apache Mesos
B) Kubernetes
C) Apache Flink
D) Spark’s Standalone Cluster Manager

Answer: C) Apache Flink
