Class 5 - MsgQueues, PubSub, Kafka

Uploaded by

tysgart

Async vs Sync

● Synchronous tasks must complete before the user gets a response: the caller waits for the result. They
are generally associated with user actions that need an immediate system response.
● Asynchronous tasks can be processed in the background and are not time-sensitive. They don't need
immediate user feedback and often involve long-running operations that can be offloaded to
background systems.

Message Queues:

● Asynchronous Communication: Message queues enable asynchronous communication between
different services, meaning one service doesn't need to wait for another service to complete its task
before moving on to its next task.
● Load Balancing: They can also help distribute the load evenly among different services or instances
of a service.
● Controlling Throughput: By adjusting the rate at which messages are sent or received, you can
control the throughput of the system.
● Decoupling Components: Message queues decouple the services, meaning the services do not need
to interact with each other directly.
● Scaling: As the load increases, more queues or services reading from the queues can be added to
scale the system.
● Buffering & Throttling: Queues can act as a buffer, holding messages when the processing service is
not ready. Throttling can be implemented to control the rate of message processing based on the
current load on the system.
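
The buffering and decoupling described above can be sketched with Python's standard library. This is a minimal single-process illustration, not a real message broker: the `run_pipeline` function, the `order-N` message names, and the sleep used to simulate slow processing are all assumptions made for the example.

```python
import queue
import threading
import time


def run_pipeline(num_orders: int = 5, buffer_size: int = 2) -> list[str]:
    """Push orders through a bounded in-memory queue to a slower worker."""
    orders: queue.Queue = queue.Queue(maxsize=buffer_size)  # bounded buffer
    processed: list[str] = []
    stop = object()  # sentinel that tells the worker to exit

    def worker() -> None:
        while True:
            item = orders.get()
            if item is stop:
                break
            time.sleep(0.01)  # simulate slow processing
            processed.append(f"processed {item}")

    thread = threading.Thread(target=worker)
    thread.start()
    for i in range(num_orders):
        # put() returns as soon as the buffer has room, so the producer does
        # not wait for processing. It blocks only when the buffer is full,
        # which throttles the producer to the worker's pace.
        orders.put(f"order-{i}")
    orders.put(stop)
    thread.join()
    return processed


if __name__ == "__main__":
    print(run_pipeline())
```

The producer never calls the worker directly; it only knows about the queue. Raising `buffer_size` lets the producer run further ahead, while a small value applies backpressure sooner.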

Distributed Message Queue vs Non-Distributed Message Queue

Availability
● Non-Distributed Message Queue: Lower. Since the system isn't distributed, a single point of failure can
cause the entire service to be unavailable.
● Distributed Message Queue: Higher. Distributed queues are designed to avoid single points of failure. If
one node fails, the system can still continue to operate.

Message Persistence
● Non-Distributed Message Queue: Depends on the specific queue technology and its configuration. Some
may support persistent messaging, but may not be as robust as distributed systems.
● Distributed Message Queue: More robust. Messages in a distributed queue can be replicated across
multiple nodes, so data survives the failure of a single node.

Scalability
● Non-Distributed Message Queue: Limited. The capacity is limited by the resources of the single machine
where it operates.
● Distributed Message Queue: Higher. Since the system is distributed, it can be scaled out by adding more
nodes to the system.

Throughput
● Non-Distributed Message Queue: Lower. Being limited to a single machine's resources, the throughput
might be limited compared to distributed systems.
● Distributed Message Queue: Higher. As you can distribute the load across multiple machines, you can
achieve much higher throughput.

Geographical Distribution
● Non-Distributed Message Queue: Limited. All the data resides on a single machine, which is located in
one geographic location.
● Distributed Message Queue: Enabled. Nodes can be spread across different geographical locations, which
can help in reducing latency and enhancing data locality.

Reliability
● Non-Distributed Message Queue: Lower. Since there's only a single machine, if it fails, the service
becomes unavailable.
● Distributed Message Queue: Higher. The distributed nature of these systems allows for built-in
redundancy. If one node fails, others can take over its load.

Resilience
● Non-Distributed Message Queue: Lower. A single machine's failure can disrupt the whole system.
● Distributed Message Queue: Higher. Even when individual nodes fail, the system as a whole can continue
functioning, making it highly resilient to faults.

Producer/Consumer vs Publish/Subscribe

Producer/Consumer (One-to-one communication): In this pattern, a producer sends messages to a queue,
and a consumer reads from that queue. The key characteristics are:

● The producer and consumer are decoupled.
● The producer adds messages to the queue without knowing about the consumer's state.
● The consumer can consume messages from the queue at its own pace.

Example: Each order on an e-commerce platform (Amazon, for instance) can be seen as a message produced
by the Order Management Service. The Delivery Service, which is responsible for processing these orders,
acts as the consumer. It takes orders from the queue and processes them for delivery.

Tools: RabbitMQ, Apache Kafka, Amazon SQS.

Publish/Subscribe (One-to-many communication): In this pattern, a publisher sends messages to a topic,
and multiple subscribers can receive those messages. The key characteristics are:

● It involves one-to-many communication, where one publisher sends messages to multiple subscribers.
● Subscribers express interest in receiving specific types of messages by subscribing to relevant topics.

Example: When a customer places an order, the Order Management Service publishes a message (order
details). Multiple services like Delivery Service (to process delivery) and Receipt Service (to generate a
receipt) are interested in this message. They subscribe to this topic and receive the message.

Tools: Google Pub/Sub, Apache Kafka, RabbitMQ, AWS SNS.

These two communication patterns serve different purposes and the choice between them depends on the
specific use case. The Producer/Consumer pattern is used when you need to distribute tasks among different
workers (like processing orders). The Publish/Subscribe pattern is used when you want to broadcast
messages to multiple receivers (like notifying different services about a new order).
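
The difference between the two patterns can be sketched in a few lines of in-memory Python. The `WorkQueue` and `Topic` classes and the `demo` function are hypothetical names for this illustration only; they do not correspond to the API of RabbitMQ, Kafka, SQS, or any other tool listed above.

```python
from collections import deque
from typing import Callable, Optional


class WorkQueue:
    """Producer/consumer: each message is handed to exactly one worker."""

    def __init__(self) -> None:
        self._messages: deque = deque()

    def send(self, message: str) -> None:
        self._messages.append(message)

    def receive(self) -> Optional[str]:
        # Removing the message means no other worker will see it.
        return self._messages.popleft() if self._messages else None


class Topic:
    """Publish/subscribe: each message is delivered to every subscriber."""

    def __init__(self) -> None:
        self._subscribers: list[Callable[[str], None]] = []

    def subscribe(self, callback: Callable[[str], None]) -> None:
        self._subscribers.append(callback)

    def publish(self, message: str) -> None:
        for callback in self._subscribers:
            callback(message)


def demo():
    # Work queue: two orders are split between two workers.
    work = WorkQueue()
    work.send("order-1")
    work.send("order-2")
    worker_a, worker_b = work.receive(), work.receive()

    # Topic: one order reaches both the delivery and the receipt service.
    delivery_inbox: list[str] = []
    receipt_inbox: list[str] = []
    orders = Topic()
    orders.subscribe(delivery_inbox.append)
    orders.subscribe(receipt_inbox.append)
    orders.publish("order-1")
    return worker_a, worker_b, delivery_inbox, receipt_inbox


if __name__ == "__main__":
    print(demo())
```

In the work queue each order is processed once, by whichever worker takes it. In the topic, the same order appears in both inboxes.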
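
Kafka

● Distributed streaming platform for real-time data streaming and processing.
● Designed for high-throughput, low-latency applications. It typically sustains higher throughput than
traditional message brokers such as RabbitMQ or JMS-based brokers (JMS is an API and AMQP is a
protocol, so the comparison is with the brokers that implement them).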

Key Kafka Concepts in E-commerce Example

● Producer: Generates and pushes records (messages) into topics. E.g., Order Management Service
creates order messages.
● Consumer: Reads data from Kafka topics. E.g., Delivery Service processes order messages.
● Topic: A named category of records to which multiple consumers can subscribe. E.g., "Orders" topic
for order messages.
● Broker: A server that stores and serves data in a Kafka cluster.
● Cluster: A set of brokers; brokers can be added without downtime.
● Partition: An ordered, append-only slice of a topic. A topic's partitions can be hosted on different
brokers, which is what lets a topic scale.
● Offset: A sequential number that uniquely identifies a record within a partition.
● Replica: A copy of a partition kept on another broker for fault tolerance.
● Consumer Group: A group of consumers that share the work of reading a topic.

Message Structure

● Key (optional): Used to choose the partition; records with the same key go to the same partition.
● Value: Event details (e.g., string/object).
● Timestamp.
● Compression type.
● Headers (optional).
● Partition and offset ID (assigned once written to a topic).
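
To make the message fields and key-based partitioning concrete, here is a small in-memory sketch. The `Record` and `TopicLog` classes are hypothetical and are not the Kafka client API. Two simplifications are assumed: `zlib.crc32` stands in for the hash (Kafka's Java client uses murmur2 by default), and keyless records are spread round-robin. The compression type is left out of the sketch.

```python
import time
import zlib
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Record:
    value: bytes
    key: Optional[bytes] = None
    headers: dict = field(default_factory=dict)
    timestamp_ms: int = field(default_factory=lambda: int(time.time() * 1000))
    partition: Optional[int] = None  # assigned when the record is written
    offset: Optional[int] = None     # assigned when the record is written


def choose_partition(key: bytes, num_partitions: int) -> int:
    # Assumption: crc32 as a stand-in hash. Any stable hash gives the
    # property that matters here: same key, same partition.
    return zlib.crc32(key) % num_partitions


class TopicLog:
    """A topic as a list of append-only partitions (in memory only)."""

    def __init__(self, num_partitions: int) -> None:
        self.partitions: list[list[Record]] = [[] for _ in range(num_partitions)]
        self._next_keyless = 0

    def append(self, record: Record) -> Record:
        if record.key is not None:
            partition = choose_partition(record.key, len(self.partitions))
        else:
            partition = self._next_keyless % len(self.partitions)
            self._next_keyless += 1
        record.partition = partition
        record.offset = len(self.partitions[partition])  # next position in the log
        self.partitions[partition].append(record)
        return record


if __name__ == "__main__":
    log = TopicLog(num_partitions=3)
    first = log.append(Record(b"order created", key=b"customer-42"))
    second = log.append(Record(b"order paid", key=b"customer-42"))
    print(first.partition == second.partition, first.offset, second.offset)
```

Because both records share a key, they land in the same partition with consecutive offsets, so a consumer of that partition sees them in the order they were written.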

How Consumer Consumes and Tracks

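● Kafka consumers track their position in each partition using "offsets."
● Consumers periodically commit the latest processed offset to Kafka. Separately, they send heartbeats
so that the group knows they are still alive.
● Kafka does not track whether a message has been consumed by all consumers; it only stores the
committed offsets of each consumer group.
● Multiple consumers are organized into consumer groups.
● Within a consumer group, each partition is assigned to exactly one consumer, so each message is
delivered to only one consumer in the group. Different groups each receive every message.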
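
The offset tracking and partition assignment above can be sketched as follows. `assign_partitions` and `GroupOffsets` are hypothetical helpers written for this note, not Kafka client classes, and the round-robin assignment is only one possible strategy. A partition is modelled as a plain Python list.

```python
from collections import defaultdict


def assign_partitions(partitions: list[int], consumers: list[str]) -> dict[str, list[int]]:
    """Round-robin assignment: every partition gets exactly one owner in the group."""
    assignment: dict[str, list[int]] = {consumer: [] for consumer in consumers}
    for index, partition in enumerate(sorted(partitions)):
        assignment[consumers[index % len(consumers)]].append(partition)
    return assignment


class GroupOffsets:
    """Committed offset per (group, partition): the next offset to read."""

    def __init__(self) -> None:
        self._committed: dict = defaultdict(int)  # unknown groups start at 0

    def poll(self, group: str, partition: int, log: list, max_records: int = 10) -> list:
        start = self._committed[(group, partition)]
        return log[start:start + max_records]

    def commit(self, group: str, partition: int, next_offset: int) -> None:
        self._committed[(group, partition)] = next_offset


if __name__ == "__main__":
    print(assign_partitions([0, 1, 2], ["c1", "c2"]))

    partition_log = ["o0", "o1", "o2"]
    offsets = GroupOffsets()
    batch = offsets.poll("delivery", 0, partition_log, max_records=2)
    offsets.commit("delivery", 0, len(batch))
    print(offsets.poll("delivery", 0, partition_log))  # resumes after the commit
    print(offsets.poll("receipts", 0, partition_log))  # another group starts from 0
```

The log itself is never modified by reading. Each group only moves its own committed offset, which is why a second group can read the same records independently.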

Replication of Partition

● Kafka replicates each partition across multiple brokers for data reliability and fault tolerance.
● For each partition, one replica is the "leader," handling data requests, while the others are "followers"
that copy the leader's data.
● Replicas are distributed across brokers, so if a broker fails, a follower on another broker can take over
as leader and the data stays available.
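
A toy model of leader and follower replicas is sketched below. It is heavily simplified and is an assumption-laden illustration rather than Kafka's actual protocol: followers are copied synchronously, and the new leader is simply the first surviving broker by name, whereas real Kafka elects a leader from the in-sync replicas. The `ReplicatedPartition` class and broker names are hypothetical.

```python
class ReplicatedPartition:
    """One partition with a copy on each broker (in memory only)."""

    def __init__(self, brokers: list[str]) -> None:
        self.replicas: dict[str, list[str]] = {broker: [] for broker in brokers}
        self.alive: set[str] = set(brokers)
        self.leader: str = brokers[0]

    def append(self, record: str) -> int:
        # Writes go through the leader; this sketch copies the record to
        # every live replica immediately.
        for broker in self.alive:
            self.replicas[broker].append(record)
        return len(self.replicas[self.leader]) - 1  # offset of the new record

    def read(self, offset: int) -> str:
        return self.replicas[self.leader][offset]

    def fail(self, broker: str) -> None:
        self.alive.discard(broker)
        if broker == self.leader:
            if not self.alive:
                raise RuntimeError("no replica left to take over")
            # Simplified election: pick the first surviving broker by name.
            self.leader = sorted(self.alive)[0]


if __name__ == "__main__":
    partition = ReplicatedPartition(["broker-1", "broker-2", "broker-3"])
    partition.append("order-0")
    partition.fail("broker-1")                 # the leader goes down
    print(partition.leader, partition.read(0)) # a follower now serves the data
```

The record written before the failure is still readable, because a follower already held a copy when it became leader.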

ZooKeeper and Its Evolution in Kafka

● ZooKeeper is a coordination service used for maintaining configuration, synchronization, and group
services in distributed systems.
● ZooKeeper was initially essential for Kafka to manage metadata and cluster status.
● Running ZooKeeper meant operating a second distributed system alongside Kafka, which added
complexity and another potential point of failure.
● Kafka 2.8.0 introduced an internal, Raft-based metadata management system as an early-access
feature, reducing the dependency on ZooKeeper.
● This internal system, known as KRaft mode, simplifies Kafka's architecture and is intended to improve
performance and reliability.
● KRaft was declared production-ready in Kafka 3.3, and Kafka 4.0 removed ZooKeeper support, so
current Kafka versions operate without ZooKeeper.

Kafka vs RabbitMQ

Messaging Model
● Apache Kafka: Log-based publish-subscribe (pub/sub) model optimized for real-time data feeds. Kafka
retains all messages for a set period, allowing consumers to replay the stream.
● RabbitMQ: Supports multiple messaging models like pub/sub, request/reply, and point-to-point.
RabbitMQ's focus is more on message routing, delivery, and guarantee.

Performance
● Apache Kafka: High throughput, handling millions of messages per second, which makes it ideal for
heavy-load scenarios.
● RabbitMQ: Good performance for many use-cases, but typically doesn't match Kafka's extremely high
throughput.

Durability
● Apache Kafka: Kafka stores data on disk and provides intra-cluster replication, ensuring message
durability.
● RabbitMQ: RabbitMQ also provides message durability by storing data on disk and supports replication
between nodes.

Use-cases
● Apache Kafka: Best for real-time streaming data analysis, log aggregation, and event sourcing.
● RabbitMQ: More suitable for traditional messaging, task distribution, and situations where complex
routing to multiple consumers is needed.

Ease of Use
● Apache Kafka: Kafka is more complex to set up and manage, due to its distributed nature and more
configuration options.
● RabbitMQ: RabbitMQ is easier to set up and manage, and it offers a user-friendly web-based
management interface.

Message Delivery Semantics
● Apache Kafka: Supports at-most-once (where messages can be lost), at-least-once (where messages can
be duplicated), and exactly-once (via idempotent producers and transactions, at some performance cost)
delivery semantics.
● RabbitMQ: At-least-once delivery (using consumer acknowledgements and publisher confirms) is
standard, and at-most-once is also possible. RabbitMQ does not provide exactly-once delivery;
duplicates are usually handled by making consumers idempotent.

Language Support
● Apache Kafka: Kafka provides the producer and consumer API in multiple languages including Java,
Python, .NET, Go, etc.
● RabbitMQ: RabbitMQ has wide language support with libraries available for many modern
programming languages.

Note that the choice between Kafka and RabbitMQ depends on specific use-case requirements, and each has
its strengths and weaknesses.
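
The at-most-once and at-least-once semantics mentioned in the comparison come down to whether a consumer records its progress before or after processing a message. The sketch below simulates one crash and one restart against an in-memory log. The `consume` function and its parameters are hypothetical and model neither Kafka's nor RabbitMQ's client API.

```python
def consume(log: list[str], crash_at: int, commit_first: bool) -> list[str]:
    """Read `log`, crash once at offset `crash_at`, restart, and return
    everything that was processed across both runs."""
    processed: list[str] = []
    committed = 0  # next offset to read after a restart

    def run(crash: bool) -> None:
        nonlocal committed
        offset = committed
        while offset < len(log):
            if commit_first:
                committed = offset + 1         # at-most-once: commit, then process
                if crash and offset == crash_at:
                    return                     # crash before processing: message lost
                processed.append(log[offset])
            else:
                processed.append(log[offset])  # at-least-once: process, then commit
                if crash and offset == crash_at:
                    return                     # crash before commit: message redone
                committed = offset + 1
            offset += 1

    run(crash=True)   # first run dies at `crash_at`
    run(crash=False)  # restart resumes from the committed offset
    return processed


if __name__ == "__main__":
    messages = ["a", "b", "c"]
    print(consume(messages, crash_at=1, commit_first=True))   # "b" is lost
    duplicated = consume(messages, crash_at=1, commit_first=False)
    print(duplicated)                                          # "b" appears twice
    # An idempotent consumer can drop duplicates to get an effectively-once result.
    print(list(dict.fromkeys(duplicated)))
```

Committing first risks losing the message that was in flight during the crash; committing last risks processing it twice. Deduplicating on the consumer side is one common way to tolerate the second case.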
