Kafka

Kafka is a distributed publish-subscribe messaging system that uses messages as the fundamental data unit, consisting of keys, values, offsets, and timestamps. It organizes messages into topics, which can be regular or compacted, and supports producers that send messages and consumers that retrieve them. Kafka brokers manage data storage and communication within a cluster, ensuring reliability and scalability through partitioning and replication.

Uploaded by

Khả Võ Văn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views3 pages

Kafka

Uploaded by

Khả Võ Văn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Kafka Introduction

Kafka is a unique distributed publish-subscribe messaging system written in the

Scala language with multi-language support and runs on the Java Virtual
Machine (JVM).
Message
Message is the fundamental data unit in Apache Kafka. It represents a record of
information that is produced by producers and consumed by consumers in
Kafka system.
Structure of Kafka message:
-Key: the key is used to determine the patition of the message. All messages
with the same key will are sent to the same partition.
-Value: is the actual data and payload in a message. Value can be any form of
data.
-Offset: is a unique sequential number assigned to each message in a partition.
Offset is used for determining the position of messages.
-Timestamp: used to determine the time when message was produced.
TOPIC and PARTITIONS
Topic basically is a category or a channel to which messages are stored and
transmitted between producers and consumers.
Kafka supports two types of topic:
-Regular topic: can be configured with a specific retention time or space bound.
When there are messages that are older than specified retention time, or the
space bound is exceeded for a partition, Kafka is allowed to detele those
messages to free space. By default, topics are configured with a retention time
of 7 days, but it's also possible to store data indefinitely.
-Compacted topic: messages are not deleted based on retention time or space
bound. Instead, Kafka treats later messages as updates to earlier messages with
the same key and guarantees never to delete the latest message per key. Only
the older messages with the same key are removed and the latest version of
each key is kept.
Topic is split into multiple partitions, with partition, kafka provide the
parallelism and scalability of data. That is, consumers can consume data from
multiple partitions distributed across different brokers in parallel.
PRODUCERS
Producer is a client process that publishes or sends messages to Kafka topics.
Producers are responsible for sending data to Kafka in a reliable, distributed
and scalable manner.
The producers can specifies the topic and the partition of that topic to which the
message should be sent, either by specifying a key or using a default
partitioning strategy.
CONSUMERS
Consumer is a client process that consumes the messages stored in topics.
A consumer must subscribe to one or more topics from which it wants to
consume messages. Consumers pull data from Kafka brokers, processing the
data in the order it was stored in the topic’s partitions.
Each consumer in a consumer group is assigned different partitions, ensuring
that no two consumers in the same group read the same partition
simultaneously.
After successfully processing a message, the consumer commits its offset to
Kafka, this helps the consumer to keep track of which messages is has
processed.
BROKERS and CLUSTERS
Kafka broker is a server or a node within a Kafka cluster that is responsible for
storing data and handling communication between producers and consumers.
A Kafka broker is a component of the Kafka cluster which receives data from
producers and stores it in Kafka topics, sends data to consumers when they
request it.
Broker also manages partitions within a topic and ensures data is distributed
across the cluster.
It replicates data to multiple brokers so data is ensured even if some brokers
fail.
Each broker is identified by a unique ID.
Kafka brokers act as leaders or followers for partitions. The leader broker
handles read and write requests, while the follower brokers replicate the
leader’s data. If the leader goes down, a new leader is elected from the
followers.
A cluster is a distributed system consisted of multiple brokers working
together. Cluster use Zookeeper for managing cluster state, keeping track of
which broker is the leader of each partition , and monitoring the health of
brokers.
KAFKA ARCHITECTURE

MANG2011完成稿
No ratings yet
MANG2011完成稿
13 pages
Kafka Using Spring Boot
No ratings yet
Kafka Using Spring Boot
136 pages
Sizing of Amine Absorber
No ratings yet
Sizing of Amine Absorber
7 pages
Apache Kafka - Thi Nguyen's Blog
No ratings yet
Apache Kafka - Thi Nguyen's Blog
39 pages
Unit 5 Apache Kafka Notes
No ratings yet
Unit 5 Apache Kafka Notes
54 pages
Kafka Notes Linkedin
No ratings yet
Kafka Notes Linkedin
33 pages
Kafka Introduction1
No ratings yet
Kafka Introduction1
11 pages
Big Data - Group 14
No ratings yet
Big Data - Group 14
26 pages
Apache Kafka
No ratings yet
Apache Kafka
27 pages
Data and AI Kafka Overview 1740507867
No ratings yet
Data and AI Kafka Overview 1740507867
20 pages
Configuring Kafka For High Throughput
No ratings yet
Configuring Kafka For High Throughput
11 pages
5 Kafka 2.7m
No ratings yet
5 Kafka 2.7m
46 pages
Kafka
No ratings yet
Kafka
23 pages
Kafka Topic Questions
No ratings yet
Kafka Topic Questions
9 pages
Apache Kafka
No ratings yet
Apache Kafka
27 pages
Kafka Notes
No ratings yet
Kafka Notes
7 pages
Kafka Concepts For SQS User
No ratings yet
Kafka Concepts For SQS User
17 pages
Kafka
No ratings yet
Kafka
88 pages
Apache Kafka Beginner Guide
No ratings yet
Apache Kafka Beginner Guide
40 pages
Fundamentals and Architecture of Apache Kafka
No ratings yet
Fundamentals and Architecture of Apache Kafka
30 pages
Kafka Interview Questions
No ratings yet
Kafka Interview Questions
10 pages
Pache Kafka Is An Open-Source Distr
No ratings yet
Pache Kafka Is An Open-Source Distr
1 page
Apache Kafka
No ratings yet
Apache Kafka
10 pages
Kafka Architectures Notes
No ratings yet
Kafka Architectures Notes
9 pages
Kafka
No ratings yet
Kafka
5 pages
Introduction To Apache Kafka and Its Setup
No ratings yet
Introduction To Apache Kafka and Its Setup
29 pages
Kafka Using Spring Boot v2
No ratings yet
Kafka Using Spring Boot v2
150 pages
Apache Kafka
No ratings yet
Apache Kafka
43 pages
Kafka Streaming Data
No ratings yet
Kafka Streaming Data
154 pages
Apache Kafka 360 1631077800
No ratings yet
Apache Kafka 360 1631077800
137 pages
Apache Kafka
No ratings yet
Apache Kafka
17 pages
Kafka
No ratings yet
Kafka
19 pages
Kafka Clustering v1.0.0
No ratings yet
Kafka Clustering v1.0.0
20 pages
Kafka With Spring Boot
No ratings yet
Kafka With Spring Boot
48 pages
? Kafka
No ratings yet
? Kafka
2 pages
08 Apache Kafka
No ratings yet
08 Apache Kafka
45 pages
SITA1603 Unit 3 Material
No ratings yet
SITA1603 Unit 3 Material
45 pages
Kafka
No ratings yet
Kafka
12 pages
Apache Kafka Beginner Guide Final
No ratings yet
Apache Kafka Beginner Guide Final
3 pages
Kafka SlidesShare
No ratings yet
Kafka SlidesShare
100 pages
Kafka
No ratings yet
Kafka
43 pages
KAFKAExample 2
No ratings yet
KAFKAExample 2
12 pages
Apache Kafka Description
No ratings yet
Apache Kafka Description
36 pages
Introduction To Apache Kafka
No ratings yet
Introduction To Apache Kafka
18 pages
Kafka
No ratings yet
Kafka
15 pages
Mastering Apache Kafka
No ratings yet
Mastering Apache Kafka
17 pages
Big Data-Kafka
No ratings yet
Big Data-Kafka
14 pages
AK
No ratings yet
AK
22 pages
Introduction To Apache Kafka - 070224-1155-334
No ratings yet
Introduction To Apache Kafka - 070224-1155-334
7 pages
Apache - Kafka Notes
No ratings yet
Apache - Kafka Notes
9 pages
Kafkha
No ratings yet
Kafkha
32 pages
Documentation
No ratings yet
Documentation
105 pages
Kafka
No ratings yet
Kafka
26 pages
Kafka Patterns and Anti-Patterns
No ratings yet
Kafka Patterns and Anti-Patterns
7 pages
Kafka Monitoring
No ratings yet
Kafka Monitoring
64 pages
Cours - Kafka
No ratings yet
Cours - Kafka
72 pages
KAFKA
No ratings yet
KAFKA
11 pages
Kafka Overview
No ratings yet
Kafka Overview
36 pages
GA05 Guide To LEED Certification Commercial
No ratings yet
GA05 Guide To LEED Certification Commercial
10 pages
Ielts Practice-Reading-Skimming and Scanning
No ratings yet
Ielts Practice-Reading-Skimming and Scanning
5 pages
6FM9Y
No ratings yet
6FM9Y
2 pages
Logcat 1711449573996
No ratings yet
Logcat 1711449573996
27 pages
FlexRig Fleet International
No ratings yet
FlexRig Fleet International
2 pages
Introduction To GIS and Its Applications Saurav Gautam
No ratings yet
Introduction To GIS and Its Applications Saurav Gautam
29 pages
Kijoms S 24 00282
No ratings yet
Kijoms S 24 00282
16 pages
113 Trellix NX 4600 Ds Trellix Network Security Tech Specifications Datasheet
No ratings yet
113 Trellix NX 4600 Ds Trellix Network Security Tech Specifications Datasheet
9 pages
Untitled
No ratings yet
Untitled
3 pages
Automation Services Work Portfolio in Telecom Industry: Open Source. Cloud. Automation
No ratings yet
Automation Services Work Portfolio in Telecom Industry: Open Source. Cloud. Automation
34 pages
Padovan 2014
No ratings yet
Padovan 2014
11 pages
PDI Demo
No ratings yet
PDI Demo
6 pages
Chapter 1 - Shining Resonance Refrain Walkthrough - Neoseeker
No ratings yet
Chapter 1 - Shining Resonance Refrain Walkthrough - Neoseeker
6 pages
Electropneumatics Basic Level: Festo Worldwide
No ratings yet
Electropneumatics Basic Level: Festo Worldwide
34 pages
Tos Tle Cookery Third Quarter Bahian
100% (1)
Tos Tle Cookery Third Quarter Bahian
2 pages
API ISCAN-LITE Scanner
No ratings yet
API ISCAN-LITE Scanner
4 pages
MX - Road Design
No ratings yet
MX - Road Design
287 pages
PA-FD-ID Fans
100% (3)
PA-FD-ID Fans
53 pages
Linux Commands
No ratings yet
Linux Commands
4 pages
Educ630 Web-Based Assessment Assignment
No ratings yet
Educ630 Web-Based Assessment Assignment
3 pages
Viewsonic-Manuals N3235w-1M SM 1a
No ratings yet
Viewsonic-Manuals N3235w-1M SM 1a
100 pages
Internship - Report NETWORKING PDF
No ratings yet
Internship - Report NETWORKING PDF
24 pages
Overview of Photonic Layer Functional Elements V4go
No ratings yet
Overview of Photonic Layer Functional Elements V4go
142 pages
Unified Modeling Language (Uml) : Assignment
No ratings yet
Unified Modeling Language (Uml) : Assignment
32 pages
Oscorp Style Guide: Logos
No ratings yet
Oscorp Style Guide: Logos
2 pages
Num5 Ibm
No ratings yet
Num5 Ibm
222 pages
EE102 Lab 4
No ratings yet
EE102 Lab 4
10 pages
Crashing
No ratings yet
Crashing
33 pages

Kafka

Uploaded by

Kafka

Uploaded by

Kafka Introduction

Kafka is a unique distributed publish-subscribe messaging system written in the

You might also like