Apache Kafka

Apache Kafka is a distributed messaging system that facilitates data exchange between different parts of a computer system through a publish-subscribe model. It features a robust ecosystem that includes topics, brokers, partitions, and consumers, allowing for high throughput, fault tolerance, and scalability. Key applications of Kafka include real-time data processing for services like Ola and Zomato, and it supports exactly-once message delivery semantics.

Uploaded by anisha.kse

Apache Kafka
Apache Kafka is a distributed messaging system that helps different parts of a
computer system exchange data by publishing and subscribing to topics.
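The publish-subscribe model can be sketched with a tiny in-memory stand-in. The `MiniBus` class and its methods below are hypothetical illustrations of the idea, not a real Kafka client (real clients talk to a broker over the network):

```python
# Minimal in-memory sketch of the publish-subscribe model Kafka implements.
# Hypothetical names; a real Kafka client connects to a broker instead.
from collections import defaultdict

class MiniBus:
    def __init__(self):
        self.subscribers = defaultdict(list)  # topic name -> list of callbacks

    def subscribe(self, topic, callback):
        # A subscriber registers interest in a topic.
        self.subscribers[topic].append(callback)

    def publish(self, topic, message):
        # Every subscriber of the topic receives the published message.
        for callback in self.subscribers[topic]:
            callback(message)

bus = MiniBus()
received = []
bus.subscribe("orders", received.append)
bus.publish("orders", {"id": 1, "item": "pizza"})
```

The key property shown here is decoupling: the publisher does not know who the subscribers are, only the topic name.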

[Diagram: a publisher (sender) writes messages to Apache Kafka, and a subscriber (receiver) reads them.]
Why we use Apache Kafka
• Ola driver location updates
• Zomato live food tracking
• Notification systems that reach huge numbers of users
• Increasing database throughput: very frequent per-update reads and writes
(for example, a Zomato delivery rider's location) are difficult for a database
to handle directly, so Kafka buffers the stream

[Diagram: live food tracking flow. The Zomato delivery rider's location
updates go to the Zomato server, which publishes them to a Kafka topic; the
user reads the live location from the topic, and the data is stored in the
database through bulk batch operations.]
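The batching idea in the flow above can be sketched without a real broker. The `topic` list and helper functions below are hypothetical in-memory stand-ins for a Kafka topic and a batching consumer:

```python
# Sketch of the pattern in the slide: frequent location updates are published
# to a topic, and a consumer writes them to the database in bulk batches
# instead of one row per update. Hypothetical in-memory stand-ins.
topic = []  # stand-in for a Kafka topic partition

def publish(message):
    topic.append(message)

def consume_in_batches(batch_size):
    # Group buffered messages into chunks suitable for one bulk DB write each.
    batches = []
    for start in range(0, len(topic), batch_size):
        batches.append(topic[start:start + batch_size])
    return batches

for i in range(7):
    publish({"rider": "zomato-1", "lat": 12.9 + i * 0.001})

batches = consume_in_batches(batch_size=3)
```

Seven individual updates become three bulk writes, which is the throughput gain the slide is pointing at.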
Kafka Architecture
[Diagram: Kafka ecosystem. A producer publishes to topics (Topic A, Topic B)
hosted on brokers (Kafka Broker 1, Kafka Broker 2) inside a Kafka cluster;
each topic is split into partitions, each message in a partition carries an
offset, and a consumer reads from the brokers. Zookeeper coordinates the
cluster.]
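The components in the diagram map onto broker configuration. A minimal, illustrative `server.properties` fragment for one broker might look like this (the values are placeholders, not recommendations):

```properties
# Illustrative broker config for one node in the cluster (example values)
broker.id=1                        # unique id per broker in the cluster
listeners=PLAINTEXT://:9092        # where producers and consumers connect
log.dirs=/var/lib/kafka/data       # where partition data is stored on disk
num.partitions=3                   # default partition count for new topics
default.replication.factor=2      # replicas per partition for fault tolerance
zookeeper.connect=localhost:2181   # Zookeeper used for cluster coordination
```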
Key concepts
• Kafka Ecosystem: The Kafka ecosystem refers to the entire suite of tools,
libraries, and components that complement Apache Kafka for building real-time
data pipelines, stream processing applications, and other distributed systems.

• Kafka Topic: A Kafka topic is a category or feed name to which messages are
published by producers. Topics are divided into partitions to allow parallelism and
scalability within a Kafka cluster. Each message published to a topic is appended
to one of its partitions.

• Kafka Broker: A Kafka broker is a Kafka server that runs in a Kafka cluster. It stores
and manages partitions, handles producer requests, and serves consumer
requests. Brokers are responsible for storing and replicating data across the
cluster.
• Kafka Cluster: A Kafka cluster is a group of Kafka brokers working together to
store and manage topics and handle the load from producers and consumers.
Kafka clusters provide scalability, fault tolerance, and high availability by
distributing data partitions across multiple brokers.

• Partition: A partition is a unit of parallelism in Kafka. Topics are divided into
partitions, and each partition is replicated across multiple brokers for fault
tolerance. Messages within a partition are ordered and assigned a sequential id
called an offset.

• Offset: An offset is a unique identifier assigned to each message within a
partition. Offsets are sequential integers that represent the position of a message
within the partition. Consumers use offsets to track their position in a partition
and retrieve messages.
• Zookeeper: Apache Zookeeper is a centralized service used by Kafka for
managing and coordinating Kafka brokers and maintaining cluster metadata. It
handles tasks such as leader election, maintaining configuration information, and
detecting broker failures. (Newer Kafka versions can also run without Zookeeper,
using KRaft mode.)

• Producer: A Kafka producer is a client application that publishes messages to
Kafka topics. Producers send messages (key-value pairs) to Kafka brokers, which
then append the messages to the appropriate topic partitions based on the
message key (optional) and partitioning strategy.

• Consumer: A Kafka consumer is a client application that subscribes to topics and
reads messages from Kafka brokers. Consumers read messages from partitions,
process them, and maintain their own offset to track their position in each
partition. Consumers can be part of a consumer group for load balancing and
parallelism.
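Several of the concepts above (key-based partitioning, per-partition ordering, offsets) can be sketched in a few lines. The hashing scheme below is illustrative only; real Kafka's default partitioner uses a murmur2 hash of the key:

```python
# Sketch of how a producer's key maps to a partition and how offsets work.
# Hypothetical stand-ins; real Kafka hashes keys with murmur2 on the client.
NUM_PARTITIONS = 3
partitions = [[] for _ in range(NUM_PARTITIONS)]

def produce(key, value):
    # Messages with the same key always land in the same partition,
    # so per-key ordering is preserved.
    p = hash(key) % NUM_PARTITIONS
    partitions[p].append(value)
    return p, len(partitions[p]) - 1  # (partition, offset)

p1, o1 = produce("driver-42", "loc-a")
p2, o2 = produce("driver-42", "loc-b")
```

Because both messages share the key `"driver-42"`, they land in the same partition with consecutive offsets, which is what lets a consumer replay a driver's locations in order.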
Key Features of Kafka
• Distributed Messaging System: Kafka is designed as a distributed messaging system,
providing a unified platform for handling real-time data feeds with high-throughput,
fault tolerance, and horizontal scalability.
• Partitioning: Kafka topics are divided into partitions, allowing data within a topic to be
distributed across multiple Kafka brokers. Partitioning enables horizontal scalability
and improves parallelism for data processing.
• Replication: Kafka replicates partitions across multiple brokers to ensure fault
tolerance and data durability. Replication ensures that data is not lost even if some
brokers or nodes fail.
• High Throughput: Kafka is optimized for high throughput and low latency, making it
suitable for handling large volumes of data and supporting real-time data processing
and analytics.
• Fault Tolerance: Kafka provides built-in replication and leader election mechanisms to
maintain availability and durability of data, even in the event of broker failures.
• Scalability: Kafka scales horizontally by adding more brokers to the cluster and
partitioning topics across multiple nodes. This scalability allows Kafka to handle
increasing data volumes and growing workloads.
• Streaming: Kafka supports stream processing with the Kafka Streams API and
integration with Kafka Connect for connecting Kafka with external systems
such as databases and data lakes.
• Exactly-once Semantics: Kafka supports exactly-once semantics for message
delivery between producers and consumers, using idempotent producers and
transactions. This ensures that messages are processed exactly once, addressing
concerns about data consistency.
• Connectivity and Integration: Kafka Connect simplifies integration with external
systems by providing connectors for various data sources and sinks. It allows
seamless data movement between Kafka and other systems.
• Ecosystem and Community: Kafka has a vibrant ecosystem with support for
monitoring, management, and integration tools. It is backed by a strong
community and active development, ensuring continuous improvement and
innovation.
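The exactly-once idea can be illustrated with offset commits: a consumer that commits its offset after processing, and resumes from the committed offset, does not reprocess messages on a retry. This is a deliberately simplified sketch; real Kafka achieves exactly-once delivery with idempotent producers and transactions, not this mechanism alone:

```python
# Simplified sketch of offset-based resume: a retry after the first pass
# starts from the committed offset and does not double-process messages.
messages = ["a", "b", "c"]   # stand-in for a topic partition
committed_offset = -1        # nothing consumed yet
processed = []

def process_from(offset):
    global committed_offset
    for i in range(offset + 1, len(messages)):
        processed.append(messages[i])
        committed_offset = i  # commit only after successful processing

process_from(committed_offset)   # first run processes a, b, c
process_from(committed_offset)   # "retry" finds nothing new to process
```

The second call is a no-op because the committed offset already points at the last message, which is the behavior "exactly once" is shorthand for.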
