
TIBCO solution brief

Getting Started with Apache Kafka

Developed at LinkedIn and now an Apache Software Foundation project, Apache Kafka provides a strong, simple, and straightforward approach to message distribution based on an append-only log. Kafka uses the common and well-understood publish/subscribe messaging paradigm: publishers publish messages to topics stored in a broker, and the broker delivers them to the consumers that subscribe to those topics.
Kafka contains three main components: the Kafka event broker; Kafka Connect, which connects producers and consumers to the broker; and Kafka Streams, for real-time data processing. Applications publish and subscribe to topics and topic partitions, while brokers handle distribution based on interest. Because Kafka is distributed, servers can be added to provide additional scale.

Figure 1: Basic Kafka Architecture (producer apps publish to the Kafka cluster; consumer apps subscribe to it)



Publish-Subscribe in Kafka
When a publisher publishes a message to a topic partition, the Kafka broker appends the message to the topic partition's physical log. This model allows messages to be indexed by their offset in the log, rather than the traditional approach of indexing and looking up messages by message ID. It thereby reduces complexity and, more importantly, reduces state management compared to other broker-based messaging systems.

Figure 2: Kafka Message Storage (a producer app appends messages to the partition log at sequential offsets 0–6; consumer apps read from the log)

Consumers are managed in much the same way as producers: they consume messages sequentially from a given topic partition. Because sequential consumption is built into the architecture, a consumer can acknowledge everything it has received simply by acknowledging the last message in the sequence. In addition, Kafka brokers do not maintain any information about consumption. This keeps them stateless and lets messages be purged after a configurable time period. New consumers coming online can replay history, and existing consumers can rewind and re-consume data on demand.
Because Kafka treats the topic stream like a log, the only per-consumer information retained on the Kafka server is the consumer's offset. The consumer's position in the stream is maintained on the Kafka server, but unlike other server-based messaging solutions, the rest of the metadata about the consumer is held in the consumer application. This method provides a fast and lightweight way to store and retrieve data.
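The log-and-offset model described above can be sketched in a few lines of Python. This is an illustrative toy, not the broker's actual implementation: the class and method names are invented for the example, and it shows only how offsets double as log indexes and how each consumer tracks its own position.

```python
# Toy sketch of Kafka's log-structured storage model (illustrative only,
# not the real broker implementation). Messages are appended to a list;
# the list index IS the offset, so no per-message ID lookup table is
# needed, and the broker-side structure stays stateless per consumer.

class TopicPartition:
    def __init__(self):
        self._log = []          # append-only message log

    def append(self, message):
        """Producer side: append and return the assigned offset."""
        self._log.append(message)
        return len(self._log) - 1

    def read(self, offset, max_messages=10):
        """Consumer side: read sequentially starting at `offset`."""
        return self._log[offset:offset + max_messages]


class Consumer:
    """Holds its own position; the 'broker' keeps no consumption state."""
    def __init__(self, partition):
        self.partition = partition
        self.offset = 0         # per-consumer offset, the only state kept

    def poll(self, max_messages=10):
        batch = self.partition.read(self.offset, max_messages)
        self.offset += len(batch)   # acknowledges the last message in sequence
        return batch

    def rewind(self, offset=0):
        """Replay retained history on demand, as Kafka allows."""
        self.offset = offset
```

A consumer that crashes and restarts needs to remember only a single integer to resume, and `rewind` mirrors Kafka's ability to replay retained history.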

The Kafka server persists message data for a configurable retention period. Consumers can replay and/or catch up to the data stream within the time period the administrator has configured for the stream to persist data. After this retention period, the message data is discarded and the space is reclaimed.
Like many messaging solutions, Kafka guarantees at-least-once delivery for each message, which means that in some cases a message may be received more than once. The community building Kafka has taken a less-is-more approach and provides this delivery model as the sole option, although it offers architectural guidelines for managing duplicate detection in the consumer application.
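Duplicate detection under at-least-once delivery can be handled by making the consumer idempotent. A minimal sketch, assuming each message carries a stable, unique key (the dictionary shape and function names here are invented for illustration):

```python
# Consumer-side duplicate detection under at-least-once delivery.
# Redeliveries carry the same key, so processing each key exactly once
# makes the consumer effectively idempotent. The message shape and
# names are illustrative assumptions, not a Kafka client API.

def process_once(messages, handler, seen=None):
    """Apply `handler` to each message whose key has not been seen yet."""
    seen = set() if seen is None else seen
    results = []
    for msg in messages:
        key = msg["key"]
        if key in seen:
            continue            # duplicate redelivery: skip it
        seen.add(key)
        results.append(handler(msg))
    return results
```

In practice the `seen` set would be bounded or persisted across restarts; the point is that deduplication lives in the consumer, keeping the broker stateless.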

Kafka Connect
Building on the simple approach and design that has made Kafka so attractive, the Kafka Connect toolkit provides a flexible and scalable approach to integrating with other systems. Kafka Connect defines a connector as the ingress or egress point for data, providing a common framework through which third-party systems interact with the core Kafka messaging system.
Like Kafka itself, Kafka Connect is designed as a simple, scalable approach to integration. It acts as a data pump into and out of Kafka core messaging: a source connector imports data, and a sink connector exports it.
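As a concrete example, a connector in Kafka Connect's standalone mode is configured with a small properties file. The FileStreamSource connector that ships with Apache Kafka streams lines of a file into a topic; the file path and topic name below are placeholders:

```properties
name=local-file-source
connector.class=FileStreamSource
tasks.max=1
file=/path/to/input.txt
topic=connect-test
```

A sink connector is configured the same way, with a sink `connector.class` and a `topics` setting naming the topics to export.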

Figure 3: Kafka Connect Source Connector (TIBCO FTL data enters the Kafka cluster through a source connector; producer and consumer apps attach to the cluster)



Figure 4: Kafka Connect Sink Connector (data leaves the Kafka cluster for a relational database through a sink connector; producer and consumer apps attach to the cluster)

The Kafka event broker treats Kafka Connect source and sink
connectors just like Kafka publishers and subscribers. The
Kafka core is not affected by how data comes in or goes out,
which keeps the broker architecture simple. The logic and
processing for a given data source or sink happens within
Kafka Connect through a special connector for the given
integration point.

Kafka Streams
Some applications require real-time stream processing on top of Kafka's simple publish/subscribe interface, and building stream processing into an application adds complexity. The Kafka Streams library lets developers invoke real-time stream processing without building it themselves: client applications can access functions purpose-built for real-time stream processing, such as data filtering, aggregation, and grouping.
The Kafka Streams interface gives client applications the flexibility not only to consume data natively from Kafka, but to transform it in the message flow, improving data visibility and access. This streaming approach opens up the Kafka message flow to applications built for data analytics, data monitoring, and real-time decision-making, supporting event-driven architectures.
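The filtering, aggregation, and grouping operations described above can be illustrated with a small sketch. Kafka Streams itself is a Java library; the Python below is not the Streams API but a plain in-memory analogue of the same operations, with invented function names:

```python
# In-memory analogue of two common Kafka Streams operations:
# filtering a stream of events, then counting events grouped by key.
# Illustrative only -- the real Streams API runs these continuously
# over topics, not once over a Python list.

from collections import Counter

def filter_stream(events, predicate):
    """Analogue of a Streams `filter` step: keep matching events."""
    return [e for e in events if predicate(e)]

def count_by_key(events, key_fn):
    """Analogue of `groupBy(...).count()`: tally events per key."""
    return Counter(key_fn(e) for e in events)
```

In a real Streams topology the equivalents would be `filter` and `groupBy(...).count()` running continuously over the topic rather than once over a list.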

Getting Started with Apache Kafka


Now that you understand how Kafka works, follow these steps to try it:

1. Download and install the software for your operating system from https://www.tibco.com/products/tibco-messaging/downloads
2. Start the ZooKeeper server, which manages the Kafka brokers
3. Start a Kafka broker
4. Create a topic
5. Publish and consume messages
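With a standard Apache Kafka distribution, steps 2–5 map onto the scripts shipped in its `bin/` directory. A minimal sketch, assuming a default install (script paths, the topic name, and the port depend on your distribution and configuration):

```shell
# 2. Start ZooKeeper, which manages the Kafka brokers
bin/zookeeper-server-start.sh config/zookeeper.properties

# 3. Start a Kafka broker (in a second terminal)
bin/kafka-server-start.sh config/server.properties

# 4. Create a topic named "quickstart"
bin/kafka-topics.sh --create --topic quickstart --bootstrap-server localhost:9092

# 5. Publish messages (type lines, then Ctrl-C) ...
bin/kafka-console-producer.sh --topic quickstart --bootstrap-server localhost:9092

# ... and consume them from the beginning of the log
bin/kafka-console-consumer.sh --topic quickstart --from-beginning --bootstrap-server localhost:9092
```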

For additional information on deploying Apache Kafka in Kubernetes environments and connecting it to other TIBCO Messaging components, see https://community.tibco.com/wiki/tibco-messaging-and-tibco-activespaces-article-links-quick-access

Conclusion
While Apache Kafka was built for real-time data distribution, it will not fit the requirements of every enterprise application. Alternatives such as Apache Pulsar, Eclipse Mosquitto, and others may be worth investigating, especially when requirements call for large-scale global infrastructure with built-in replication or for native IoT/MQTT support. For comparisons of Apache Kafka with other data distribution solutions, see the Resources section at https://www.tibco.com/solutions/apache-kafka.

Global Headquarters
3307 Hillview Avenue
Palo Alto, CA 94304
+1 650-846-1000 TEL
+1 800-420-8450
+1 650-846-1005 FAX
www.tibco.com

TIBCO Software Inc. unlocks the potential of real-time data for making faster, smarter decisions. Our Connected Intelligence platform seamlessly connects any application or data source; intelligently unifies data for greater access, trust, and control; and confidently predicts outcomes in real time and at scale. Learn how solutions to our customers' most critical business challenges are made possible by TIBCO at www.tibco.com.

©2020, TIBCO Software Inc. All rights reserved. TIBCO and the TIBCO logo are trademarks or registered trademarks of TIBCO Software Inc. or its subsidiaries in the United States and/or other countries. Apache, Kafka, and Pulsar are trademarks of The Apache Software Foundation in the United States and/or other countries. All other product and company names and marks in this document are the property of their respective owners and mentioned for identification purposes only.

16Sep2020
