CSC 610
DISTRIBUTED SYSTEMS
Dr. S. Pitchumani Angayarkanni
COURSE CONTENT
MODULE 1
• Introduction to Distributed systems
• Cluster coordination service and Distributed algorithms
• Cluster Management, Registration and Discovery
Introduction to Distributed Systems
• Overview:
• Where we can find Distributed systems
• The problems with centralized systems
• Distributed systems definitions and challenges
Where we can find Distributed systems
• Distributed systems are everywhere
• Watch a movie on demand
• Shop online
• Order a ride share service through our mobile phone
• Search for something online
Why Distributed Systems?
• These companies run highly scalable distributed systems in order to
• Handle millions of users
• Store petabytes of data
• Provide a consistent user experience
The cloud and distributed systems
• Even the simplest website hosted on the cloud is running on a distributed system.
• The cloud (AWS, Azure, GCP, …) is itself a complex distributed system designed
for companies and software developers, so we can focus on the product while the
cloud vendors handle the infrastructure.
The virtue of a Distributed system
• The beauty of a well-designed and well-implemented distributed system:
• Users are not aware of the complexities of the system
• It feels like a single machine on the other side of the internet connection, dedicated
specifically to you
A distributed system is a collection of independent computers that
appears to its users as a single coherent system
The Problems with Centralized Systems
• It is the opposite of a distributed system
• Startup company wanted to reach users through
website or App
• Create an online shopping experience for people to
buy video games / computers and share their views
on purchase with friends
• They decide to host their web site on the spare
computer
• The mobile app and website hosted in a
centralized server
• As the user base grows our system
cannot keep up with number of requests
1. Performance and Storage – a single computer limits both the requests it
can serve and the data it can store.
Vertical scalability
• We decide to upgrade our computer to
the latest and greatest most powerful
computer.
• This type of upgrade for a system is also
referred to as vertical scaling.
• As traffic to our system keeps increasing,
performance and memory become a
bottleneck, and eventually there is no
more powerful computer to upgrade to.
• Though we got more memory and
compute power from one single machine,
further vertical scaling is no longer an
option for us.
2. Single point of failure
• If there is a power or network
outage in our area, or we simply
need to restart the computer for
maintenance, our service would
go down, leading to a big loss in
revenue and a loss of our users'
trust.
3. High Latency
• Users from other continents
who want to check out our
website face a bad experience
in the form of slow page loads,
as the latency to our computer
grows with distance.
• There is no way for us to
improve the latency in this
configuration.
4. Security and Privacy
• Our computer is open to the
internet, which makes it
vulnerable to hackers.
• DDoS attacks and many other
threats cannot be handled by a
centralized system, and it offers
weak guarantees for security
and privacy.
Solution – Distributed System
• Horizontal scalability allows the system to grow and shrink on
demand
Distributed Systems Definition and challenges
• A distributed system is a system of several processes, running on different
computers, communicating with each other through the network, that are
sharing a state or working together to achieve a common goal.
Discussion on the terminologies
• What is a process?
• After we compile our application into an executable class or a jar file, it is stored on the file system just like any
other text, music, or image file
• When we launch the application, the operating system creates an instance of that application in memory
• That instance is called a process
• This process is entirely isolated from any other running process on the same computer, whether those
other processes are instances of the same application or of different applications
• Processes running on the same machine can communicate with each other
through the network, the file system, and shared memory, using techniques
that the operating system provides.
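As a small illustration of OS-provided inter-process communication, here is a minimal sketch (not from the slides; the function names are illustrative) of two isolated processes exchanging a message through a pipe supplied by the operating system:

```python
# Two separate processes exchanging a message through an OS pipe.
# Minimal illustrative sketch; not part of any framework.
from multiprocessing import Process, Pipe

def worker(conn):
    # Runs in a second, fully isolated process
    message = conn.recv()            # blocks until the parent sends
    conn.send(message.upper())       # reply through the same pipe
    conn.close()

def roundtrip(message):
    parent_end, child_end = Pipe()
    p = Process(target=worker, args=(child_end,))
    p.start()
    parent_end.send(message)         # cross-process send
    reply = parent_end.recv()        # cross-process receive
    p.join()
    return reply

if __name__ == "__main__":
    print(roundtrip("hello"))        # -> HELLO
```

The same idea extends to sockets and shared files; the key point is that the operating system, not the processes themselves, provides the channel.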
Still not Distributed
• All processes are running on the same
machine, still sharing all its resources,
and cannot scale beyond the
capacity of that particular computer.
Solution
• We put each process on a separate machine
and those processes are completely
decoupled from each other.
• They can scale horizontally as much as
needed meaning we can keep adding more
and more machines as we need to extend our
memory or processing power.
• If some machines become unavailable or
break down, the remaining processes keep
functioning, so the overall system can stay
available and continue performing its
tasks – something that is not trivial to
achieve.
• Decoupling the processes and placing
them on different machines leaves us with
the network as the only option for
communication between the processes.
• Once we establish the communication,
our job is to build those processes in such
a way that they maintain a shared view of
the world in the form of a state, or work
together to achieve a common goal.
1.2 Cluster Coordination
Terminology - Node
• A process running on a dedicated machine as a part of the distributed system
• This term originally comes from graph theory
• When two nodes have an edge between them that means the two processes
can communicate with each other through the network
Terminology - Cluster
• Collection of computers/nodes connected to each other.
• The nodes in a cluster are working on the same task, and typically are running
the same code.
• When we have a large amount of data to
analyze, or a complex computation to
solve, we want to hand the task over
to a cluster of nodes.
• Question: which part of the task is going
to be performed by which node? After all,
the biggest benefit of a distributed
system is that we can parallelize the
work and let each node work
independently toward the common goal.
Attempt 1
• We could manually distribute the work and assign each node a separate task,
but that would not be scalable
• We could receive 1000 tasks per second and so we need a programmatic way
to do that distribution.
Attempt 2
• We could manually designate one special
node as the leader, or master, node, in
charge of distributing the work and
collecting the results.
• This is better than our first approach.
• The problem is that any node can fail at
any time, including the master node.
• In a distributed system, failure is a question
of when, not if. If our leader is not there to
distribute the work or collect the results,
the entire cluster is effectively
decommissioned.
Attempt 3
• The solution is to build an algorithm that allows
the nodes to elect their own leader on demand
and makes them all watch the leader's health
closely.
• If the master node becomes unavailable, the
remaining nodes re-elect a new leader.
• Later, if the old leader recovers from its failure,
once it rejoins the cluster it should realize that
it is not the leader anymore and rejoin as a
regular node to help with the work. That is what
we want to achieve.
• But if we think about this architecture for a
moment: picking a leader in a large group, each
member with their own ego, is not a trivial
problem.
The way we are going to use ZooKeeper: instead of having our nodes communicate directly with
each other to coordinate the work, they are going to communicate with the ZooKeeper servers
directly.
• On the other side of the equation, ZooKeeper provides us with a very
familiar and easy-to-use software abstraction and data model that looks a lot
like a tree and is very similar to a file system. Each element in that tree, or
virtual file system, is called a znode.
Persistent znode:
• A persistent znode created by our application stays intact, with all its
children and data, even if our application disconnects from ZooKeeper
and then reconnects again.
Ephemeral znode:
• The exact opposite: it gets deleted as soon as the application that
created it disconnects from ZooKeeper. We can already guess that
ephemeral znodes would be a great tool for identifying when the node
that created them goes down.
Designing our first distributed algorithm – leader
election
• In step one every node that connects to zookeeper volunteers to become a
leader.
• Each node submits its candidacy by adding a znode that represents itself
under the election parent. Since ZooKeeper maintains a global order, it can
name each znode according to the order of its addition.
• In step two, after each node finishes creating its znode, it queries the
current children of the election parent. Notice that, because of the order
ZooKeeper provides, each node querying the children of the election
parent is guaranteed to see all the znodes created prior to its own znode's
creation.
• So in step three, if the znode that the current node created has the smallest
sequence number, the node knows that it is now the leader. On the other
hand, if its znode is not the smallest, then the node knows it is not the leader
and waits for instructions from the elected leader.
• This is how we break the symmetry and arrive at a global agreement on the
leader node.
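The three steps can be sketched with an in-memory stand-in for the election parent. This simulates only the sequence-number logic described above; it is not a real ZooKeeper client, and all names are illustrative:

```python
import itertools

class ElectionParent:
    """In-memory stand-in for the /election parent znode."""
    def __init__(self):
        self._seq = itertools.count(1)
        self.children = {}                    # znode name -> node id

    def volunteer(self, node_id):
        # Step 1: add a sequential znode representing this node;
        # the global order names znodes in order of addition.
        name = "c_%010d" % next(self._seq)    # ZooKeeper-style 10-digit suffix
        self.children[name] = node_id
        return name

    def is_leader(self, my_znode):
        # Steps 2-3: query the children; the smallest sequence
        # number wins, breaking the symmetry.
        return my_znode == min(self.children)

parent = ElectionParent()
znodes = {node: parent.volunteer(node) for node in ["A", "B", "C"]}
leader = [n for n, z in znodes.items() if parent.is_leader(z)]  # -> ['A']
```

Node A volunteered first, so its znode carries the smallest sequence number and every node independently computes the same winner.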
Zookeeper - Leader Election
• Let us analyze how a leader node can be elected in a ZooKeeper ensemble. Consider there are N number of nodes
in a cluster. The process of leader election is as follows −
• All the nodes create a sequential, ephemeral znode with the same path, /app/leader_election/guid_.
• ZooKeeper ensemble will append the 10-digit sequence number to the path and the znode created will
be /app/leader_election/guid_0000000001, /app/leader_election/guid_0000000002, etc.
• For a given instance, the node which created the znode with the smallest sequence number becomes the leader, and all the
other nodes are followers.
• Each follower node watches the znode having the next smallest number. For example, the node which creates
znode /app/leader_election/guid_0000000008 will watch the znode /app/leader_election/guid_0000000007 and
the node which creates the znode /app/leader_election/guid_0000000007 will watch the
znode /app/leader_election/guid_0000000006.
• If the leader goes down, then its corresponding znode /app/leader_election/guid_N gets deleted.
• The next in line follower node will get the notification through watcher about the leader removal.
• The next in line follower node will check if there are other znodes with a smaller number. If none, then it will
assume the role of the leader. Otherwise, it treats the node which created the znode with the smallest number as
the leader.
• Similarly, all other follower nodes elect the node which created the znode with the smallest number as leader.
• Leader election is a complex process when it is done from scratch. But ZooKeeper service makes it very simple.
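The watch chain and re-election above can be sketched the same way (again an in-memory simulation, not a ZooKeeper client; the znode names follow the example paths in the text):

```python
def elect(znodes):
    """Smallest sequence number leads; every other node
    watches the znode with the next-smallest number."""
    ordered = sorted(znodes)
    leader = ordered[0]
    watches = {ordered[i]: ordered[i - 1] for i in range(1, len(ordered))}
    return leader, watches

znodes = {"guid_0000000006", "guid_0000000007", "guid_0000000008"}
leader, watches = elect(znodes)
# guid_0000000008 watches guid_0000000007, which watches guid_0000000006

# When the leader's ephemeral znode is deleted, the node watching it
# is notified, re-runs the check over the survivors, and takes over:
leader2, _ = elect(znodes - {leader})
```

Because each follower watches only its immediate predecessor, a leader failure notifies exactly one node instead of stampeding the whole cluster.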
Zookeeper Configuration and Startup
• https://www.apache.org/dyn/closer.lua/zookeeper/zookeeper-3.8.0/apache-zookeeper-3.8.0-bin.tar.gz
• Extract the files to C:\
• Rename the configuration file conf\zoo_sample.cfg to conf\zoo.cfg
• Create a folder called logs for ZooKeeper's data
• On a fresh installation, /zookeeper is the only znode present
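A minimal standalone zoo.cfg consistent with these steps might look like this (the values and the Windows path are illustrative; dataDir should point at the logs folder just created):

```properties
# Minimal standalone ZooKeeper configuration (illustrative values)
tickTime=2000
dataDir=C:/apache-zookeeper-3.8.0-bin/logs
clientPort=2181
```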
Create an hierarchy of parent and child Z node
• The create command takes the path we want to create and the data
our znode will store. The ACL parameter stands for access control list,
which we can ignore at the moment. Let's create a znode called
/parent under the root znode and put some data in it.
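In the ZooKeeper CLI the session might look like this (the data strings are illustrative):

```shell
create /parent "some parent data"
create /parent/child "some child data"
ls /parent
get /parent/child
```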
Zookeeper
• Create znodes
• Get data
• Watch znode for changes
• Set data
• Create children of a znode
• List children of a znode
• Check Status
• Remove / Delete a znode
• Create Znodes
• Create a znode with the given path. The flag argument specifies whether the
created znode will be ephemeral, persistent, or sequential. By default, all znodes
are persistent.
• Ephemeral znodes (flag: e) will be automatically deleted when a session expires or
when the client disconnects.
• Sequential znodes guarantee that the znode path will be unique.
• ZooKeeper ensemble will add sequence number along with 10 digit padding to the
znode path. For example, the znode path /myapp will be converted to
/myapp0000000001 and the next sequence number will be /myapp0000000002. If
no flags are specified, then the znode is considered as persistent.
To create a Sequential znode, add -s flag as
shown below.
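For example, using the /myapp path from above (the data strings are illustrative):

```shell
create -s /myapp "first"
# Created /myapp0000000001
create -s /myapp "second"
# Created /myapp0000000002
```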
To create an Ephemeral Znode, add -e flag as
shown below.
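For example (the path and data here are illustrative):

```shell
create -e /myephemeral "session-bound data"
```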
Remember when a client connection is lost, the ephemeral
znode will be deleted. You can try it by quitting the ZooKeeper
CLI and then re-opening the CLI.
• Get Data
• It returns the data and metadata of the specified znode. You will get
information such as when the data was last modified, the transaction in
which it was modified, and the length of the data. This command is also
used to assign watches that show notifications about data changes.
To access a sequential znode, you must enter the full path of the znode.
Sample
get /FirstZnode0000000023
• Watch
• Watches show a notification when the specified znode or the znode's
children data changes. You can set a watch only with the get command.
• Set Data
• Set the data of the specified znode. Once you finish this set operation, you
can check the data using the get CLI command.
• Create Children / Sub-znode
• Creating children is similar to creating new znodes. The only difference is that
the path of the child znode will have the parent path as well.
• List Children
• This command is used to list and display the children of a znode.
• Check Status
• Status describes the metadata of a specified znode. It contains details such
as Timestamp, Version number, ACL, Data length, and Children znode.
• Remove a Znode
• Removes the specified znode and, recursively, all of its children. This
succeeds only if such a znode exists.
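Taken together, the operations above map to CLI commands roughly as follows (paths and data are illustrative; on ZooKeeper 3.8 the recursive delete command is deleteall, and -w is the newer watch syntax):

```shell
get /FirstZnode                  # read data and metadata
get -w /FirstZnode               # read and set a watch
set /SecondZnode "Data updated"  # update data
create /parent/child "data"      # create a sub-znode
ls /parent                       # list children
stat /FirstZnode                 # metadata (status) only
delete /parent/child             # remove a childless znode
deleteall /parent                # remove a znode and all its children
```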
• https://dzone.com/articles/running-apache-kafka-on-windows-os