100% found this document useful (1 vote)

675 views15 pages

Advanced Distributed Systems Guide

Distributed systems consist of hardware and software components located across a network that communicate by passing messages. Key implications include concurrency, lack of a global clock, independent component failures, unreliable communication, and expensive coordination. Distributed systems aim for resource sharing, openness, scalability, fault tolerance, and heterogeneity. There are three main types: distributed computing systems for high performance tasks, distributed information systems for business functions, and distributed pervasive systems for mobile devices. Design challenges include heterogeneity, transparency, openness, concurrency, security, and scalability. Common architectural models are client-server and peer-to-peer.

Uploaded by

shashank_w85m_312965

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

675 views15 pages

Advanced Distributed Systems Guide

Uploaded by

shashank_w85m_312965

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

UNIT 1 ADVANCED DISTRIBUTED SYSTEMS

A distributed system consists of hardware and software components located in a network of

computers that communicate and coordinate their actions only by passing messages.
Implications of distributed systems

Concurrency components execute in concurrent processes that read and update

shared resources. Requires coordination
No global clock makes coordination difficult (ordering of events)
Independent failure of components partial failure & incomplete information
Unreliable communication Loss of connection and messages. Message bit errors
Unsecure communication Possibility of unauthorized recording and modification of
messages.
Expensive communication Communication between computers usually has less
bandwidth, longer latency, and costs more, than between independent processes on the
same computer.

Goals of distributed systems

Resource sharing the possibility of using available resources anywhere.

Openness an open distributed system can be extended and improved incrementally.
requires publication of component interfaces and standards protocols for accessing
interfaces.
Scalability the ability to serve more users, provide acceptable response times with
increased amount of data.
Fault tolerance maintain availability even when individual components fail.
Allow heterogeneity network and hardware, operating system, programming
languages, implementations by different developers.

Types of distributed system

1. Distributed Computing Systems
Used for high performance computing tasks
Cluster computing systems
Grid computing systems
2. Distributed Information Systems
Systems mainly for management and integration of business functions
Transaction processing systems
Enterprise Application Integration
3. Distributed Pervasive (or Ubiquitous) Systems
Mobile and embedded systems
Home systems
Sensor networks

Designing the distributed systems does not come for free. Some challenges need to be
overcome in order to get the ideal systems. The challenges in distributed systems are:

Heterogeneity
This term means the diversity of the distributed systems in terms of hardware, software,
platform, etc. Modern distributed systems will likely span different:

Hardware devices: computers, tablets, mobile phones, embedded devices, etc.

Operating System: Ms Windows, Linux, Mac, UNIX, etc.

Network: Local network, the Internet, wireless network, satellite links, etc.

Programming Languages: Java, C/C++, Python, PHP, etc.

Different roles of software developers, designers, system managers

Transparency
Distributed systems designers must hide the complexity of the systems as much as they
can. Adding abstraction layer is particularly useful in distributed systems. While users hit
search in [Link], they never notice that their query goes through a complex process
before google shows them a result. Some terms of transparency in distributed systems
are:

Openness
If the well-defined interfaces for a system are published, it is easier for developers to add
new features or replace sub-systems in the future. Example: Twitter and Facebook have
API that allows developers to develop their own software interactively.

Concurrency
Distributed Systems usually is multi-users environment. In order to maximize
concurrency, resource handling components should be anticipate as they will be accessed
by competing users. Concurrency is a tricky challenges, then we must avoid the systems
state from becoming unstable when users compete to view or update data.

Security
Every system must consider strong security measurement. Distributed Systems somehow
deals with sensitive information; so secure mechanism must be in place.

Scalability
Distributed systems must be scalable as the number of user increases.
Scalability has 3 Dimensions:

Size: Number of users and resources to be processed. Problem associated is overloading

Geography: Distance between users and resources. Problem associated is
communication reliability

Administration: As the size of distributed systems increases, many of the system

needs to be controlled. Problem associated is administrative mess.

Resilience to Failure

Distributed Systems involves a lot of collaborating components (hardware, software,

communication). So there is a huge possibility of partial or total failure.

Architectural Models
How are responsibilities distributed between system components and how are these
components placed?
Client-server model
The system is structured as a set of processes, called servers that offer services to the users,
called clients.
The client-server model is usually based on a simple request/reply protocol, implemented with
send/receive primitives or using remote procedure calls (RPC) or remote method invocation
(RMI): - the client sends a request (invocation) message to the server asking for some service;
- the server does the work and returns a result (e.g. the data requested) or an error code if the
work could not be performed.
A server can itself request services from other servers; thus, in this new relation, the server
itself acts like a client.

Peer-to-peer
All processes (objects) play similar role.
Processes (objects) interact without particular distinction between clients and servers.

The pattern of communication depends on the particular application.

A large number of data objects are shared; any individual computer holds only a small part
of the application database.
Processing and communication loads for access to objects are distributed across many
computers and access links.
This is the most general and flexible model.

For example, be distributed computing, file-sharing, distributed storage, communication, or

real-time media streaming. Ideally, there is no centralized entity to control, organize,
administer, or maintain the entire system. Instead, these functions are divided and supported
by all peers. Peers cooperate by sharing resources such as storage, CPU cycles, network
bandwidth, and data. A number of benefits are obtained by adopting the P2P paradigm to
implement distributed applications.
These benefits include:
(1) Improved scalability by aggregating resources from peers and reducing the reliance on
centralized servers.
(2) Cost-effectiveness by utilizing already-deployed resources and eliminating the need for
expensive infrastructure.
(3) deplorability by performing all processing at the end systems.
Peer-to-Peer Applications
According to the software architecture model described in Section 1.2, P2P applications are
built on top of P2P substrates. The P2P substrate provides file lookup and peer management
services to the P2P application. Many distributed applications can leverage the P2P paradigm.
In this section, we present different categories of applications that either have been proposed
in the literature, or have been deployed in the real world. We do not intend to provide
exhaustive coverage of all possible P2P applications.

File Sharing
File sharing is the simplest and the most widely-deployed application in P2P systems. A filesharing application uses the P2P substrate to discover peers who have a requested file. Once
one or more suppling peers have been found, connections are established between the
supplier(s) and the requester. The application does not do more than storing and providing
files to requesting peers.
Media Streaming and High-bandwidth Content Distribution
In file-sharing P2P applications, a client has to download the entire file before starting using
it. Consider for example, a one-hour movie recorded at 1 Mb/s, and being downloaded by a
client with an in-bound bandwidth of 1.5 Mb/s. Ignoring all protocols overhead and
retransmissions, the client will have to wait for 40 minutes to start watching the movie! Given
that most of the contents distributed over the current P2P systems are multimedia files [17],
P2P media streaming applications have been receiving increasing attention in the research
community [40]. Real-time streaming applications start playing out the requested movie after
a short (e.g., order of seconds) waiting period.
File and Storage Systems
Distributed file systems provide logical functions similar to those provided by a centralized
file server, but they are constructed from physically distributed peers.
Some problems with client-server:
Centralisation of service poor scaling
Limitations: capacity of server bandwidth of network connecting the server
Peer-to-Peer tries to solve some of the above
It distributes shared resources widely
share computing and communication loads
Problems with peer-to-peer:
High complexity due to
- cleverly place individual objects
- retrieve the objects
- maintain potentially large number of replicas

Introduction to interprocess communications

Interprocess communication (IPC) is the set of tools provided by the OS to allow processes
that do not share common memory segments to communicate with each other
UNIX pipes is one of these tools: it is an instance of message passing
Message passing comprises two basic primitives:
send(destination, this_msg, msg_length);
receive(source, a_msg, &how_longength);

External data representation and marshalling

The Pub/Sub Model

The pub/sub communication model defines three different roles for entities in the system.
Publishers are sources that inject their information using publication messages. Subscribers
are information sinks and act as consumers of publications that were produced by publishers.
For this purpose, a subscriber issues a subscription message to specify the type of publications
that it would like to consume. Publishers and subscribers are collectively considered to be
clients for the pub/sub middleware which is then responsible for delivering published
messages to subscribers by taking their subscription interests into account. Typically, this
involves forwarding of publications through a number of intermediary nodes in an overlay
network. These forwarding nodes collectively realize the pub/sub service and are referred to
as service providers. In the pub/sub model, service providers store clients subscriptions and
use it to determine which publications must be delivered to which subscribers. This has the
advantage of eliminating the need for publishers or subscribers to be consciously aware of one
another.

Based on the expressiveness of the language used to represent subscribers interests, pub/sub
systems are commonly classified into two types, namely topic-based and content-based [53].
Simply put, both topic-based and content-based pub/sub systems allow a subscriber to issue
subscriptions that declare filtering constraints on the produced publication messages. Only
publications that satisfy these constraints are delivered to a subscriber. These publications are
said to match the clients subscription. Topic-based pub/sub systems support simple
constraints that are based upon a predefined set of topics.

Dependability in Pub/Sub Systems

Different aspects of dependability of operation in pub/sub systems.
Reliability: Reliable publication delivery concerns assurances provided by the pub/sub
implementation regarding successful delivery of individual publications. For example, highly
sensitive stock market information must be delivered to interested traders as failure to meet
this requirement can potentially lead to lost trading opportunities and cause financial loss. As
a result, reliability of the pub/sub system in this application scenario is of great importance.
Ordered delivery: Ordering guarantees concern assurances regarding the order of
successive publications that are delivered to the subscribers. For example, if the pub/sub

system provides total ordering guarantee, then all traders will receive stock quotes in the exact
same order. This may be useful in order to ensure an even and unbiased playing field for all
competing traders. Alternatively, the system may provide causal ordering such that the
precedence relationships between messages are preserved. This can be useful to study how
traders react to delivery of market news, for instance.
Recovery from failures: Distributed applications are commonly composed of faultprone
processes and networking components which may cease to operate at any point in time or
become disconnected from one another. For example, service providers may instantaneously
crash at any time or the machines that they run on may be unplugged suddenly. Likewise,
communication links may be unreliable and experience long periods of disconnections.
Occurrence of such failures in an unprepared pub/sub system can significantly hinder its
operation and even permanently disrupt its availability. To recover from such failure
scenarios, the system must have built-in recovery mechanisms that ensure such disruptions are
temporary and do not impact the operation of the system in the long run. In a distributed
pub/sub system, recovery typically involves amending the pub/sub overlay (i.e., maintaining
connectivity among service providers despite failures) and updating routing tables of service
providers accordingly in order to setup new forwarding paths in the network (i.e., in order to
re-route publications).

Cloud computing
In the simplest terms, cloud computing means storing and accessing
data and programs over the Internet instead of your computer's hard
drive. The cloud is just a metaphor for the Internet. It goes back to the
days of flowcharts and presentations that would represent the gigantic
server-farm infrastructure of the Internet as nothing but a puffy, white
cumulonimbus cloud, accepting connections and doling out information
as it floats.
What cloud computing is not about is your hard drive. When you store
data on or run programs from the hard drive, that's called local storage
and computing. Everything you need is physically close to you, which
means accessing your data is fast and easy, for that one computer, or
others on the local network. Working off your hard drive is how the
computer industry functioned for decades; some would argue it's still
superior to cloud computing, for reasons I'll explain shortly.

For it to be considered "cloud computing," you need to access your

data or your programs over the Internet, or at the very least, have that
data synchronized with other information over the Web. In a big

business, you may know all there is to know about what's on the other
side of the connection; as an individual user, you may never have any
idea what kind of massive data-processing is happening on the other
end. The end result is the same: with an online connection, cloud
computing can be done anywhere, anytime.
Common Cloud Examples
The lines between local computing and cloud computing sometimes get
very, very blurry. That's because the cloud is part of almost everything
on our computers these days. You can easily have a local piece of
software (for instance, Microsoft Office 365 ) that utilizes a form of
cloud computing for storage
Some other major examples of cloud computing you're probably using:
Google Drive : This is a pure cloud computing service, with all the
storage found online so it can work with the cloud apps: Google Docs,
Google Sheets, and Google Slides. Drive is also available on more
than just desktop computers; you can use it on tablets like
the iPad $335.00 at Amazon or on smartphones, and there are separate
apps for Docs and Sheets, as well. In fact, most of Google's services
could be considered cloud computing: Gmail, Google Calendar, Google
Maps, and so on.
Apple iCloud : Apple's cloud service is primarily used for online storage,
backup, and synchronization of your mail, contacts, calendar, and
more. All the data you need is available to you on your iOS, Mac OS,
or Windows device (Windows users have to install the iCloud control
panel). Naturally, Apple won't be outdone by rivals: it offers cloudbased versions of its word processor (Pages), spreadsheet (Numbers),
and presentations (Keynote) for use by any iCloud subscriber. iCloud is
also the place iPhone users go to utilze the Find My iPhone feature
that's all important when the phone goes missing.
Hybrid services like Box, Dropbox , and SugarSync all say they work in
the cloud because they store a synced version of your files online, but
most also sync those files with local storage. Synchronization to allow
all your devices to access the same data is a cornerstone of the cloud
computing experience, even if you do access the file locally.

Fundamentals of Distributed Systems
100% (1)
Fundamentals of Distributed Systems
20 pages
Distributed Systems REPORT
No ratings yet
Distributed Systems REPORT
39 pages
Distributed Systems
No ratings yet
Distributed Systems
17 pages
Principles of Distributed Systems
No ratings yet
Principles of Distributed Systems
31 pages
Overview of Distributed Systems
100% (1)
Overview of Distributed Systems
45 pages
Chapter 1-Introduction To Distributed Systems
No ratings yet
Chapter 1-Introduction To Distributed Systems
59 pages
Distributed System PPT 40
No ratings yet
Distributed System PPT 40
18 pages
SDN vs NFV: A Technical Comparison
No ratings yet
SDN vs NFV: A Technical Comparison
8 pages
Distributed Systems (Cosc 6003) : Chapter 1 - Introduction
No ratings yet
Distributed Systems (Cosc 6003) : Chapter 1 - Introduction
37 pages
Distributed Systems Characterization and Design
No ratings yet
Distributed Systems Characterization and Design
35 pages
Virtualization Essentials: History & Types
100% (3)
Virtualization Essentials: History & Types
28 pages
Distributed Systems
100% (1)
Distributed Systems
35 pages
Distributed System 25 Questions
No ratings yet
Distributed System 25 Questions
19 pages
Distributed Systems Overview
No ratings yet
Distributed Systems Overview
70 pages
4 Virtual Machine Provisioning and Migration Services
No ratings yet
4 Virtual Machine Provisioning and Migration Services
31 pages
Lecture 6 - Synchronization
No ratings yet
Lecture 6 - Synchronization
16 pages
Cloud Computing UNIT-I PPT - PPSX
No ratings yet
Cloud Computing UNIT-I PPT - PPSX
61 pages
Unit 4
100% (1)
Unit 4
33 pages
Cloud Platform Architecture Over
No ratings yet
Cloud Platform Architecture Over
71 pages
Unit Iii Virtualization Infrastructure and Docker Desktop Virtualization
No ratings yet
Unit Iii Virtualization Infrastructure and Docker Desktop Virtualization
20 pages
Lecture 1 Distributed Notes
No ratings yet
Lecture 1 Distributed Notes
23 pages
Cloud Storage Solutions Explained
100% (1)
Cloud Storage Solutions Explained
51 pages
Distributed Computing
No ratings yet
Distributed Computing
11 pages
Cloud Computing2-1
No ratings yet
Cloud Computing2-1
29 pages
GNS3 Your Network Simulation Playground
No ratings yet
GNS3 Your Network Simulation Playground
10 pages
Cloud Computing Unit - 3 Final
No ratings yet
Cloud Computing Unit - 3 Final
43 pages
TCP/IP Protocol Architecture Overview
100% (1)
TCP/IP Protocol Architecture Overview
3 pages
Cloud Computing Unit 3
No ratings yet
Cloud Computing Unit 3
48 pages
Intelligent Routing in Network Architecture
100% (1)
Intelligent Routing in Network Architecture
28 pages
SDN for Network Professionals
No ratings yet
SDN for Network Professionals
30 pages
Chapter 5 - Theoretical Foundations
No ratings yet
Chapter 5 - Theoretical Foundations
36 pages
Computer Networks Overview
No ratings yet
Computer Networks Overview
158 pages
CC Unit 1
100% (1)
CC Unit 1
11 pages
Chapter 3 - Data Link Layer
No ratings yet
Chapter 3 - Data Link Layer
46 pages
Module 2
No ratings yet
Module 2
7 pages
Google App Engine Overview and Features
No ratings yet
Google App Engine Overview and Features
44 pages
Distributed Computing Environment
No ratings yet
Distributed Computing Environment
42 pages
CSD4001 - SVT Lab Record Note - Sample
No ratings yet
CSD4001 - SVT Lab Record Note - Sample
78 pages
Cc-Unit-2
No ratings yet
Cc-Unit-2
99 pages
Sna Unit-I
No ratings yet
Sna Unit-I
10 pages
B.Tech Computer Networks Exam
100% (2)
B.Tech Computer Networks Exam
6 pages
CN Lab Manual
100% (1)
CN Lab Manual
46 pages
Cloud Computing
100% (1)
Cloud Computing
22 pages
Introduction to Distributed Systems
No ratings yet
Introduction to Distributed Systems
29 pages
Module 3 WMC
100% (1)
Module 3 WMC
16 pages
Securing The Cloud
No ratings yet
Securing The Cloud
8 pages
Virtualization for IT Professionals
No ratings yet
Virtualization for IT Professionals
56 pages
UNIT 1-System Models Distributed Computing
No ratings yet
UNIT 1-System Models Distributed Computing
70 pages
Cisco Packet Tracer Lab Manual 2
100% (1)
Cisco Packet Tracer Lab Manual 2
6 pages
Cloud Computing: Models and Challenges
No ratings yet
Cloud Computing: Models and Challenges
26 pages
MSCS201 Module-02
No ratings yet
MSCS201 Module-02
23 pages
Distributed Mutex & Deadlock
No ratings yet
Distributed Mutex & Deadlock
21 pages
Grid Architecture
No ratings yet
Grid Architecture
19 pages
IoT Architecture & Standards Guide
No ratings yet
IoT Architecture & Standards Guide
17 pages
CS3551 DC - Int - I - Answer Key 7.9.23
No ratings yet
CS3551 DC - Int - I - Answer Key 7.9.23
68 pages
Computer Networks Lecture Notes
0% (1)
Computer Networks Lecture Notes
50 pages
MC4203 - Cloud Computing Technologies
No ratings yet
MC4203 - Cloud Computing Technologies
98 pages
Unit 1distributed
No ratings yet
Unit 1distributed
18 pages
Types of Distributed Computing Jobs
No ratings yet
Types of Distributed Computing Jobs
38 pages
Lecture 1 Introduction To Distributed Systems - 034922
No ratings yet
Lecture 1 Introduction To Distributed Systems - 034922
6 pages
Soft v3 n12 2010 17
No ratings yet
Soft v3 n12 2010 17
13 pages
Session16385 WAS Timeouts
No ratings yet
Session16385 WAS Timeouts
75 pages
Legal Aspects of Business
No ratings yet
Legal Aspects of Business
34 pages
How Messaging and Queuing Works: Messaging: Because Programs Communicate by Sending Each Other Data in
No ratings yet
How Messaging and Queuing Works: Messaging: Because Programs Communicate by Sending Each Other Data in
7 pages
Visual FoxPro Outlook-Style Alerts
No ratings yet
Visual FoxPro Outlook-Style Alerts
24 pages
Lesson 1 Week 3 e Tech
No ratings yet
Lesson 1 Week 3 e Tech
2 pages
VEX Marble Sorting Machine Design
No ratings yet
VEX Marble Sorting Machine Design
17 pages
Elizabeth Berlin Resume
No ratings yet
Elizabeth Berlin Resume
1 page
Testbank For Voyages in World History 4th Edition Hansen Instant Download
0% (1)
Testbank For Voyages in World History 4th Edition Hansen Instant Download
18 pages
Math 218 (Fall 2008) Quiz 2 Solutions TA: Wei Lin Name: Section (Circle One) : 2pm 3pm 4pm
No ratings yet
Math 218 (Fall 2008) Quiz 2 Solutions TA: Wei Lin Name: Section (Circle One) : 2pm 3pm 4pm
2 pages
Libro - Algebra - UNI
No ratings yet
Libro - Algebra - UNI
346 pages
PenAWAran Lidar Candi - Tisa
No ratings yet
PenAWAran Lidar Candi - Tisa
13 pages
Good Questions For Coding Placements
No ratings yet
Good Questions For Coding Placements
11 pages
Industrial Water Flow Control
No ratings yet
Industrial Water Flow Control
6 pages
CHAPTER 6
No ratings yet
CHAPTER 6
100 pages
How To Write A Spreadsheet For Calculating Age
No ratings yet
How To Write A Spreadsheet For Calculating Age
2 pages
Assignment 1: Go Phish (Group Assignment) : Do Not
No ratings yet
Assignment 1: Go Phish (Group Assignment) : Do Not
3 pages
NOSS Core Abilities
100% (3)
NOSS Core Abilities
73 pages
Netflix Debunker 3.0
No ratings yet
Netflix Debunker 3.0
78 pages
Resume Ryan Cardoza
No ratings yet
Resume Ryan Cardoza
2 pages
Ruckus Vs Cisco
No ratings yet
Ruckus Vs Cisco
5 pages
Pharmacy Management System Details Paginated
No ratings yet
Pharmacy Management System Details Paginated
8 pages
IKI-LUM_30 Non-Directional CT Overview
No ratings yet
IKI-LUM_30 Non-Directional CT Overview
3 pages
Project On School Management System
No ratings yet
Project On School Management System
13 pages
String Functions
No ratings yet
String Functions
9 pages
Iot Based Sanitization Robot Using Raspberry Pi Zero W and Uv Lamp
No ratings yet
Iot Based Sanitization Robot Using Raspberry Pi Zero W and Uv Lamp
30 pages
Original Message
No ratings yet
Original Message
309 pages
Technology and Domestic Violence
No ratings yet
Technology and Domestic Violence
21 pages
TBS Encoder SRT Configuration Guide
No ratings yet
TBS Encoder SRT Configuration Guide
6 pages
1 ST
No ratings yet
1 ST
14 pages
Cavity Radiation in Abaqus/Standard
No ratings yet
Cavity Radiation in Abaqus/Standard
4 pages
Multimedia Networks for Students
No ratings yet
Multimedia Networks for Students
66 pages
TRANSEC Tech Brief - 0418 - FINAL
No ratings yet
TRANSEC Tech Brief - 0418 - FINAL
2 pages
Yllana Bay View College, Inc.: Teaching Guide For
No ratings yet
Yllana Bay View College, Inc.: Teaching Guide For
4 pages

Advanced Distributed Systems Guide

Uploaded by

Advanced Distributed Systems Guide

Uploaded by

UNIT 1 ADVANCED DISTRIBUTED SYSTEMS

A distributed system consists of hardware and software components located in a network of

Concurrency components execute in concurrent processes that read and update

Goals of distributed systems

Resource sharing the possibility of using available resources anywhere.

Types of distributed system

Hardware devices: computers, tablets, mobile phones, embedded devices, etc.

Operating System: Ms Windows, Linux, Mac, UNIX, etc.

Programming Languages: Java, C/C++, Python, PHP, etc.

Different roles of software developers, designers, system managers

Size: Number of users and resources to be processed. Problem associated is overloading

Administration: As the size of distributed systems increases, many of the system

Distributed Systems involves a lot of collaborating components (hardware, software,

The pattern of communication depends on the particular application.

For example, be distributed computing, file-sharing, distributed storage, communication, or

Introduction to interprocess communications

External data representation and marshalling

The Pub/Sub Model

Dependability in Pub/Sub Systems

For it to be considered "cloud computing," you need to access your

You might also like