0% found this document useful (0 votes)

38 views

Distributed systems Chapter 1-Introduction

The document provides an introduction to distributed systems, defining them as collections of independent computers that work together to appear as a single system to users. It discusses the evolution of computing from centralized systems to distributed systems, highlighting their benefits such as resource sharing, reliability, and scalability, while also addressing challenges like concurrency and security. Additionally, it outlines the goals of distributed systems, including transparency, openness, and scalability, and categorizes different types of distributed systems.

Uploaded by

Endemariam Mehari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views

Distributed systems Chapter 1-Introduction

Uploaded by

Endemariam Mehari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 34

Kibru College

Department of Computer Science

Introduction to Distributed Systems

By:
Girmay G.
[email protected]
Chapter 1 - Introduction
1.1 Introduction and Definition
 before the mid-80s, computers were
 very expensive (hundred of thousands or even millions of
dollars)
 very slow (a few thousand instructions per second)
 not connected among themselves
 after the mid-80s: two major developments
 cheap and powerful microprocessor-based computers appeared
 computer networks
 LANs at speeds ranging from 10 to 1000 Mbps
 WANs at speed ranging from 64 Kbps to gigabits/sec
 consequence
 feasibility of using a large network of computers to work for
the same application; this is in contrast to the old centralized
systems where there was a single computer with its peripherals
3
 Definition of a Distributed System
 a distributed system is:
a collection of independent computers that
appears to its users as a single coherent system -
computer (Tanenbaum & Van Steen)

 This definition has two aspects:

1. hardware: autonomous machines
2. software: a single system view for the users

4
 Other Definitions
A distributed system is a system designed to support the
development of applications and services which can
exploit a physical architecture consisting of multiple,
autonomous processing elements that do not share primary
memory but cooperate by sending asynchronous messages
over a communication network (Blair & Stefani)

A distributed system is one that stops you getting any work

done when a machine you’ve never even heard of crashes
(Leslie)

5
Why Distributed?
 Resource and Data Sharing
 printers, databases, multimedia servers, ...
 Availability, Reliability
 the loss of some instances can be hidden
 Scalability, Extensibility
 the system grows with demand (e.g., extra
servers)
 Performance
 huge power (CPU, memory, ...) available
 Inherent distribution, communication
 organizational distribution, e-mail, video 6
Problems of Distribution
 Concurrency, Security
 clients must not disturb each other
 Privacy
 e.g., when building a preference profile
 unwanted communication such as spam
 Partial failure
 we often do not know where the error is (e.g., RPC)
 Location, Migration, Replication
 clients must be able to find their servers
 Heterogeneity
 hardware, platforms, languages, management 7
Characteristics of Distributed Systems
 differences between the computers and the ways they
communicate are hidden from users
 users and applications can interact with a distributed
system in a consistent and uniform way regardless of
location
 distributed systems should be easy to expand and scale
 a distributed system is normally continuously available,
even if there may be partial failures

8
1.2 Goals of a Distributed System
 to support heterogeneous computers and networks and to
provide a single-system view, a distributed system is often
organized by means of a layer of software called middleware
that extends over multiple machines

a distributed system organized as middleware; note that the

middleware layer extends over multiple machines, and offers each
application the same interface
9
Goals of a distributed system: a distributed system should
 easily connect users with resources (printers,
computers, storage facilities, data, files, Web pages, ...)
 reasons: economics, to collaborate and exchange
information
 be transparent: hide the fact that the resources and
processes are distributed across multiple computers
 be open
 be scalable
Transparency in a Distributed System
 a distributed system that is able to present itself to users
and applications as if it were only a single computer
system is said to be transparent
10
 different forms of transparency in a distributed system
Transparency Description
Access Hide differences in data representation
(endianness, file naming, ...) and how a resource
is accessed
Location Hide where a resource is physically located; where
is http://www.prenhall.com/index.html? (naming)
Migration Hide that a resource may move to another location
Relocation Hide that a resource may be moved to another
location while in use; e.g., mobile users using their
wireless laptops
Replication Hide that a resource is replicated
Concurrency Hide that a resource may be shared by several
competitive users; a resource must be left in a
consistent state
Failure Hide the failure and recovery of a resource
 But trying to achieve all distribution transparency may be
impossible or may not be a good idea 11
 Openness in a Distributed System
 a distributed system should be open
 we need well-defined interfaces
 interoperability
 components of different origin can communicate
 portability
 components work on different platforms
 another goal of an open distributed system is that it should be flexible
and extensible; easy to configure the system out of different
components; easy to add new components, replace existing ones;
easier said than done
 an Open Distributed System is a system that offers services according
to standard rules that describe the syntax and semantics of those
services; e.g., protocols in networks
 standards - a necessity
 should allow competition in non-normative areas
12
 in distributed systems, such services are often specified
through interfaces often described using an Interface
Definition Language (IDL)
 specify only syntax: the names of the functions, types
of parameters, return values, possible exceptions, ...
 Semantics are given in an informal way by means of
natural languages

 Scalability in Distributed Systems

 a distributed system should be scalable
 size: adding more users and resources to the system
 geographically: users and resources may be far apart
 administratively: should be easy to manage even if it
spans many administrative organizations
 but a scalable system may exhibit performance problems
13
 scalability problems

Concept Example
Single server for all users-mostly for security
Centralized services
reasons
Centralized data A single on-line telephone book
Centralized algorithms Doing routing based on complete information
examples of scalability limitations

 Scaling Techniques
 how to solve scaling problems
 the problem is mainly performance, and arises as a result of
limitations in the capacity of servers and networks (for geographical
scalability)
 three possible solutions: hiding communication latencies, distribution,
and replication

14
a. Hide Communication Latencies
 try to avoid waiting for responses to remote
service requests
 let the requester do other useful job
 i.e., construct requesting applications that use
only asynchronous communication instead of
synchronous communication; when a reply
arrives the application is interrupted
 good for batch processing and parallel
applications but not for interactive applications
 for interactive applications, move part of the job
to the client to reduce communication; e.g.
filling a form and checking the entries
15
(a) a server checking the correctness of field entries
(b) a client doing the job
 e.g., checking the completeness of mandatory fields
 shipping code is now supported in Web applications using
Java Applets and Javascript
16
b. Distribution
 e.g., DNS - Domain Name System ([email protected])
 divide the name space into nonoverlapping zones
 for details, see later in Chapter 5 - Naming

an example of dividing the DNS name space into zones

17
c. Replication
 replicate components across a distributed system to
increase availability and for load balancing, leading to
better performance
 decided by the owner of a resource
 caching (a special form of replication) also reduces
communication latency; decided by the user
 but, caching and replication may lead to consistency
problems (see Chapter 7 - Consistency and Replication)

18
Pitfalls when Developing Distributed Systems
 False assumptions made by first time developers
 The network is reliable
 The network is secure
 The network is homogeneous
 The topology does not change
 Latency is zero
 Bandwidth is infinite
 Transport cost is zero
 There is one administrator

19
1.3 Types of Distributed Systems
 Three types: distributed computing systems, distributed
information systems, and distributed embedded systems
1. Distributed Computing Systems
 Used for high-performance computing tasks
 two types: cluster computing and grid computing
 Cluster Computing
 a collection of similar workstations or PCs
(homogeneous), closely connected by means of a
high-speed LAN
 each node runs the same operating system
 used for parallel programming in which a single
compute intensive program is run in parallel on
multiple machines

20
an example of a cluster computing system

21
 Grid Computing
 “Resource sharing and coordinated problem solving
in dynamic, multi-institutional virtual organizations”
(I. Foster)
 high degree of heterogeneity: no assumptions are
made concerning hardware, operating systems,
networks, administrative domains, security policies,
etc.
2. Distributed Information Systems
 problem: many networked applications with a problem of
interoperability
 at the lowest level: wrap a number of requests into a
single larger request and have it executed as a
distributed transaction
 how to let applications communicate directly with each
other, i.e., Enterprise Application Integration (EAI)

22
 Transaction Processing Systems
 Consider database applications
 special primitives are required to program transactions,
supplied either by the underlying distributed system or
by the language runtime system
 exact list of primitives depends on the type of application

Primitive Description
BEGIN_TRANSACTION Mark the start of a transaction
Terminate the transaction and try to
END_TRANSACTION
commit
Kill the transaction and restore the old
ABORT_TRANSACTION
values
Read data from a file, a table, or
READ
otherwise
Write data to a file, a table, or
WRITE
otherwise
23
 The Transaction Model
 the model for transactions comes from the world of
business
 a supplier and a retailer negotiate on
 price
 delivery date
 quality
 etc.
 until the deal is concluded they can continue
negotiating or one of them can terminate
 but once they have reached an agreement they are
bound by law to carry out their part of the deal
 transactions between processes is similar with this
scenario

24
 e.g., assume the following banking operation
 withdraw an amount x from account 1
 deposit the amount x to account 2
 what happens if there is a problem after the first activity
is carried out?
 group the two operations into one transaction; either
both are carried out or neither
 we need a way to roll back when a transaction is not
completed

25
 e.g. reserving a seat from White Plains to Malindi through
JFK and Nairobi airports

BEGIN_TRANSACTION BEGIN_TRANSACTION
reserve WP  JFK; reserve WP  JFK;
reserve JFK  Nairobi; reserve JFK  Nairobi;
reserve Nairobi  Malindi; reserve Nairobi  Malindi full 
END_TRANSACTION ABORT_TRANSACTION
(a) (b)

(a) transaction to reserve three flights commits

(b) transaction aborts when third flight is unavailable

26
 properties of transactions, often referred to as ACID
1. Atomic: to the outside world, the transaction happens
indivisibly; a transaction either happens completely or
not at all; intermediate states are not seen by other
processes
2. Consistent: the transaction does not violate system
invariants; e.g., in an internal transfer in a bank, the
amount of money in the bank must be the same as it
was before the transfer (the law of conservation of
money); this may be violated for a brief period of time,
but not seen to other processes
3. Isolated or Serializable: concurrent transactions do not
interfere with each other; if two or more transactions
are running at the same time, the final result must look
as though all transactions run sequentially in some
order
4. Durable: once a transaction commits, the changes are
permanent; see later in Chapter 8
27
 Classification of Transactions
 a transaction could be flat, nested or distributed
 Flat Transaction
 consists of a series of operations that satisfy the ACID
properties
 simple and widely used but with some limitations
 do not allow partial results to be committed or aborted
 i.e., atomicity is also partly a weakness
 in our airline reservation example, we may want to
accept the first two reservations and find an
alternative one for the last
 some transactions may take too much time

28
 Nested Transaction
 constructed from a number of subtransactions; it is
logically decomposed into a hierarchy of
subtransactions
 the top-level transaction forks off children that run in
parallel, on different machines; to gain performance or
for programming simplicity
 each may also execute one or more subtransactions
 permanence (durability) applies only to the top-level
transaction; commits by children should be undone
 Distributed Transaction
 a flat transaction that operates on data that are
distributed across multiple machines
 problem: separate algorithms are needed to handle the
locking of data and committing the entire transaction;
see later in Chapter 8 for distributed commit

29
(a) a nested transaction
(b) distributed transaction

30
 Enterprise Application Integration
 how to integrate applications independent from their
databases
 transaction systems rely on request/reply
 how can applications communicate with each other

middleware as a communication facilitator in enterprise application

integration 31
 there are different communication models
 RPC (Remote procedure Call)
 RMI (Remote Method Invocation)
 MOM (Message-Oriented Communication)
 see later in Chapter 4
3. Distributed Pervasive Systems
 the distributed systems discussed so far are
characterized by their stability; fixed nodes having high-
quality connection to a network
 there are also mobile and embedded computing devices
with wireless connections

32
 three requirements for pervasive applications
 embrace contextual changes: a device is aware that
its environment may change all the time
 encourage ad hoc composition: devices are used in
different ways by different users
 recognize sharing as the default: devices join a
system to access or provide information
 examples of pervasive systems
 Home Systems
 Electronic Health Care Systems
 Sensor Networks
 read pages 27 - 30

33
?
End

Design and Analysis of Truss Using Staad Pro
67% (3)
Design and Analysis of Truss Using Staad Pro
18 pages
Glossary of Philatelic Terms
No ratings yet
Glossary of Philatelic Terms
54 pages
Design and Implementation of Web Based Human Resource Management
100% (1)
Design and Implementation of Web Based Human Resource Management
10 pages
Distributed Systems Chapter 1-Introduction
No ratings yet
Distributed Systems Chapter 1-Introduction
32 pages
Distributed Systems (Cosc 6003) : Chapter 1 - Introduction
No ratings yet
Distributed Systems (Cosc 6003) : Chapter 1 - Introduction
37 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
73 pages
Distributed Systems
No ratings yet
Distributed Systems
47 pages
Chapter 1-Introduction
No ratings yet
Chapter 1-Introduction
35 pages
Chapter 1-Introduction To Distributed Systems
No ratings yet
Chapter 1-Introduction To Distributed Systems
59 pages
Chapter 1-Introduction (2)
No ratings yet
Chapter 1-Introduction (2)
45 pages
Intro To DS Chapter 1
No ratings yet
Intro To DS Chapter 1
56 pages
Intro To Distributed Systems
No ratings yet
Intro To Distributed Systems
30 pages
Chapter 1 - Introduction DS
No ratings yet
Chapter 1 - Introduction DS
36 pages
Chapter-1-Introduction
No ratings yet
Chapter-1-Introduction
53 pages
Chapter 01 - Introduction Distributed Syetem
No ratings yet
Chapter 01 - Introduction Distributed Syetem
45 pages
Chapter 1
No ratings yet
Chapter 1
60 pages
CH 1 Distributed Systems
No ratings yet
CH 1 Distributed Systems
57 pages
Distributed System
No ratings yet
Distributed System
57 pages
Distributed Systems Overview
No ratings yet
Distributed Systems Overview
42 pages
Distributed Systems: Andrew S. Tanenbaum Maarten Van Steen
No ratings yet
Distributed Systems: Andrew S. Tanenbaum Maarten Van Steen
65 pages
Mulugeta A.: Chapter One
No ratings yet
Mulugeta A.: Chapter One
57 pages
Chapter One
No ratings yet
Chapter One
40 pages
Chapter 1-Introduction
No ratings yet
Chapter 1-Introduction
60 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
60 pages
Introduction Distributed Syetem and Principles For Distributed System
No ratings yet
Introduction Distributed Syetem and Principles For Distributed System
86 pages
Chapter 1 (A) - Distribted System
No ratings yet
Chapter 1 (A) - Distribted System
40 pages
Distributed Systems
No ratings yet
Distributed Systems
68 pages
Distributed Systems: Tanenbaum Chapter 1
No ratings yet
Distributed Systems: Tanenbaum Chapter 1
70 pages
2 - Lect 0 - Introduction to Distributed Systems
No ratings yet
2 - Lect 0 - Introduction to Distributed Systems
30 pages
Chapter 1 - Introduction
No ratings yet
Chapter 1 - Introduction
44 pages
Дистрибуиран компјутерски систем
No ratings yet
Дистрибуиран компјутерски систем
19 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
42 pages
Chapter 1
No ratings yet
Chapter 1
36 pages
Chapter 1-Introduction
No ratings yet
Chapter 1-Introduction
51 pages
Distributed Systems: Chapter 1 - Introduction
100% (2)
Distributed Systems: Chapter 1 - Introduction
74 pages
Chapter 1 IntroDistributed
No ratings yet
Chapter 1 IntroDistributed
143 pages
Chapter 1 - Introduction to Distributed System
No ratings yet
Chapter 1 - Introduction to Distributed System
25 pages
Chapter 1 - Definition of DS
No ratings yet
Chapter 1 - Definition of DS
50 pages
Overview of Distributed Computing
No ratings yet
Overview of Distributed Computing
4 pages
Chapter One
No ratings yet
Chapter One
33 pages
CSCE455/855 Distributed Operating Systems: Dr. Ying Lu Schorr Center 106
No ratings yet
CSCE455/855 Distributed Operating Systems: Dr. Ying Lu Schorr Center 106
41 pages
Chapter 1-Introduction (1)
No ratings yet
Chapter 1-Introduction (1)
51 pages
Destributed System Lecture Note Finale
No ratings yet
Destributed System Lecture Note Finale
148 pages
Distributed System Chapter-1
No ratings yet
Distributed System Chapter-1
61 pages
Chapter 1
No ratings yet
Chapter 1
63 pages
Computer Networks Notes Unit1
No ratings yet
Computer Networks Notes Unit1
10 pages
W01-L01 Introduction To Distributed Computing
No ratings yet
W01-L01 Introduction To Distributed Computing
46 pages
Chap 01
No ratings yet
Chap 01
16 pages
Distributed Systems: MSC in Computer Science Unyt-Uog Assoc. Prof. Marenglen Biba
No ratings yet
Distributed Systems: MSC in Computer Science Unyt-Uog Assoc. Prof. Marenglen Biba
105 pages
DC Module 1
No ratings yet
DC Module 1
136 pages
chapter 1 (4)
No ratings yet
chapter 1 (4)
55 pages
Distributed Systems Introduction
No ratings yet
Distributed Systems Introduction
40 pages
Lecture-1 - Distributed Computing
No ratings yet
Lecture-1 - Distributed Computing
38 pages
Distributed Systems Principles and Paradigms: Second Edition Andrew S. Tanenbaum Maarten Van Steen
No ratings yet
Distributed Systems Principles and Paradigms: Second Edition Andrew S. Tanenbaum Maarten Van Steen
29 pages
Ch1-Introduction
No ratings yet
Ch1-Introduction
57 pages
chapter 1-Introductionk (1)
No ratings yet
chapter 1-Introductionk (1)
58 pages
Introduction
No ratings yet
Introduction
59 pages
Chapter 1-Introduction (4)
No ratings yet
Chapter 1-Introduction (4)
33 pages
Distributed System
No ratings yet
Distributed System
62 pages
Distributed Systems: Introduction
No ratings yet
Distributed Systems: Introduction
11 pages
Distributed Systems Report
100% (1)
Distributed Systems Report
36 pages
Cloud Computing Interview Questions You'll Most Likely Be Asked: Second Edition
From Everand
Cloud Computing Interview Questions You'll Most Likely Be Asked: Second Edition
Vibrant Publishers
No ratings yet
Operating System Interview Questions and Answers
From Everand
Operating System Interview Questions and Answers
Manish Soni
No ratings yet
MARA University of Technology Malaysia: Faculty of Art and Design
No ratings yet
MARA University of Technology Malaysia: Faculty of Art and Design
29 pages
Little Booklet of Phone Scams
No ratings yet
Little Booklet of Phone Scams
12 pages
EEGI 3131-Adjustment Computations-Lesson 4
No ratings yet
EEGI 3131-Adjustment Computations-Lesson 4
17 pages
Unit 4, Lesson 2, Worksheet 12: The Blue Pants Costs Fifty Dollars and The Green Pants Cost Twenty Dollars
No ratings yet
Unit 4, Lesson 2, Worksheet 12: The Blue Pants Costs Fifty Dollars and The Green Pants Cost Twenty Dollars
18 pages
Confidential: SESSION 2018/2019
No ratings yet
Confidential: SESSION 2018/2019
26 pages
Management Information System Final Project: Some of The Suggestions Are As Follows
No ratings yet
Management Information System Final Project: Some of The Suggestions Are As Follows
2 pages
Adam Thierer (PFF) Remarks at FCC Hearing On Public Interest in Digital Era (3!4!10)
No ratings yet
Adam Thierer (PFF) Remarks at FCC Hearing On Public Interest in Digital Era (3!4!10)
14 pages
Recurrent Neural Networks
No ratings yet
Recurrent Neural Networks
6 pages
2 Rough Clustering Highlighted
No ratings yet
2 Rough Clustering Highlighted
9 pages
Professional Experience: Avijit Das
No ratings yet
Professional Experience: Avijit Das
2 pages
Answering Key Worksheet # 1
No ratings yet
Answering Key Worksheet # 1
2 pages
PLC To Hmi Communication Protocol
No ratings yet
PLC To Hmi Communication Protocol
7 pages
Microsoft Powerpoint 2010 Part 1: Introduction To Powerpoint
No ratings yet
Microsoft Powerpoint 2010 Part 1: Introduction To Powerpoint
25 pages
Introduction To Minitab: Lab No: 01
No ratings yet
Introduction To Minitab: Lab No: 01
3 pages
Psycopg2 Tutorial
No ratings yet
Psycopg2 Tutorial
6 pages
Vendor Return Process in SAP MM
100% (3)
Vendor Return Process in SAP MM
15 pages
Dynamic HTTP or Odata Adapter –ntication for Flow Processing in
No ratings yet
Dynamic HTTP or Odata Adapter –ntication for Flow Processing in
24 pages
Blockchain Knowledge Check Lopez
No ratings yet
Blockchain Knowledge Check Lopez
4 pages
Ahoy There
No ratings yet
Ahoy There
55 pages
Gary Bronson Excel 2019 Project Book Mercury Learning and Information 2021
No ratings yet
Gary Bronson Excel 2019 Project Book Mercury Learning and Information 2021
162 pages
AI for People and Business: A Framework for Better Human Experiences and Business Success Alex Castrounis - Download the ebook now to start reading without waiting
100% (1)
AI for People and Business: A Framework for Better Human Experiences and Business Success Alex Castrounis - Download the ebook now to start reading without waiting
69 pages
Manual Del FaultKin - Geomechanics
100% (1)
Manual Del FaultKin - Geomechanics
31 pages
Inventory Control & Improving Record Accuracy in Production: Dr. Elbahlul M. Abogrean, Tajedeen R. Own
No ratings yet
Inventory Control & Improving Record Accuracy in Production: Dr. Elbahlul M. Abogrean, Tajedeen R. Own
6 pages
ModelMate Tutorial GEGN583-483enhanced
No ratings yet
ModelMate Tutorial GEGN583-483enhanced
9 pages
CSE 3 & 4yrs - Syllabus
No ratings yet
CSE 3 & 4yrs - Syllabus
62 pages
CS-403 Software Engineering
No ratings yet
CS-403 Software Engineering
2 pages
Application Config
No ratings yet
Application Config
17 pages

Distributed systems Chapter 1-Introduction

Uploaded by

Distributed systems Chapter 1-Introduction

Uploaded by

Kibru College

Department of Computer Science

Introduction to Distributed Systems

 This definition has two aspects:

A distributed system is one that stops you getting any work

a distributed system organized as middleware; note that the

 Scalability in Distributed Systems

an example of dividing the DNS name space into zones

(a) transaction to reserve three flights commits

middleware as a communication facilitator in enterprise application

You might also like