System Models For Distributed and Cloud Computing
Unlike the cluster or grid, a P2P network does not use a dedicated interconnection network.
Clouds
A Cloud is a pool of virtualized computer resources. A cloud can
host a variety of different workloads, including batch-style
backend jobs and interactive and user-facing applications.
Commonly used distributed programming stacks, compared layer by layer:

CORBA Stack                RMI Stack                      Web Services Stack
-----------                ---------                      ------------------
IDL                        Java interface                 WSDL
CORBA Services             RMI Registry                   UDDI
CORBA Stubs/Skeletons      RMI Stubs/Skeletons            SOAP Message
CDR binary encoding        Java native encoding           XML Unicode encoding
                           (serialization)
IIOP                       JRMP                           HTTP
ORB                        JVM                            .NET/Apache Axis
             TCP/IP / DataLink / Physical (common to all three stacks)
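To ground the middle column, here is a minimal, self-contained sketch of the RMI stack in action; the Greeter interface and all names are hypothetical, but the pieces map onto the rows above: the Java interface (the IDL/WSDL analogue), the RMI Registry (naming, like UDDI), and an exported stub communicating over JRMP via Java native serialization.

```java
import java.rmi.Remote;
import java.rmi.RemoteException;
import java.rmi.registry.LocateRegistry;
import java.rmi.registry.Registry;
import java.rmi.server.UnicastRemoteObject;

public class RmiSketch {
    // The remote interface: the RMI analogue of CORBA's IDL or a Web service's WSDL.
    interface Greeter extends Remote {
        String greet(String name) throws RemoteException;
    }

    static class GreeterImpl implements Greeter {
        public String greet(String name) { return "Hello, " + name; }
    }

    public static void main(String[] args) throws Exception {
        // Export the implementation; the returned stub plays the Stubs/Skeletons role.
        Greeter stub = (Greeter) UnicastRemoteObject.exportObject(new GreeterImpl(), 0);

        // The RMI Registry is the naming service (the row UDDI occupies for Web services).
        Registry registry = LocateRegistry.createRegistry(1099);
        registry.rebind("Greeter", stub);

        // A client looks up the stub and invokes it; arguments and results travel
        // over JRMP using Java native serialization.
        Greeter client = (Greeter) LocateRegistry.getRegistry("localhost", 1099).lookup("Greeter");
        System.out.println(client.greet("world"));   // prints "Hello, world"
    }
}
```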
Performance Metrics:
CPU speed: MHz or GHz, SPEC benchmarks like SPECINT
Network Bandwidth: Mbps or Gbps
System throughput: MIPS, TFlops (tera floating-point operations
per second), TPS (transactions per second), IOPS (IO operations
per second)
Other metrics: Response time, network latency, system availability
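As a rough illustration of how two of these metrics are measured in practice, the sketch below times an arbitrary stand-in workload (summing an array) and reports average response time and throughput; the class name and the workload are made up for illustration.

```java
public class MetricsSketch {
    public static void main(String[] args) {
        long[] data = new long[1_000_000];
        java.util.Arrays.fill(data, 1L);

        int ops = 100;                     // number of "transactions" to measure
        long checksum = 0;
        long start = System.nanoTime();
        for (int i = 0; i < ops; i++) {
            for (long v : data) checksum += v;   // the measured operation
        }
        long elapsedNs = System.nanoTime() - start;

        double avgLatencyMs = (elapsedNs / 1e6) / ops;   // response time per operation
        double throughput   = ops / (elapsedNs / 1e9);   // operations per second (cf. TPS/IOPS)
        System.out.printf("avg latency: %.3f ms, throughput: %.1f ops/s (checksum %d)%n",
                          avgLatencyMs, throughput, checksum);
    }
}
```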
Scalability
Scalability is the ability of a system to handle a growing amount of work.

One form of scalability for parallel and distributed systems is:

Size Scalability
This refers to achieving higher performance or more functionality by increasing the machine size. Size in this case refers to adding processors, cache, memory, storage, or I/O channels.

Scale Horizontally and Vertically
Methods of adding more resources for a particular application fall into two broad categories:

Scale Horizontally
To scale horizontally (or scale out) means to add more nodes to a system, such as adding a new computer to a distributed software application. An example might be scaling out from one Web server to three.
The scale-out model has created an increased demand for shared data storage with very high I/O performance, especially where processing of large amounts of data is required.

Scale Vertically
To scale vertically (or scale up) means to add resources to a single node in a system, typically involving the addition of CPUs or memory to a single computer.

Tradeoffs
There are tradeoffs between the two models. A larger number of computers means increased management complexity, as well as a more complex programming model and issues such as throughput and latency between nodes.
Also, some applications do not lend themselves to a distributed computing model.
In the past, the price difference between the two models favored "scale up" computing for those applications that fit its paradigm, but recent advances in virtualization technology have blurred that advantage, since deploying a new virtual system/server over a hypervisor is almost always less expensive than actually buying and installing a real one.
Amdahl's Law
It is typically cheaper to add a new node to a system in order to achieve improved performance than to perform performance tuning to improve the capacity that each node can handle. But this approach can have diminishing returns, as indicated by Amdahl's Law.
Consider the execution of a given program on a uniprocessor workstation with a total execution time of T minutes. Now, let's say that the program has been parallelized or partitioned for parallel execution on a cluster of many processing nodes.
Assume that a fraction α of the code must be executed sequentially, called the sequential block. Therefore, (1 - α) of the code can be compiled for parallel execution by n processors. The total execution time of the program is calculated by:
αT + (1 - α) T / n
where the first term is the sequential execution time on a single processor and the second term is the parallel execution time on n processing nodes.
All system or communication overhead is ignored here. The I/O and exception handling time is also not included in the speedup analysis.
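A small numeric sketch of this formula, with made-up example values T = 100 minutes and α = 0.25, shows how the sequential term bounds the total time:

```java
public class ExecTimeSketch {
    public static void main(String[] args) {
        double T = 100.0, alpha = 0.25;   // hypothetical example values
        for (int n : new int[] {1, 2, 8, 32}) {
            // Total time = sequential part + parallel part spread over n nodes.
            double t = alpha * T + (1 - alpha) * T / n;
            System.out.printf("n = %2d  time = %.1f min%n", n, t);
        }
        // n = 1 gives 100 min; the time can never drop below alpha*T = 25 min.
    }
}
```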
Amdahl's Law states that the speedup factor of using the n-processor system over the use of a single processor is expressed by:
Speedup = S = T / [αT + (1 - α) T / n] = 1 / [α + (1 - α) / n]
The maximum speedup of n is achievable only when α = 0, i.e. the entire program is parallelizable.
As the cluster becomes sufficiently large, i.e. n → ∞, then S → 1/α, an upper bound on the speedup S. This upper bound is independent of the cluster size n. The sequential bottleneck is the portion of the code that cannot be parallelized.
Example: with α = 0.25, and so (1 - 0.25) = 0.75, the maximum speedup is S = 1/0.25 = 4, even if one uses hundreds of processors.
Amdahl's Law teaches us that we should make the sequential bottleneck as small as possible. Increasing the cluster size alone may not result in a good speedup in this case.
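The bound is easy to check numerically; the sketch below evaluates the speedup formula for the α = 0.25 example at increasing cluster sizes:

```java
public class AmdahlSketch {
    // Amdahl's Law: S = 1 / (alpha + (1 - alpha)/n)
    static double speedup(double alpha, int n) {
        return 1.0 / (alpha + (1.0 - alpha) / n);
    }

    public static void main(String[] args) {
        double alpha = 0.25;   // sequential fraction from the example above
        for (int n : new int[] {1, 4, 16, 64, 256, 1024}) {
            System.out.printf("n = %4d  S = %.3f%n", n, speedup(alpha, n));
        }
        // n = 1024 prints S = 3.988, already close to the 1/alpha = 4 upper bound.
    }
}
```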
System Efficiency
System efficiency, E = S / n = 1 / [αn + (1 - α)]
System efficiency can be rather low if the cluster size is very large.
Example: To execute a program on a cluster with n = 4, α = 0.25 and so (1 - 0.25) = 0.75:
E = 1 / [0.25 × 4 + 0.75] = 0.57, or 57%
Now if we have 256 nodes (i.e. n = 256):
E = 1 / [0.25 × 256 + 0.75] = 0.015, or 1.5%
This is because only a few processors (4, as in the previous case) are
kept busy, while the majority of the processors (or nodes) are left idling.
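The efficiency figures above can be reproduced directly from E = 1 / [αn + (1 - α)]:

```java
public class EfficiencySketch {
    public static void main(String[] args) {
        double alpha = 0.25;   // sequential fraction from the example above
        for (int n : new int[] {4, 256}) {
            double e = 1.0 / (alpha * n + (1.0 - alpha));
            System.out.printf("n = %3d  E = %.3f (%.1f%%)%n", n, e, 100 * e);
        }
        // Prints E = 0.571 (57.1%) for n = 4 and E = 0.015 (1.5%) for n = 256.
    }
}
```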
Single points of failure that bring down the entire system must be avoided when designing distributed systems.