5CS022 Lecture 1

The document provides an overview of distributed systems and cloud computing, covering key topics such as the Message Passing Interface (MPI), the Actor Model, and various architectures like client-server and peer-to-peer networks. It discusses the evolution and advantages of distributed computing, including resource sharing, fault tolerance, and scalability. Additionally, it outlines the structure of an assessment for a course on distributed and cloud systems programming.
Distributed Systems and Cloud Computing

Lecture 1
We will very likely cover topics such as:

● MPI - Message Passing Interface
● Actor Model for Distributed Programming
● Apache Spark
● Amazon AWS
● Google Cloud
● Microsoft Azure
Introduction to Distributed Systems
What is a Distributed System?

Also known as distributed computing and distributed databases, a distributed system is a collection of independent components located on different machines that exchange messages with each other in order to achieve common goals.

Distributed computing is the method of making multiple computers work together to solve a common problem.
Parallelism is the new order of the day

Advantages of parallel computing

● Serial implementation - tasks are executed one after another on a single processor
● Parallel implementation - tasks are executed simultaneously across multiple processors
Evolution of Distributed Systems
The Birth of Networking - From Isolated Machines to Connected Worlds

In the early days, computers operated in isolation. The birth of networking marked a transformative era as machines started connecting for basic communication. This allowed data exchange and paved the way for collaborative computing.
Client-Server Architecture - Enter the Client-Server Era

The client-server architecture emerged, introducing a centralized model where clients requested services from a central server. This paradigm shift streamlined processes, making information and services more accessible across networks.
Proliferation of the Internet - The Web Unleashed

With the proliferation of the internet, distributed systems became ubiquitous. Mass adoption occurred, connecting the world in ways previously unimaginable. This global network laid the foundation for distributed computing on a larger scale.
Peer-to-Peer Networks - The Rise of Equality

Peer-to-peer networks gained prominence, promoting decentralization. Devices began sharing resources directly, reducing reliance on a central authority. This shift brought about increased resilience and scalability.
Distributed Computing - Harnessing Collective Power

The era of distributed computing began, leveraging the collective power of multiple machines. Tasks were divided and processed in parallel, optimizing performance and efficiency.
Cloud Computing Emerges - The Sky's the Limit

Cloud computing revolutionized the landscape by introducing virtualization. This allowed remote access to computing resources on demand, offering scalability, flexibility, and cost-efficiency.
Elements of a Distributed System

The most important properties of distributed computing are:

● Resource sharing - hardware, software, or data can be shared between nodes
● Openness - how openly the software is designed to be developed, extended, and shared
● Concurrency - multiple machines can process the same function at the same time
● Scalability - how computing and processing capabilities grow when extended to many machines
● Fault tolerance - how easily and quickly failures in parts of the system can be detected and recovered from
● Transparency - how much access one node has to locate and communicate with other nodes in the system
Different ways to achieve distributed computing

Distributed computing involves the use of multiple computers or servers working together to solve a problem or execute a task. There are several ways to achieve distributed computing, each with its own advantages and use cases.

Here are some common approaches:

1. Client-Server Architecture
Description: In this model, one central server provides services or resources, and multiple client machines request and use these services.
Use Case: Common in applications where centralized control is needed, such as web servers or database systems.

2. Peer-to-Peer Networks
Description: In a peer-to-peer network, all nodes (computers or devices) are considered equal, and they can share resources directly with one another without relying on a central server.
Use Case: Popular for file sharing (e.g., BitTorrent) and decentralized applications.

3. Distributed Computing Clusters
Description: Multiple computers are interconnected and work together as a cluster. Tasks are divided among the cluster nodes, allowing parallel processing and improved performance.
Use Case: High-performance computing tasks like scientific simulations or data analysis.
4. Grid Computing
Description: Similar to clusters, but often involves geographically distributed resources connected over a network. It aims to solve large-scale problems by utilizing idle resources from multiple locations.
Use Case: Research projects, scientific computing, and projects that require massive computational power.

5. Distributed Databases
Description: Data is distributed across multiple nodes, allowing for improved scalability and fault tolerance. Different types include sharded databases and NoSQL databases.
Use Case: Large-scale applications with high read and write demands, such as social media platforms.

6. Cloud Computing
Description: Resources are provided as a service over the internet. Users can access computing power, storage, and other services on a pay-as-you-go basis.
Use Case: General-purpose computing, scalable web applications, and data storage.
Distributed Computing Models

● Message Passing Model
● Actor Model
The Message-Passing Model

● Based on the notion of multiple processes
○ A process is an instance of a running program, together with the program’s data
● Parallelism is achieved by having many processes co-operate on the
same task
● Each process has access only to its own data
○ all variables are private
● Processes communicate with each other by sending and receiving
messages
○ using library calls from a conventional sequential language for synchronization
○ sending data from one process's memory space to another
MPI: Message Passing Interface

The message passing interface (MPI) is a standardized means of exchanging messages between multiple computers running a parallel program across distributed memory.

MPI is not…
● a language or compiler specification
● a specific implementation or product
Reasons for Using MPI

● Standardization: MPI has replaced other message passing libraries, becoming a generally accepted industry standard.
● Portability: MPI has been implemented for many distributed memory architectures, meaning users don't need to modify source code when porting applications to different platforms that support the MPI standard.
● Speed: Each implementation is typically optimized for the hardware it runs on. Vendor implementations may also be optimized for native hardware features.
● Functionality: MPI is designed for high performance on massively parallel machines and clusters. The basic MPI-1 standard defines more than 100 routines.
● Availability: A variety of implementations are available, both commercial and public domain
○ OpenMPI and MPICH are popular free, open-source implementations of MPI
○ Microsoft provides a free implementation for Windows called MS-MPI
SIMD / SPMD

● Most message passing programming systems use the Single-Instruction-Multiple-Data (SIMD) approach
● This is also sometimes called SPMD: Single Program, Multiple Data
● All processes run their own identical copy of the same program
● Each process works on different parts of the same data, or on different data
● Each process has a unique identifier
● Processes can follow different control paths through the program, depending on their process ID
● Usually run one process per processor / core / machine
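
A minimal SPMD sketch in C (an illustrative example, not taken from the lecture slides): every process runs the same program, asks MPI for its unique identifier, and then follows a different control path depending on that identifier.

/* spmd_hello.c - each process runs this same program */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char *argv[])
{
    int rank, size;

    MPI_Init(&argc, &argv);                 /* start the MPI runtime          */
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);   /* this process's unique ID       */
    MPI_Comm_size(MPI_COMM_WORLD, &size);   /* total number of processes      */

    if (rank == 0) {
        /* rank 0 follows the "controller" path */
        printf("Controller: %d processes are running\n", size);
    } else {
        /* every other rank follows the "worker" path */
        printf("Worker %d of %d reporting\n", rank, size);
    }

    MPI_Finalize();                          /* shut the MPI runtime down     */
    return 0;
}

Launched with four processes, each rank prints its own line; the program text is identical everywhere, only the rank differs.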
Communication Messages

● A message usually transfers a number of data items of a certain type from the memory of one process to the memory of another process
● A message typically contains
○ the ID of the sending process
○ the ID of the receiving process
○ the type of the data items
○ the number of data items
○ the data itself
○ a message type identifier
● Sending a message can either be synchronous or asynchronous
● A synchronous send is not completed until the message has started
to be received
● An asynchronous send completes as soon as the message has been
accepted into the system
● Receives are usually synchronous - the receiving process must wait
until the message arrives
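
The synchronous/asynchronous distinction can be illustrated with MPI routines (an assumed mapping used here for illustration, not stated on the slide): MPI_Ssend only completes once the matching receive has started, while MPI_Isend returns immediately and is completed later with MPI_Wait. A small sketch, assuming at least two processes:

/* send_modes.c - run with at least two processes */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char *argv[])
{
    int rank, data[4] = {1, 2, 3, 4};
    MPI_Request req;
    MPI_Status  status;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        /* synchronous send: blocks until rank 1 has started receiving */
        MPI_Ssend(data, 4, MPI_INT, 1, 0, MPI_COMM_WORLD);

        /* asynchronous (nonblocking) send: returns at once...          */
        MPI_Isend(data, 4, MPI_INT, 1, 1, MPI_COMM_WORLD, &req);
        /* ...so we must wait before reusing the send buffer            */
        MPI_Wait(&req, &status);
    } else if (rank == 1) {
        /* receives are synchronous: each call waits for its message    */
        MPI_Recv(data, 4, MPI_INT, 0, 0, MPI_COMM_WORLD, &status);
        MPI_Recv(data, 4, MPI_INT, 0, 1, MPI_COMM_WORLD, &status);
        printf("rank 1 received both messages\n");
    }

    MPI_Finalize();
    return 0;
}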
Process Identification

● MPI processes can belong to one or more groups called "communicators" (communication channels)
● Processes within a communicator can only communicate with other processes in that communicator
● When an MPI application starts, the group of all processes is initially given a predefined name called MPI_COMM_WORLD
● A process is identified by a unique number within each communicator, called its rank
● The process with rank zero (0) is the starting process, or the first process. In older documentation it is referred to as the "master" node. Other documentation may refer to it as the "controller" node or "supervisor" node.
● For two different communicators, the same process can have two
different ranks: so the meaning of a “rank” is only defined when you
specify the communicator
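
A short sketch of the last point (illustrative, not from the slides): MPI_Comm_split creates a new communicator, and the same process ends up with different ranks in MPI_COMM_WORLD and in the new communicator.

/* ranks_and_communicators.c */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char *argv[])
{
    int world_rank, world_size, sub_rank;
    MPI_Comm sub_comm;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);

    /* split the processes into two groups: even ranks and odd ranks */
    int color = world_rank % 2;
    MPI_Comm_split(MPI_COMM_WORLD, color, world_rank, &sub_comm);

    MPI_Comm_rank(sub_comm, &sub_rank);   /* rank within the new group */
    printf("world rank %d of %d has rank %d in its sub-communicator\n",
           world_rank, world_size, sub_rank);

    MPI_Comm_free(&sub_comm);
    MPI_Finalize();
    return 0;
}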
Data Communication

● Data communication in MPI is like email exchange
● One process sends a copy of the data to another process, and the other process receives it

Communication requires the following information:
● Sender has to know:
○ Who to send the data to (receiver’s process rank)
○ What kind of data to send (100 integers or 200 characters, etc)
○ A user-defined “tag” for the message (like an email subject which allows
the receiver to understand what type of data is being received)
● Receiver might have to know:
○ Who is sending the data. It's OK if the receiver does not know; in this
case sender rank will be MPI_ANY_SOURCE, meaning anyone can send
○ What kind of data is being received
○ What the user-defined “tag” of the message is. It's OK if the receiver
does not know; in this case tag will be MPI_ANY_TAG
Using Ranks for Communication

● When sending data, the sender has to specify the destination process’ rank (process ID)
○ specifies where the message should go
● The receiver has to specify the source process’ rank
○ indicates where the message will come from
● The ID of MPI_ANY_SOURCE is a special “wild-card” source ID
that can be used by the receiver to match any source
Point to Point Messaging

● Sender calls a SEND routine
○ specifying the data that is to be sent
○ this is called the send buffer
● Receiver calls a RECEIVE routine
○ specifying where the incoming data should be stored
○ this is called the receive buffer
● Data goes into the receive buffer
● Metadata describing message also transferred
○ this is received into separate storage
○ this is called the status
MPI Basic Send Message

MPI_SEND(buf,count,datatype,dest,tag,comm)
● The message buffer is described by buf, count, datatype.
● The target process is specified by dest and comm.
○ dest is the rank of the target process in the communicator specified by
comm.
● tag is a user-defined “type” for the message
● When this function returns, the data has been delivered to the
system and the buffer can be reused.
○ Thus this function is "blocking"
○ However, the message might not have been received by the target process yet.
MPI Basic Receive Message

MPI_RECV(buf,count,datatype,source,tag,comm,status)
● Waits until a message matching source, tag, and comm is received from the system, after which the buf buffer can be read.
● source is rank in communicator comm, or MPI_ANY_SOURCE.
● Receiving fewer than count occurrences of datatype is OK, but
receiving more is an error.
● status is a structure containing further information:
○ Who sent the message, which is useful if you used MPI_ANY_SOURCE
○ How much data was actually received
○ What tag was used with the message, which is useful if you used
MPI_ANY_TAG
○ MPI_STATUS_IGNORE can be used if we don’t need any additional
information
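
Putting MPI_SEND and MPI_RECV together, a minimal C sketch (illustrative only): rank 0 sends 100 integers with tag 7, and rank 1 receives using the MPI_ANY_SOURCE and MPI_ANY_TAG wild-cards, then inspects the status structure.

/* point_to_point.c - run with at least two processes */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char *argv[])
{
    int rank, buf[100], count;
    MPI_Status status;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        for (int i = 0; i < 100; i++) buf[i] = i;
        /* send buffer buf, 100 items of type MPI_INT, to rank 1, tag 7 */
        MPI_Send(buf, 100, MPI_INT, 1, 7, MPI_COMM_WORLD);
    } else if (rank == 1) {
        /* accept a message from any sender with any tag */
        MPI_Recv(buf, 100, MPI_INT, MPI_ANY_SOURCE, MPI_ANY_TAG,
                 MPI_COMM_WORLD, &status);

        MPI_Get_count(&status, MPI_INT, &count);   /* items actually received */
        printf("received %d ints from rank %d with tag %d\n",
               count, status.MPI_SOURCE, status.MPI_TAG);
    }

    MPI_Finalize();
    return 0;
}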
Running MPI Programs

● MPI programs can either run on the same computer or be distributed to other computers (nodes) to share the workload.
● In order to run MPI programs on other nodes, the program has to be copied to all the nodes.
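
As a usage sketch (exact commands depend on the MPI implementation installed): with OpenMPI or MPICH, a program is typically compiled with the mpicc wrapper and launched with mpiexec (or mpirun), and a host file can be used to spread the processes across several nodes, provided the executable is present at the same path on every node. For example:

mpicc hello.c -o hello
mpiexec -n 4 ./hello
mpiexec -n 8 --hostfile hosts ./hello

Flag names such as --hostfile vary slightly between implementations.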
Assessment

This assessment is a Portfolio for 5CS022 Distributed and Cloud Systems Programming,
which accounts for 100% of the module marks.

There are several components to the Portfolio:

Part 1 – Workshop tasks

The workshop tasks will contribute 20% of the marks to the Portfolio. These

will be clearly identified within the workshop instructions.

Part 2 – Quizzes

The quizzes will contribute a total of 30% of the marks to the Portfolio.

Part 3 – Coursework

The coursework will consist of a number of questions that you will have to answer by writing a short research-based report, and a number of tasks which you will have to carry out by creating a number of specified programs. The coursework will contribute 50% of the marks to the Portfolio.
