0% found this document useful (0 votes)
489 views83 pages

GCC Unit 1

This document provides an overview of the syllabus for a course on Grid and Cloud Computing. The syllabus covers topics like evolution of distributed computing, grid services, virtualization, programming models, and security. Unit 1 of the syllabus focuses on technologies for networked systems, grid infrastructures, cloud computing, and an introduction to grid architecture and standards.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
489 views83 pages

GCC Unit 1

This document provides an overview of the syllabus for a course on Grid and Cloud Computing. The syllabus covers topics like evolution of distributed computing, grid services, virtualization, programming models, and security. Unit 1 of the syllabus focuses on technologies for networked systems, grid infrastructures, cloud computing, and an introduction to grid architecture and standards.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 83

CS – 6703

GRID & CLOUD COMPUTING


Dr. B. Chandramouli
Overall Syllabus
• Unit 1 -- Introduction
• Unit 2 -- Grid Services
• Unit 3 -- Virtualisation
• Unit 4 -- Programming model
• Unit 5 -- Security
Unit 1 - Syllabus
• Evolution of Distributed computing:
• Scalable computing over the Internet
• Technologies for network based systems
• Clusters of cooperative computers
• Grid computing Infrastructures
• Cloud computing
• Service oriented architecture
• Introduction to Grid Architecture and standards
• Elements of Grid
• Overview of Grid Architecture
Background
• Extension of what you read before….
– Distributed systems
– Software architecture
• Discuss about “Computing” wrt
– Large scale , big data, scientific problems and
solutions
– Virtual computing
• These computing requires
– Multiple computers distributed globally and
communicating with each other
• Results in “Computer to computer
communication ( network)” for “computing”
COMMUNICATION ?
• Communication is the transfer of information
from one place to another.
• The process of communication essentially
involves the following three components:
– Sender
– Receiver
– Medium
Components of Communication
• Sender :-
– The component from where the information is
transferred
• Receiver :-
– The component to which the information is
transferred.
• Medium:-
– The component through which the information is
transferred .
Computer Communication
• In computer communication the sender and
the receiver are both computers and medium
is electromagnetic waves, cables or some
other physical media.
What is Networking ?
• Networking is a way of communication.

• The concept of connecting computers together to form


computer network. A computer network is a
communication system where a group of computers
and other devices like printer are connected by cables
and other hardware.
• This concept of connected computers sharing
resources called networking.
Why Network ?
• To share
– Data Messages
– Printers
– Hard disks
– CD-ROMS
– Modems
– Other hard disk resources.

Advantages of Networking :
• Networks allow efficient management of resources.
• Network helps keep information reliable and up-to-date.
• Networks help speed up data sharing.
• Transferring files across a network a network is always faster than non
network.
• Services like E-mail allow communication among individuals.
Types of Networks
• L A N (Local Area Network)
– If a network is confined to a single location, typically one building or complex,
it is called local area network. The maximum distance from one end of a
network to another is usually limited . Generally sending and receiving of
messages through a physical connection such as a cable.
• W A N (Wide Area Network)
– When the networks is spread over wide areas such as across cites, states or
countries. It is called as a wide area network. Communication take place via
telephone lines, satellites or physical cable. Best example is live telecast of
sports.
• M A N (Metropolitan Area Network)
– In between LAN and WAN is the metropolitan network. This networks that
covers entire city, but uses LAN technology. Cable television networks are the
examples of MAN s. The MAN s we are interested in carry information in the
form of computer signals from one computer to another.
Components of networks
• Server:-
– Computers that provide shared resources to network users.
• Clients:-
– Computers that access shared network resources provided
by a server.
• Media:-
– The way in which the computers are connected.
• Resources:-
– Files printers and other items to be used by network users
CATEGORIES OF NETWORK
• Server Based Network :
– A server is a computer on a network that functions as a server and is not
used as client.
– A server is optimized to service requests from network clients
– Servers perform wide variety of tasks.
– Usually large networks have specialized servers for different tasks, e.g.
• File and print servers:-
– Manage user access and use of file and printer resources.
• Application servers:-
– Make the data on the server available to clients.
• Mail servers:-
– Manage electronic messaging between network users.
• Communication servers:-
– Handle data flow and E-mail messages.

CATEGORIES OF NETWORK…
• PEER TO PEER NETWORK :
– In this there are no dedicated servers.
– All the computers are equal and therefore are
termed as peers.
– The main advantage of this network is simple in
design and maintenance.
– It is usually expensive to set up as compared to
server-based networks.
Topology
• Topology refers to the arrangement of
computers, cables and other components on
the network.
• Networks can be configured in to following
basic topologies
– Bus
– Star
– Ring
Bus / Star / Ring
• Bus topology :
– This is also known as liner bus. It consists of several computers which are
attached to a common cable called trunk line. This is also called as passive
topology, in which computers are not responsible for transferring of data. In
this failure of one computer does not affect the performance of the network

• Star topology :
– In this computers are connected to a centralized device known as HUB. As
each computer is connected to central point, this topology requires great deal
of cabling. If the centralized controller fails , the entire network is disabled.

• Ring topology :
– In this computers are connected through a single circle of cable. This is an
active topology, as each computer acts like a repeater to boost the signal and
sends it on to the next computer. Failure of one computer can affect the entire
network.
INTERNET & INTRANET
• INTERNET
– Internet is an inter-connection between several computers
of different types belonging to various networks all over
the globe. It is a networks of network.

• INTRANET
– An intranet is a private computer network that
uses Internet technologies to securely share any part of an
organization's information or operational systems with its
employees. Interconnectivity of two or more networks to
form a single unit is called intranet. It may consist of many
interlinked local area networks and also use leased lines in
the wide area network
Some terminologies
WWW
• WORLD WIDE WEB : The World Wide
Web (commonly abbreviated as the "Web") is
a system of interlinked hypertext documents
accessed via the Internet. With a Web
browser, one can view Web pages that may
contain text, images, videos, and
other multimedia and navigate between them
using hyperlinks
HYPERTEXT
• Hypertext is text, displayed on a computer,
with references (hyperlinks) to other text that
the reader can immediately access, usually by
a mouse click or key press sequence
WEB BROWSER
• A web browser is a software application for
retrieving, presenting, and traversing
information resources on the World Wide
Web. The major web browsers are Windows
Internet Explorer, Mozilla Firefox, Apple
Safari, Google Chrome, and Opera.
Web Pages / Hypertext
• A web page or webpage is a document or
resource of information that is suitable for
the World Wide Web and can be accessed
through a web browser and displayed on
a computer screen. This information is usually
in HTML or XHTML format
Web Pages / Hypertext …
• WWW is the most popular method of
accessing the internet. The main reason of
popularity is due to usage of concept called
hypertext. Hypertext is the new way of
information storage and retrieval. Hypertext
documents on the internet called as web
pages. These web pages are created by using
special language called hypertext markup
language (HTML).
Internet
Individual server / Network (Stores Web pages) / web
pages are created by HTML
WWW – Logical link between all web pages

Global network of many


networks ( Internet)

Client with
browser SW
ISP
MULTIMEDIA & APPLICATIONS
• Multimedia is media and content that uses a
combination of different content forms.
Multimedia includes a combination
of text, audio, still images, animation, video,
and interactivity content forms.

• Multimedia Apps
– Foreign language learning
– Video games
– Special effects in films
– Animated advertisements
Internet of Things
• Major Technology evolution of today
• Use of internet( wireless) to connect common objects,
sensors, computers, devices used in daily life
• IoT uses RFID, sensors, GPS( Global Positioning system)
• 3 Types of communication
– H2H ( Human to Human) -- email, chat,Skype, facebook
– H2T ( Human to Thing ) -- Voice commands to remote robo
( Open Gate), New location suggest good restaurants( apple watch)
– T2T ( Thing to Thing ) – remote a/c controller to Home ac
( At 4:50 pm automatically send signal to power On AC at home)
Technologies in HW
• Multicore Processor ( HW)
– Have more than 1 core to execute necessary functions
• Multi threading (OS)
– OS gives the facility to execute an application in many
paths(threads)
• Graphic Processor ( HW –> CPU + Graphic processing)
– CPU is generally sequential processing compared to GPU which
is parallel processing mainly suited for Graphic apps. ( Engg,
scientific and 3D animation etc..)
• Networks ( as seen in previous slides) + Storage area
network ( SAN)
• Virtualisation ( imitating as if physical devices are available)
Multithreading
Scaling – What and Why ?
• What ?
– It is the ability of a computer application (software / OS) or
product (hardware / network) to continue to function well with
high reliability when the need goes high (up) or low (down)
• Why ?
– Business growth forces organisation’s growth
– No. of functions in an application increase over a period
– No. of users increase
– Users get spread across the globe
– Demand for resources increase ( memory, proc speed etc.)
– Data becomes huge
Need of more Computing Power:
Grand Challenge Applications
Solving technology problems using
computer modeling, simulation and analysis

Geographic
Geographic
Information
Information
Systems
Systems
Aerospace
Aerospace
Life
LifeSciences
Sciences

Mechanical Design & Analysis (CAD/CAM)
Mechanical Design & Analysis (CAD/CAM)
Scalable Computing

P
E
R
F 2100 2100 2100 2100

O
R
M
A
N 2100 2100 2100 2100

C 2100

E Administrative Barriers

+ •Individual
•Group
Q •Department
o •Campus
•State
S •National
•Globe

Personal Device Local Enterprise Global Inter Planet


Main frames
Cluster Cluster/Grid Grid Grid
Distributed system ?
• Definition:
– A Set of nodes, connected by a network, which
appear to its users as a single coherent system.
– Examples
• Internet ( interconnection of globally distributed
computers)
• WWW ( Interlinked hypertext)
• Mobile network (interconnection of globally distributed
computers and mobiles devices)
Why DS is important ?
• Resource sharing
• Load Sharing
• Improves scalability
• Improves reliability
• Geographically distributed
• Works during partial failure
• Parallel processing, so performance improved
– PASS, dev by IBM in 1981 used in space shuttle.
– Could have been done by 1 m/c
– IBM did with 4 nodes for fault tolerance
DS Evolution

• Late 70’s and early 80’s: synchronous with


distributed processing
• Mid 80’s: no point of central control
• Late 80’s: peer structure and inter-connection
configuration
• Later: more fine-grained distribution
– Software is decomposed into components
– Components can resides on different computers and be implemented with
different languages

Distributed Systems 36
Examples of Distributed Systems
• IT Service Architecture of a Swiss Bank
– Service architecture consists of heterogeneous new and legacy components
– Hardware platforms range from mainframes to NTs
– Programming languages including Assembler, Cobol, C, C++, Java, …
– Different types of middleware can be used to resolve distribution and
heterogeneity

Authorization
Customer Services
Information
Services

Trading
Services
ATM
Services

Distributed Systems 38
Transparency in Distributed Systems

• Distributed System Definition: A distributed system is a


collection of autonomous hosts, heterogeneous in
nature, that are connected through a computer
network and coordinate with each other in such a way
that users perceive the system as a single and
integrated computing facility.

• Dimensions of transparency in DS
– Access Transparency
– Location Transparency
– Migration Transparency
– Replication Transparency
– Concurrency Transparency
– Scalability Transparency
– Performance Transparency
– Failure Transparency

Distributed Systems 39
DS categories
• Clusters
– A computer cluster consists of a set of loosely or tightly
connected computers that work together so that, in many respects, they can be
viewed as a single system. Here, computer clusters have each node set to
perform the same task ( homogeneous) , controlled and scheduled by software
• Cloud
– Internet-based computing, where different services — such as servers, storage
and applications are delivered to an organization's computers and devices
through the Internet.
• Grid
– Grid computing is the collection of computer resources from multiple locations
to reach a common goal.
– The grid can be thought of as a distributed system with non-interactive
workloads that involve a large number of files.
– Grid computing is distinguished from conventional high performance
computing systems such as clusters in that grid computers have each node set
to perform a different task/application.
Order of increasing complexity
Centralised Vs Distributed
Centralised Distributed
Many jobs done on a single CPU Jobs are distributed to many CPUs which
are interconnected
Uses Shared memory, Variables concept NO Shared memory, Variables concept
Single global clock to sync ops No global clock
CLUSTERS FOR COOPERATIVE COMPUTING -

CLUSTER ARCHITECTURE
What is a cluster?
• A cluster is a type of parallel or distributed processing system,
which consists of a collection of physically interconnected stand-
alone computers cooperatively working together as a single,
integrated computing resource.
• A typical cluster:
– Network: Faster, closer connection than a typical network
(LAN)
– Low latency communication protocols ( Slowly responding …
between service initiation to result)
– Looser connection than SMP ( symmetric multi processor)
• Use ?
– High Availability in comparison to standalone
– High Performance in comparison to standalone
– Scalable
A cluster of servers (S1, S2,…,Sn) interconnected
by a high-bandwidth system-area or
local-area network with shared I/O devices and
disk arrays.
The cluster acts as a single computing
node may be attached to the Internet
through a gateway.
High-availability clusters implementations are best
for mission-critical applications or databases,
mail, file and print, web, or application servers.
Cluster High Performance thru Load Balancing
CLUSTER BENEFITS

The main benefits of clusters are:


1.Availability
2.Performance
3.Scalability

These benefits map to needs of today's


enterprise business, education, military and
scientific community infrastructures.
Grid computing
What’s Grid ?
• Similar to electric power grid
• Electric power grid makes the job of electricity
users very very simple !
• Similarly for making computing power as easy
and as simple, computer grid is used.
GRID Computing visualisation
What’s Grid computing ?
• Grid computing is the collection of computer resources from multiple
locations connected together to reach a common goal.
• The grid can be thought of as a distributed system with non-interactive
workloads that involve a large number of files (data).
• Grid computing is distinguished from conventional high performance
computing systems such as cluster computing in that grid computers have
each node set to perform a different task/application
• Grid computers also tend to be more heterogeneous and geographically
dispersed (thus not physically coupled) than cluster computers.
• Grid sizes are generally quite large .
• Although a single grid can be dedicated to a particular application,
commonly a grid is used for a variety of purposes.
• Grids are often constructed with general-purpose grid middleware
software libraries.
An example computational Grid built over specialized computers at
three resource sites at Wisconsin, Caltech, and Illinois.
GRID ARCHITECTURE
5 Layers of
Grid Architecture
• Fabric Layer
– All physical hardware resources ( storage, sensors, computing servers,
network etc)
• Connectivity Layer
– Connects resources request to actual resources ( establishes
communication) – user permission, data security etc.
• Resource services
– Forming of various resources ( HW and SW) to run an application
under collective operations
• Collective Layer
– Responsible for Monitoring, diagnostics, replication , authorise and
authenticate etc…
• Application Layer
– All application, Control programs , SW dev kits reside here
Distributed Computing  GRID
• Distributed computing is an “older term”
– Typically built around proprietary software and
network
– Tightly couples systems/organization
• Grid is an evolution of distributed computing
– Dynamic
– Geographically independent
– Built around standards
– Internet backbone
– Open software
Criteria for a Grid:
Large scale ( a few million resources)
Geographically distributed
Systems heterogeneous in nature
Multiple Administrators
Single , common Goal
Coordinates resources that are not subject to centralized
control.
Uses standard, open, general-purpose protocols and
interfaces.
Delivers nontrivial qualities of service.

Benefits
Exploit Underutilized resources
Load Balancing of Resource
Virtualize resources across an enterprise
Data Grids, Compute Grids
Enable collaboration for virtual organizations
Grid Applications
Data and computationally intensive applications:
This technology has been applied to computationally-intensive scientific,
mathematical, and academic problems like drug discovery, economic
forecasting, seismic analysis , back office data processing in support of
e-commerce
• A chemist may utilize hundreds of processors to screen thousands of
compounds per hour.
• Teams of engineers worldwide pool resources to analyze terabytes of
structural data.
• Meteorologists seek to visualize and analyze petabytes of climate data
with enormous computational demands.
Resource sharing
– Computers, storage, sensors, networks, …
– Sharing always conditional: issues of trust, policy, negotiation,
payment, …
Coordinated problem solving
– distributed data analysis, computation, collaboration, …
Computational Grid Applications

• Biomedical research
• Industrial research
• Engineering research
• Studies in Physics and Chemistry
Grid Topologies

• Intragrid
– Local grid within an organisation
– Trust based on personal contracts
• Extragrid
– Resources of a consortium of organisations
connected through a (Virtual) Private Network
– Trust based on Business to Business contracts
• Intergrid
– Global sharing of resources through the internet
– Trust based on certification
A typical view of Grid environment

Grid InformationGrid Information Service


Service system Details of Grid
resources
collects the details of
the available Grid 1
resources and passes
the information to 2
the resource broker. 4
Computational
3 jobs
Grid
application
Processed jobs
Computation
result
User
A User sends
Resource Broker
A Resource Broker
computation or data
intensive application
distribute the jobs in an Grid Resources
application to the Grid Grid Resources (Cluster,
to Global Grids in resources based on user’s PC, Supercomputer,
order to speed up the QoS requirements and database, instruments,
execution of the details of available Grid etc.) in the Global Grid
application. resources for further execute the user jobs.
Elements of Grid Computing

• Resource sharing
– Computers, data, storage, sensors, networks, …
– Sharing always conditional: issues of trust, policy, negotiation,
payment, …
• Coordinated problem solving
– Beyond client-server: distributed data analysis, computation,
collaboration, …
• Dynamic, multi-institutional virtual organizations
– Community overlays on classic org structures
– Large or small, static or dynamic
SOA
What are “Web Services” and SOA?

• What is a service ?
– A service is a software entity ( which is available in a public domain)
that can be discovered and invoked by other software systems.
• What is Web service ?
– A Web service is a software system identified by a Uniform Resource
Identifier (URI), whose public interfaces and bindings are defined and
described using XML. Its definition can be discovered by other
software systems. These systems may then interact with the Web
service in a manner prescribed by its definition, using XML based
messages conveyed by internet protocols
• What is SOA ?
– A service-oriented architecture (SOA) is an architectural pattern in
computer software design in which application components provide
services to other components via a communications protocol, typically
over a network. The principles of service-orientation are independent
of any vendor, product or technology.
Web Services Basic Architecture

• Web services are consistent with


established Service-Oriented
Architecture(SOA) principles.
• Web Services Component
– Service
– Service Description
• Web services Roles
– Service providers.
– Service brokers
– Service requestors.
• Web Services Operations
– Publish
– Find
– Interact
The Conceptual Web Service Stacks
•Format: XML,Extensible Markup Language, to define structures of documents

•Search and Find: UDDI


Universal Discovery Description and
Integration, for registries of Web services

•Description: WSDL
Web Services Description Language, to describe
networked XML-based services
•Messaging: SOAP
•Simple Object Access Protocol, to define a
uniform way of passing XML-encoded data.
•Transport: Internet Protocol,HTTP, STMP,
etc.
Operations and Message Flow
CLOUD COMPUTING
Cloud computing ?
• Cloud computing is using the internet to
access someone else's software running on
someone else's hardware in someone else's
data center.
NIST Cloud Definition

“The National Institute of Standards and Technology


(NIST) defines cloud computing as a "pay-per-use
model for enabling available, convenient and on-
demand network access to a shared pool of
configurable computing resources (e.g., networks,
servers, storage, applications and services) that can be
rapidly provisioned and released with minimal
management effort or service provider interaction."

71
April 2009
Architecture

NIST Visual Model of Cloud Computing Definition 72


Cloud Computing - Services

Software as a Service - SaaS


Platform as a Service - PaaS
Infrastructure as a Service - IaaS
Cloud Deployment Models
• Public Cloud.
• Private Cloud.
• Community Cloud.
• Hybrid Cloud.

74
Cloud Security Forum Ref. Architecture

10/15/2013 Cloud Reference Architecture(s) IIT-RTC 75


GRID Computing visualisation
Grid Vs Cloud
• Common to both
– Maximise the use of existing resources
– Economise computing cost
– Uses abstraction extensively ( concepts looks similar)
• Grid
– one large job is divided into many small portions and executed
on multiple machines.
– Meant for large scale computing
• Cloud
– allow the users to avail of various services without investing in
the underlying architecture ( HW,SW,NW)
– offers many different services, from web hosting, right down to
word processing
Grid Vs Cloud
Grid Computing Cloud Computing

Access to Shared computing power, Access to Leased computing power,


storage and other resources storage and other resources

Big Org. pay for the services ( Univ. / Users pay for the leased services
Govt)
Managed by providers and Users ( Univ. / Managed by Cloud providers only. No role
Govt) for user.
Big Org. Provide the services ( Univ. / Commercial companies provide services
Govt/ Research org.)

Handles huge amount of data, complex Handles reasonable amount of data for
calculations ( e.g.weather forecast) commercial and other org.
Uses open source technology Uses proprietary technology

Suitable for research org. to carry out Suitable for business houses to effectively
tasks like global climate modelling etc.. use computing without investing
Standards for GRID environment

• OGSA
– Open Grid Service Architecture
– defines requirements for framework for grid systems
that support integration, virtualization and
management.
– Such a framework requires a core set of interfaces,
expected behaviors, resource models bindings.
• OGSI
• OGSA-DAI
• Grid FTP
• WSRF and etc.
Part A -- 2 marks Questions
1. What are the 3 components of a network ?
2. What is world wide web (WWW) ?
3. What is webpage ?
4. What is meant by scaling ?
5. What is the need for scaling ?
6. What is multicore architecture ?
7. What is multi threading?
8. What is GPU ? How is it different from CPU ?
9. Define distributed system
10. What are the advantages of distributed systems?
11. List the categories of distributed computing systems?
12. Differentiate between centralized and distributed computing.
13. Define Clusters?
14. List the cluster computing benefits.
15. What is meant by Grid computing ?
16. What are the benefits of Grid computing ?
17. List 3 applications of Grid computing
18. What is meant by Cloud computing ?
19. List the deployment modes of cloud computing.
20. What are the 3 services offered by Cloud computing ?
21. List the essential characteristics of Cloud computing
22. Explain – Web Service.
23. Define Service oriented Architecture ( SOA)
24. What is ‘Internet of Things’ ?
Part B – Big Questions
1. Explain in detail the evolution of distributed computing
2. Explain with a diagram Cluster architecture
3. Explain with a diagram Grid architecture
4. Explain with a diagram Cloud architecture
5. Explain with a diagram the Service oriented architecture
6. Explain with neat diagrams, how high availability and high
performance achieved through cooperative cluster
environment
7. Write a short note on
– Muticore processor
– Multi threading
– Graphic processor
End of Unit 1

Recap and discussions

You might also like