Grid and Cloud Computing

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 46

GRID AND CLOUD COMPUTING

Kai Hwang, Geoffery C. Fox and Jack J. Dongarra, “Distributed


and Cloud Computing: Clusters, Grids, Clouds and the Future
of Internet”, First Edition, Morgan Kaufman Publisher.

Prepapred by AbdurRahman/AP/CSE/JCE 1
UNIT I INTRODUCTION
• Evolution of Distributed computing: Scalable
computing over the Internet –
• Technologies for network based systems –
• clusters of cooperative computers –
• Grid computing Infrastructures –
• Cloud computing –
• service oriented architecture –
• Introduction to Grid Architecture and standards
• Elements of Grid –
• Overview of Grid Architecture.
Prepapred by AbdurRahman/AP/CSE/JCE 2
UNIT II GRID SERVICES
• Introduction to Open Grid Services
Architecture (OGSA) –
• Motivation –
• Functionality Requirements –
• Practical & Detailed view of OGSA/OGSI – Data
intensive grid service models –
• OGSA services.

Prepapred by AbdurRahman/AP/CSE/JCE 3
UNIT III VIRTUALIZATION
• Cloud deployment models: public, private, hybrid,
community –
• Categories of cloud computing: Everything as a
service: Infrastructure, platform, software –
• Pros and Cons of cloud computing
• Implementation levels of virtualization
• Virtualization structure –
• Virtualization of CPU, Memory and I/O devices
• Virtual clusters and Resource Management
• Virtualization for data center automation
Prepapred by AbdurRahman/AP/CSE/JCE 4
UNIT IV PROGRAMMING MODEL
• Open source grid middleware packages –
• Globus Toolkit (GT4) Architecture , Configuration –
• Usage of Globus –
• Main components and Programming model -
Introduction to Hadoop Framework –
• Mapreduce, Input splitting, map and reduce functions,
specifying input and output parameters, configuring
and running a job –
• Design of Hadoop file system, HDFS concepts,
command line and java interface, dataflow of File read
& File write.

Prepapred by AbdurRahman/AP/CSE/JCE 5
UNIT V SECURITY
• Trust models for Grid security environment
• Authentication and Authorization methods – Grid
security infrastructure –
• Cloud Infrastructure security: network, host and
application level –
• Aspects of data security, provider data and its
security,
• Identity and access management architecture
• IAM practices in the cloud, SaaS, PaaS, IaaS
availability in the cloud, Key privacy issues in the
cloud.
Prepapred by AbdurRahman/AP/CSE/JCE 6
Unit –I
Scalable computing over the Internet
• Evolutionary changes in Technology:
• Machine architecture,
• Operating system platform,
• Network connectivity, and
• Application workload
• The Age of Internet Computing
• High-performance computing (HPC)
• High-throughput computing (HTC)

Prepapred by AbdurRahman/AP/CSE/JCE 7
The Platform Evolution
Year Technology Example

1950 to 1970 Mainframes IBM 360 and CDC 6400

1960 to 1980 Minicomputers DEC PDP 11 and VAX

1970 to 1990, Personal computers

1980 to 2000 Portable computers and


pervasive devices

Prepapred by AbdurRahman/AP/CSE/JCE 8
Evolution of HPC and HTC systems

Prepapred by AbdurRahman/AP/CSE/JCE 9
HPC vs HTC
• High-Performance Computing
– large amounts of computing for short period.
– Gflops in 1990s to Pflops in 2010.
– Pflops =1000 tflops / tflop = 1000 gflops
• High-Throughput Computing
– large amounts of computing, for much longer time.
– how many jobs
– high-flux computing

Prepapred by AbdurRahman/AP/CSE/JCE 10
Three New Computing Paradigms
• Radio-frequency identification (RFID),
• Global Positioning System (GPS), and
• Internet of Things (IoT).

Prepapred by AbdurRahman/AP/CSE/JCE 11
Computing Paradigm Distinctions
• Distributed computing is the opposite of
centralized computing.
• Parallel computing overlaps with distributed
computing.
• Cloud computing overlaps with distributed,
centralized, and parallel computing.

Prepapred by AbdurRahman/AP/CSE/JCE 12
Computing Paradigm Contd.,
• Centralized computing: All resources are
shared within one integrated OS.
• Parallel computing: Processors are tightly
coupled with centralized shared memory.
• Distributed computing: Multiple autonomous
computers, each with own private memory.
• Cloud computing: Clouds can be built with
physical or virtualized resources.

Prepapred by AbdurRahman/AP/CSE/JCE 13
Scalable Computing Trends and New
Paradigms
• Moore’s law: Processor speed / 18 months
• Gilder’s law: Bandwidth / year
• Degrees of Parallelism
– bit-level parallelism (BLP) : 4-bit mp 8-,16-, 32-,
and 64-bit CPUs
– Instruction-level parallelism (ILP):pipelining,
superscalar computing, VLIW (very long
instruction word) architectures, and
multithreading.
Prepapred by AbdurRahman/AP/CSE/JCE 14
Degrees of Parallelism Contd.,
• Data-level parallelism (DLP) : SIMD (single
instruction, multiple data).
• Task-level parallelism (TLP): since chip
multiprocessors (CMPs).
• Job-level parallelism (JLP).

Prepapred by AbdurRahman/AP/CSE/JCE 15
Innovative Applications

Prepapred by AbdurRahman/AP/CSE/JCE 16
The Trend toward Utility Computing

Prepapred by AbdurRahman/AP/CSE/JCE 17
The Hype Cycle of New Technologies

Prepapred by AbdurRahman/AP/CSE/JCE 18
The Internet of Things and Cyber-
Physical Systems
• The Internet of Things
– 1,000 to 5,000
– H2H (human-to-human),
– H2T (human-to-thing),
– T2T (thing-to-thing)
• Intelligent cities, clean water, efficient power,
convenient transportation, good food supplies,
responsible banks, fast telecommunications,
green IT, better schools, good health care,
abundant resources, and so on.
Prepapred by AbdurRahman/AP/CSE/JCE 19
Cyber-physical system
• A cyber-physical system (CPS) is the result of
interaction between computational processes
and the physical world.
• A CPS merges the “3C” technologies of
computation, communication, and control.
• CPS emphasizes exploration of virtual reality
(VR) applications.

Prepapred by AbdurRahman/AP/CSE/JCE 20
Technologies for network based systems
Multicore CPUs and Multithreading Technologies
• Millions of instructions per second (MIPS)
• Mbps or Gbps
• Advances in CPU Processors
• Multicore architecture with dual, quad, six, or more
processing cores.
• 10 MHz for the Intel 286 to 4 GHz for the Pentium 4.
• ILP : Modern CPU
• DLP and TLP : Graphics processing units (GPUs)
• Multi-core CPU and many-core GPU handle multiple
instruction threads.

Prepapred by AbdurRahman/AP/CSE/JCE 21
Improvement in processor and
network technologies over 33 years.

Prepapred by AbdurRahman/AP/CSE/JCE 22
Typical Multicore processor

Prepapred by AbdurRahman/AP/CSE/JCE 23
Multicore CPUs and Multithreading
Technologies Contd.,
• Multicore CPU and Many-Core GPU
– Multicore CPUs from the tens of cores to hundreds.
– Limited in Parallelism due to Memory wall Problem.
– GPUs with hundreds or more thin cores.
– RISC replaced with multicore x-86 processors.
– CPU cores and thin GPU on same chip.

Prepapred by AbdurRahman/AP/CSE/JCE 24
Multicore CPUs and Multithreading
Technologies Contd.,
• Multithreading Technology

Prepapred by AbdurRahman/AP/CSE/JCE 25
GPU Computing to Exascale and
Beyond
• A GPU is a graphics coprocessor or accelerator
mounted on a computer’s graphics card or
video card.
– GeForce 256 by NVIDIA in 1999.
– 10 million polygons per second.
– Xeon X5670 CPU has six cores.
– GPU chip with hundreds of processing cores.
– GPGPUs: NVIDIA’s CUDA model.

Prepapred by AbdurRahman/AP/CSE/JCE 26
GPU Computing to Exascale and
Beyond Contd.,
• How GPUs Work
– as coprocessors.
– NVIDIA GPU 128 cores on a single chip.
– Eight thread / Core ~ 1024 threads.
– Not restricted to graphics or video coding.
– GPU offloads the CPU.

Prepapred by AbdurRahman/AP/CSE/JCE 27
GPU Computing to Exascale and
Beyond Contd.,
• GPU Programming Model
– Offloads CPU
– Match bandwidth on-board

Prepapred by AbdurRahman/AP/CSE/JCE 28
Example : NVIDIA Fermi GPU

Prepapred by AbdurRahman/AP/CSE/JCE 29
GPU Computing to Exascale and
Beyond Contd.,
• Power Efficiency of the GPU
– CPU Consumes 2 nJ/instruction
– 200 pJ/instruction, which is 1/10 less

Prepapred by AbdurRahman/AP/CSE/JCE 30
Memory, Storage, and Wide-Area
Networking
“Tape units are dead, disks are tape units, flashes are disks, and memory are caches
now.”

• Memory Technology

Prepapred by AbdurRahman/AP/CSE/JCE 31
Memory, Storage, and Wide-Area
Networking Contd.,
• System-Area Interconnects
– LAN : client hosts to big servers.
– SAN : servers to network storage.
– NAS : client hosts directly to the disk arrays.

Prepapred by AbdurRahman/AP/CSE/JCE 32
Memory, Storage, and Wide-Area
Networking Contd.,
• Wide-Area Networking
– 10 Mbps in 1979 to 1 Gbps in 1999,
– 40 ~ 100 GE in 2011.
– Speculated 1 Tbps network by 2013.
– According to Berman, Fox, and Hey : 1,000, 1,000,
100, 10, and 1 Gbps bandwidths were reported,
respectively, for international, national,
organization, optical desktop, and copper desktop
connections in 2006.

Prepapred by AbdurRahman/AP/CSE/JCE 33
Virtual Machines and Virtualization
Middleware
• Single OS image.
• Different instruction set.

Prepapred by AbdurRahman/AP/CSE/JCE 34
VM Primitive Operations

Prepapred by AbdurRahman/AP/CSE/JCE 35
Data Center Virtualization for Cloud
Computing
• Data Center Growth and Cost Breakdown
– 30 percent IT equipment (such as servers and
disks)
– 33 percent is attributed to the chiller,
– 18 percent to the uninterruptible power supply
(UPS),
– 9 percent to computer room air conditioning
(CRAC), and
– 7 percent to power distribution, lighting, and
transformer costs.

Prepapred by AbdurRahman/AP/CSE/JCE 36
Grid Computing Infrastructures
• Telnet
• HTTP
• Grid computing
• Forbes Magazine :
– $1 trillion in 2001 to $20 trillion by 2015

Prepapred by AbdurRahman/AP/CSE/JCE 37
Computational Grids
• A computing grid offers an infrastructure that
couples computers, software/middleware,
special instruments, and people and sensors.
• Virtual platforms to support Virtual
organizations.
• Grid System : Workstations, servers, clusters,
and supercomputers.
• Personal computers, laptops, and PDAs.

Prepapred by AbdurRahman/AP/CSE/JCE 38
Computational Grids Contd.,

Prepapred by AbdurRahman/AP/CSE/JCE 39
Grid Computing Infrastructures Contd.,
• Example:
– TeraGrid in US
– EGEE (Enabling Grids for E-Science in Europe)
– ChinaGrid
• Grid Families
– GARUDA(Global Access to Resource Using Distributed Architecture)
– IBM, Sun, HP, Dell, Cisco, EMC, and Platform
Computing.
– Categories :
• Data /Comp Grid
• P2P
Prepapred by AbdurRahman/AP/CSE/JCE 40
Peer to Peer Network
• Example of Distributed System : C-S Arch
• P2P Architecture offers Distributed Model.
• Client Oriented
• P2P Systems
– Node as Client /Server.
– Autonomous.
– No Central Coordination.
– TCP/IP and NAI

Prepapred by AbdurRahman/AP/CSE/JCE 41
Peer to Peer Network Contd.,
Network Overlays

Types:
Structured Overlay
Unstructured Overlay
Prepapred by AbdurRahman/AP/CSE/JCE 42
Peer to Peer Network Contd.,
[Application Families]
• Categories:
– Distributed file sharing
• Gnutella, Napster, and BitTorrent
– Collaboration
• MSN or Skype chatting, instant messaging.
– Distributed P2P computing
• SETI@home - 25 Tflops - 3 million Internet Host.
– P2P platforms
• JXTA, .NET, and FightingAID@home, support naming,
discovery, communication, security, and resource
aggregation
Prepapred by AbdurRahman/AP/CSE/JCE 43
Cloud Computing over the Internet
• Computational science is changing to be data-
intensive.
• Sending the computations (programs) to the
data, rather than copying the data to the
workstations.
• Cloud Computing.
“A cloud is a pool of virtualized computer resources.
A cloud can host a variety of different workloads,
including batch-style backend jobs and interactive
and user-facing applications.” - IBM

Prepapred by AbdurRahman/AP/CSE/JCE 44
Internet Clouds

Prepapred by AbdurRahman/AP/CSE/JCE 45
Cloud service models

Prepapred by AbdurRahman/AP/CSE/JCE 46

You might also like