Grid and Cloud Computing
Grid and Cloud Computing
Grid and Cloud Computing
Prepapred by AbdurRahman/AP/CSE/JCE 1
UNIT I INTRODUCTION
• Evolution of Distributed computing: Scalable
computing over the Internet –
• Technologies for network based systems –
• clusters of cooperative computers –
• Grid computing Infrastructures –
• Cloud computing –
• service oriented architecture –
• Introduction to Grid Architecture and standards
• Elements of Grid –
• Overview of Grid Architecture.
Prepapred by AbdurRahman/AP/CSE/JCE 2
UNIT II GRID SERVICES
• Introduction to Open Grid Services
Architecture (OGSA) –
• Motivation –
• Functionality Requirements –
• Practical & Detailed view of OGSA/OGSI – Data
intensive grid service models –
• OGSA services.
Prepapred by AbdurRahman/AP/CSE/JCE 3
UNIT III VIRTUALIZATION
• Cloud deployment models: public, private, hybrid,
community –
• Categories of cloud computing: Everything as a
service: Infrastructure, platform, software –
• Pros and Cons of cloud computing
• Implementation levels of virtualization
• Virtualization structure –
• Virtualization of CPU, Memory and I/O devices
• Virtual clusters and Resource Management
• Virtualization for data center automation
Prepapred by AbdurRahman/AP/CSE/JCE 4
UNIT IV PROGRAMMING MODEL
• Open source grid middleware packages –
• Globus Toolkit (GT4) Architecture , Configuration –
• Usage of Globus –
• Main components and Programming model -
Introduction to Hadoop Framework –
• Mapreduce, Input splitting, map and reduce functions,
specifying input and output parameters, configuring
and running a job –
• Design of Hadoop file system, HDFS concepts,
command line and java interface, dataflow of File read
& File write.
Prepapred by AbdurRahman/AP/CSE/JCE 5
UNIT V SECURITY
• Trust models for Grid security environment
• Authentication and Authorization methods – Grid
security infrastructure –
• Cloud Infrastructure security: network, host and
application level –
• Aspects of data security, provider data and its
security,
• Identity and access management architecture
• IAM practices in the cloud, SaaS, PaaS, IaaS
availability in the cloud, Key privacy issues in the
cloud.
Prepapred by AbdurRahman/AP/CSE/JCE 6
Unit –I
Scalable computing over the Internet
• Evolutionary changes in Technology:
• Machine architecture,
• Operating system platform,
• Network connectivity, and
• Application workload
• The Age of Internet Computing
• High-performance computing (HPC)
• High-throughput computing (HTC)
Prepapred by AbdurRahman/AP/CSE/JCE 7
The Platform Evolution
Year Technology Example
Prepapred by AbdurRahman/AP/CSE/JCE 8
Evolution of HPC and HTC systems
Prepapred by AbdurRahman/AP/CSE/JCE 9
HPC vs HTC
• High-Performance Computing
– large amounts of computing for short period.
– Gflops in 1990s to Pflops in 2010.
– Pflops =1000 tflops / tflop = 1000 gflops
• High-Throughput Computing
– large amounts of computing, for much longer time.
– how many jobs
– high-flux computing
Prepapred by AbdurRahman/AP/CSE/JCE 10
Three New Computing Paradigms
• Radio-frequency identification (RFID),
• Global Positioning System (GPS), and
• Internet of Things (IoT).
Prepapred by AbdurRahman/AP/CSE/JCE 11
Computing Paradigm Distinctions
• Distributed computing is the opposite of
centralized computing.
• Parallel computing overlaps with distributed
computing.
• Cloud computing overlaps with distributed,
centralized, and parallel computing.
Prepapred by AbdurRahman/AP/CSE/JCE 12
Computing Paradigm Contd.,
• Centralized computing: All resources are
shared within one integrated OS.
• Parallel computing: Processors are tightly
coupled with centralized shared memory.
• Distributed computing: Multiple autonomous
computers, each with own private memory.
• Cloud computing: Clouds can be built with
physical or virtualized resources.
Prepapred by AbdurRahman/AP/CSE/JCE 13
Scalable Computing Trends and New
Paradigms
• Moore’s law: Processor speed / 18 months
• Gilder’s law: Bandwidth / year
• Degrees of Parallelism
– bit-level parallelism (BLP) : 4-bit mp 8-,16-, 32-,
and 64-bit CPUs
– Instruction-level parallelism (ILP):pipelining,
superscalar computing, VLIW (very long
instruction word) architectures, and
multithreading.
Prepapred by AbdurRahman/AP/CSE/JCE 14
Degrees of Parallelism Contd.,
• Data-level parallelism (DLP) : SIMD (single
instruction, multiple data).
• Task-level parallelism (TLP): since chip
multiprocessors (CMPs).
• Job-level parallelism (JLP).
Prepapred by AbdurRahman/AP/CSE/JCE 15
Innovative Applications
Prepapred by AbdurRahman/AP/CSE/JCE 16
The Trend toward Utility Computing
Prepapred by AbdurRahman/AP/CSE/JCE 17
The Hype Cycle of New Technologies
Prepapred by AbdurRahman/AP/CSE/JCE 18
The Internet of Things and Cyber-
Physical Systems
• The Internet of Things
– 1,000 to 5,000
– H2H (human-to-human),
– H2T (human-to-thing),
– T2T (thing-to-thing)
• Intelligent cities, clean water, efficient power,
convenient transportation, good food supplies,
responsible banks, fast telecommunications,
green IT, better schools, good health care,
abundant resources, and so on.
Prepapred by AbdurRahman/AP/CSE/JCE 19
Cyber-physical system
• A cyber-physical system (CPS) is the result of
interaction between computational processes
and the physical world.
• A CPS merges the “3C” technologies of
computation, communication, and control.
• CPS emphasizes exploration of virtual reality
(VR) applications.
Prepapred by AbdurRahman/AP/CSE/JCE 20
Technologies for network based systems
Multicore CPUs and Multithreading Technologies
• Millions of instructions per second (MIPS)
• Mbps or Gbps
• Advances in CPU Processors
• Multicore architecture with dual, quad, six, or more
processing cores.
• 10 MHz for the Intel 286 to 4 GHz for the Pentium 4.
• ILP : Modern CPU
• DLP and TLP : Graphics processing units (GPUs)
• Multi-core CPU and many-core GPU handle multiple
instruction threads.
Prepapred by AbdurRahman/AP/CSE/JCE 21
Improvement in processor and
network technologies over 33 years.
Prepapred by AbdurRahman/AP/CSE/JCE 22
Typical Multicore processor
Prepapred by AbdurRahman/AP/CSE/JCE 23
Multicore CPUs and Multithreading
Technologies Contd.,
• Multicore CPU and Many-Core GPU
– Multicore CPUs from the tens of cores to hundreds.
– Limited in Parallelism due to Memory wall Problem.
– GPUs with hundreds or more thin cores.
– RISC replaced with multicore x-86 processors.
– CPU cores and thin GPU on same chip.
Prepapred by AbdurRahman/AP/CSE/JCE 24
Multicore CPUs and Multithreading
Technologies Contd.,
• Multithreading Technology
Prepapred by AbdurRahman/AP/CSE/JCE 25
GPU Computing to Exascale and
Beyond
• A GPU is a graphics coprocessor or accelerator
mounted on a computer’s graphics card or
video card.
– GeForce 256 by NVIDIA in 1999.
– 10 million polygons per second.
– Xeon X5670 CPU has six cores.
– GPU chip with hundreds of processing cores.
– GPGPUs: NVIDIA’s CUDA model.
Prepapred by AbdurRahman/AP/CSE/JCE 26
GPU Computing to Exascale and
Beyond Contd.,
• How GPUs Work
– as coprocessors.
– NVIDIA GPU 128 cores on a single chip.
– Eight thread / Core ~ 1024 threads.
– Not restricted to graphics or video coding.
– GPU offloads the CPU.
Prepapred by AbdurRahman/AP/CSE/JCE 27
GPU Computing to Exascale and
Beyond Contd.,
• GPU Programming Model
– Offloads CPU
– Match bandwidth on-board
Prepapred by AbdurRahman/AP/CSE/JCE 28
Example : NVIDIA Fermi GPU
Prepapred by AbdurRahman/AP/CSE/JCE 29
GPU Computing to Exascale and
Beyond Contd.,
• Power Efficiency of the GPU
– CPU Consumes 2 nJ/instruction
– 200 pJ/instruction, which is 1/10 less
Prepapred by AbdurRahman/AP/CSE/JCE 30
Memory, Storage, and Wide-Area
Networking
“Tape units are dead, disks are tape units, flashes are disks, and memory are caches
now.”
• Memory Technology
Prepapred by AbdurRahman/AP/CSE/JCE 31
Memory, Storage, and Wide-Area
Networking Contd.,
• System-Area Interconnects
– LAN : client hosts to big servers.
– SAN : servers to network storage.
– NAS : client hosts directly to the disk arrays.
Prepapred by AbdurRahman/AP/CSE/JCE 32
Memory, Storage, and Wide-Area
Networking Contd.,
• Wide-Area Networking
– 10 Mbps in 1979 to 1 Gbps in 1999,
– 40 ~ 100 GE in 2011.
– Speculated 1 Tbps network by 2013.
– According to Berman, Fox, and Hey : 1,000, 1,000,
100, 10, and 1 Gbps bandwidths were reported,
respectively, for international, national,
organization, optical desktop, and copper desktop
connections in 2006.
Prepapred by AbdurRahman/AP/CSE/JCE 33
Virtual Machines and Virtualization
Middleware
• Single OS image.
• Different instruction set.
Prepapred by AbdurRahman/AP/CSE/JCE 34
VM Primitive Operations
Prepapred by AbdurRahman/AP/CSE/JCE 35
Data Center Virtualization for Cloud
Computing
• Data Center Growth and Cost Breakdown
– 30 percent IT equipment (such as servers and
disks)
– 33 percent is attributed to the chiller,
– 18 percent to the uninterruptible power supply
(UPS),
– 9 percent to computer room air conditioning
(CRAC), and
– 7 percent to power distribution, lighting, and
transformer costs.
Prepapred by AbdurRahman/AP/CSE/JCE 36
Grid Computing Infrastructures
• Telnet
• HTTP
• Grid computing
• Forbes Magazine :
– $1 trillion in 2001 to $20 trillion by 2015
Prepapred by AbdurRahman/AP/CSE/JCE 37
Computational Grids
• A computing grid offers an infrastructure that
couples computers, software/middleware,
special instruments, and people and sensors.
• Virtual platforms to support Virtual
organizations.
• Grid System : Workstations, servers, clusters,
and supercomputers.
• Personal computers, laptops, and PDAs.
Prepapred by AbdurRahman/AP/CSE/JCE 38
Computational Grids Contd.,
Prepapred by AbdurRahman/AP/CSE/JCE 39
Grid Computing Infrastructures Contd.,
• Example:
– TeraGrid in US
– EGEE (Enabling Grids for E-Science in Europe)
– ChinaGrid
• Grid Families
– GARUDA(Global Access to Resource Using Distributed Architecture)
– IBM, Sun, HP, Dell, Cisco, EMC, and Platform
Computing.
– Categories :
• Data /Comp Grid
• P2P
Prepapred by AbdurRahman/AP/CSE/JCE 40
Peer to Peer Network
• Example of Distributed System : C-S Arch
• P2P Architecture offers Distributed Model.
• Client Oriented
• P2P Systems
– Node as Client /Server.
– Autonomous.
– No Central Coordination.
– TCP/IP and NAI
Prepapred by AbdurRahman/AP/CSE/JCE 41
Peer to Peer Network Contd.,
Network Overlays
Types:
Structured Overlay
Unstructured Overlay
Prepapred by AbdurRahman/AP/CSE/JCE 42
Peer to Peer Network Contd.,
[Application Families]
• Categories:
– Distributed file sharing
• Gnutella, Napster, and BitTorrent
– Collaboration
• MSN or Skype chatting, instant messaging.
– Distributed P2P computing
• SETI@home - 25 Tflops - 3 million Internet Host.
– P2P platforms
• JXTA, .NET, and FightingAID@home, support naming,
discovery, communication, security, and resource
aggregation
Prepapred by AbdurRahman/AP/CSE/JCE 43
Cloud Computing over the Internet
• Computational science is changing to be data-
intensive.
• Sending the computations (programs) to the
data, rather than copying the data to the
workstations.
• Cloud Computing.
“A cloud is a pool of virtualized computer resources.
A cloud can host a variety of different workloads,
including batch-style backend jobs and interactive
and user-facing applications.” - IBM
Prepapred by AbdurRahman/AP/CSE/JCE 44
Internet Clouds
Prepapred by AbdurRahman/AP/CSE/JCE 45
Cloud service models
Prepapred by AbdurRahman/AP/CSE/JCE 46