0% found this document useful (0 votes)

16 views61 pages

22 Clusters Slides

The document discusses the design and implementation of distributed systems, focusing on clustering for high availability and scalability. It covers various clustering types, performance computing, batch processing, load balancing, and high availability strategies, including different RAID configurations for fault tolerance. Additionally, it highlights tools and technologies such as MPI, PVM, and specific clustering distributions like Beowulf and Rocks Cluster.

Uploaded by

Bommireddy Rambabu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views61 pages

22 Clusters Slides

Uploaded by

Bommireddy Rambabu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 61

Distributed Systems

Clusters

Paul Krzyzanowski
[email protected]

Except as otherwise noted, the content of this presentation is licensed under the Creative Commons
Attribution 2.5 License.
Designing highly available systems
Incorporate elements of fault-tolerant design
– Replication, TMR

Fully fault tolerant system will offer

non-stop availability
– You can’t achieve this!

Problem: expensive!
Designing highly scalable systems
SMP architecture

Problem:
performance gain as f(# processors) is sublinear
– Contention for resources (bus, memory, devices)
– Also … the solution is expensive!
Clustering
Achieve reliability and scalability by
interconnecting multiple independent systems

Cluster: group of standard, autonomous servers

configured so they appear on the network as a
single machine

approach single system image

Ideally…
• Bunch of off-the shelf machines
• Interconnected on a high speed LAN
• Appear as one system to external users
• Processors are load-balanced
– May migrate
– May run on different systems
– All IPC mechanisms and file access available
• Fault tolerant
– Components may fail
– Machines may be taken down
we don’t get all that (yet)

(at least not in one package)

Clustering types
• Supercomputing (HPC)
• Batch processing
• High availability (HA)
• Load balancing
High Performance Computing
(HPC)
The evolution of supercomputers
• Target complex applications:
– Large amounts of data
– Lots of computation
– Parallelizable application

• Many custom efforts

– Typically Linux + message passing software
+ remote exec + remote monitoring
Clustering for performance
Example: One popular effort
– Beowulf
• Initially built to address problems associated with
large data sets in Earth and Space Science
applications
• From Center of Excellence in Space Data &
Information Sciences (CESDIS), division of
University Space Research Association at the
Goddard Space Flight Center
What makes it possible
• Commodity off-the-shelf computers are cost
effective
• Publicly available software:
– Linux, GNU compilers & tools
– MPI (message passing interface)
– PVM (parallel virtual machine)

• Low cost, high speed networking

• Experience with parallel software
– Difficult: solutions tend to be custom
What can you run?
• Programs that do not require fine-grain
communication

• Nodes are dedicated to the cluster

– Performance of nodes not subject to external factors

• Interconnect network isolated from external network

– Network load is determined only by application

• Global process ID provided

– Global signaling mechanism
Beowulf configuration
Includes:
– BPROC: Beowulf distributed process space
• Start processes on other machines
• Global process ID, global signaling

– Network device drivers

• Channel bonding, scalable I/O

– File system (file sharing is generally not critical)

• NFS root
• unsynchronized
• synchronized periodically via rsync
Programming tools: MPI
• Message Passing Interface

• API for sending/receiving messages

– Optimizations for shared memory & NUMA
– Group communication support

• Other features:
– Scalable file I/O
– Dynamic process management
– Synchronization (barriers)
– Combining results
Programming tools: PVM
• Software that emulates a general-purpose
heterogeneous computing framework on
interconnected computers

• Present a view of virtual processing elements

– Create tasks
– Use global task IDs
– Manage groups of tasks
– Basic message passing
Beowulf programming tools
• PVM and MPI libraries
• Distributed shared memory
– Page based: software-enforced ownership and consistency
policy
• Cluster monitor
• Global ps, top, uptime tools

• Process management
– Batch system
– Write software to control synchronization and load balancing
with MPI and/or PVM
– Preemptive distributed scheduling: not part of Beowulf (two
packages: Condor and Mosix)
Another example
• Rocks Cluster Distribution
– Based on CentOS Linux

– Mass installation is a core part of the system

• Mass re-installation for application-specific configurations

– Front-end central server + compute & storage nodes

– Rolls: collection of packages

• Base roll includes: PBS (portable batch system), PVM (parallel
virtual machine), MPI (message passing interface), job
launchers, …
Another example
• Microsoft HPC Server 2008
– Windows Server 2008 + clustering package
– Systems Management
• Management Console: plug-in to System Center UI with support for
Windows PowerShell
• RIS (Remote Installation Service)
– Networking
• MS-MPI (Message Passing Interface)
• ICS (Internet Connection Sharing) : NAT for cluster nodes
• Network Direct RDMA (Remote DMA)
– Job scheduler
– Storage: iSCSI SAN and SMB support
– Failover support
Batch Processing
Batch processing
• Common application: graphics rendering
– Maintain a queue of frames to be rendered
– Have a dispatcher to remotely exec process

• Virtually no IPC needed

• Coordinator dispatches jobs

Single-queue work distribution
Render Farms:
Pixar:
• 1,024 2.8 GHz Xeon processors running Linux and Renderman
• 2 TB RAM, 60 TB disk space
• Custom Linux software for articulating, animating/lighting (Marionette),
scheduling (Ringmaster), and rendering (RenderMan)
• Cars: each frame took 8 hours to Render. Consumes ~32 GB storage on a
SAN

DreamWorks:
• >3,000 servers and >1,000 Linux desktops
HP xw9300 workstations and HP DL145 G2 servers with 8 GB/server
• Shrek 3: 20 million CPU render hours. Platform LSF used for scheduling +
Maya for modeling + Avid for editing+ Python for pipelining – movie uses
24 TB storage
Single-queue work distribution
Render Farms:
– ILM:
• 3,000 processor (AMD) renderfarm; expands to 5,000 by harnessing
desktop machines
• 20 Linux-based SpinServer NAS storage systems and 3,000 disks from
Network Appliance
• 10 Gbps ethernet

– Sony Pictures’ Imageworks:

• Over 1,200 processors
• Dell and IBM workstations
• almost 70 TB data for Polar Express
Batch Processing
OpenPBS.org:
– Portable Batch System
– Developed by Veridian MRJ for NASA

• Commands
– Submit job scripts
• Submit interactive jobs
• Force a job to run
– List jobs
– Delete jobs
– Hold jobs
Load Balancing
for the web
Functions of a load balancer

Load balancing

Failover

Planned outage management

Redirection
Simplest technique
HTTP REDIRECT error code
Redirection
Simplest technique
HTTP REDIRECT error code

www.mysite.com
Redirection
Simplest technique
HTTP REDIRECT error code

www.mysite.com

REDIRECT
www03.mysite.com
Redirection
Simplest technique
HTTP REDIRECT error code

www03.mysite.com
Redirection
• Trivial to implement

• Successive requests automatically go to the

same web server
– Important for sessions

• Visible to customer
– Some don’t like it

• Bookmarks will usually tag a specific site

Software load balancer
e.g.: IBM Interactive Network Dispatcher Software

Forwards request via load balancing

– Leaves original source address
– Load balancer not in path of outgoing traffic (high
bandwidth)
– Kernel extensions for routing TCP and UDP
requests
• Each client accepts connections on its own address and
dispatcher’s address
• Dispatcher changes MAC address of packets.
Software load balancer

www.mysite.com
Software load balancer
src=bobby, dest=www03

www.mysite.com

response
Load balancing router
Routers have been getting smarter
– Most support packet filtering
– Add load balancing

Cisco LocalDirector, Altheon, F5 Big-IP

Load balancing router
• Assign one or more virtual addresses to physical
address
– Incoming request gets mapped to physical address

• Special assignments can be made per port

– e.g. all FTP traffic goes to one machine

Balancing decisions:
– Pick machine with least # TCP connections
– Factor in weights when selecting machines
– Pick machines round-robin
– Pick fastest connecting machine (SYN/ACK time)
High Availability
(HA)
High availability (HA)
Annual
Class Level Downtime
Continuous 100% 0

Six nines 99.9999% 30 seconds

(carrier class switches)

Fault Tolerant 99.999% 5 minutes

(carrier-class servers)

Fault Resilient 99.99% 53 minutes

High Availability 99.9% 8.3 hours

Normal 99-99.5% 44-87 hours

availability
Clustering: high availability
Fault tolerant design
Stratus, NEC, Marathon technologies
– Applications run uninterrupted on a redundant subsystem
• NEC and Stratus has applications running in lockstep
synchronization
– Two identical connected systems
– If one server fails, other takes over instantly

Costly and inefficient

– But does what it was designed to do
Clustering: high availability
• Availability addressed by many:
– Sun, IBM, HP, Microsoft, SteelEye Lifekeeper, …

• If one server fails

– Fault is isolated to that node
– Workload spread over surviving nodes
– Allows scheduled maintenance without disruption
– Nodes may need to take over IP addresses
Example: Windows Server 2003 clustering
• Network load balancing
– Address web-server bottlenecks

• Component load balancing

– Scale middle-tier software (COM objects)

• Failover support for applications

– 8-node failover clusters
– Applications restarted on surviving node
– Shared disk configuration using SCSI or fibre channel
– Resource group: {disk drive, IP address, network
name, service} can be moved during failover
Example: Windows Server 2003 clustering
Top tier: cluster abstractions
– Failover manager, resource monitor, cluster
registry

Middle tier: distributed operations

– Global status update, quorum (keeps track
of who’s in charge), membership

Bottom tier: OS and drivers

– Cluster disk driver, cluster network drivers
– IP address takeover
Clusters

Architectural models
HA issues
How do you detect failover?
How long does it take to detect?
How does a dead application move/restart?
Where does it move to?
Heartbeat network
• Machines need to detect faulty systems
– “ping” mechanism

• Need to distinguish system faults from network faults

– Useful to maintain redundant networks
– Send a periodic heartbeat to test a machine’s liveness
– Watch out for split-brain!

• Ideally, use a network with a bounded response time

– Lucent RCC used a serial line interconnect
– Microsoft Cluster Server supports a dedicated “private
network”
• Two network cards connected with a pass-through cable or hub
Failover Configuration Models
Active/Passive (N+M nodes)
– M dedicated failover node(s) for N active nodes

Active/Active
– Failed workload goes to remaining nodes
Design options for failover
Cold failover
– Application restart
Warm failover
– Application checkpoints itself periodically
– Restart last checkpointed image
– May use writeahead log (tricky)
Hot failover
– Application state is lockstep synchronized
– Very difficult, expensive (resources), prone
to software faults
Design options for failover
With either type of failover …

Multi-directional failover
– Failed applications migrate to / restart on
available systems

Cascading failover
– If the backup system fails, application can
be restarted on another surviving system
System support for HA
• Hot-pluggable devices
– Minimize downtime for component swapping

• Redundant devices
– Redundant power supplies
– Parity on memory
– Mirroring on disks (or RAID for HA)
– Switchover of failed components

• Diagnostics
– On-line serviceability
Shared resources (disk)
Shared disk
– Allows multiple systems to share access to
disk drives

– Works well if applications do not generate

much disk I/O

– Disk access must be synchronized

Synchronization via a distributed lock manager
(DLM)
Shared resources (disk)
Shared nothing
– No shared devices

– Each system has its own storage resources

– No need to deal with DLMs

– If a machine A needs resources on B, A

sends a message to B
• If B fails, storage requests have to be switched
over to a live node
Cluster interconnects
Traditional WANs and LANs may be slow as cluster interconnect
– Connecting server nodes, storage nodes, I/O channels, even
memory pages

– Storage Area Network (SAN)

• Fibre channel connectivity to external storage devices
• Any node can be configured to access any storage through a fibre
channel switch

– System Area Network (SAN)

• Switched interconnect to switch cluster resources
• Low-latency I/O without processor intervention
• Scalable switching fabric
• (Compaq, Tandem’s ServerNet)
• Microsoft Windows 2000 supports Winsock Direct for SAN
communication
Achieving High Availability

switch A heartbeat switch B

Local Area
Networks
Server A heartbeat 2 Server B

Fibre Fibre Storage Area

channel channel Network
heartbeat 3
switch switch

Fabric A Fabric B
Achieving High Availability

Switch A heartbeat switch B

Local Area
Networks
Server A heartbeat 2 Server B

Ethernet Ethernet
Storage Area
switch A’ heartbeat 3 switch B’ Network
(iSCSI)
ethernet A ethernet B
HA Storage: RAID
Redundant Array of Independent (Inexpensive)
Disks
RAID 0: Performance
Striping
• Advantages:
– Performance
– All storage capacity can be used
• Disadvantage:
– Not fault tolerant
RAID 1: HA
Mirroring
• Advantages:
– Double read speed
– No rebuild necessary if a disk fails: just copy
• Disadvantage:
– Only half the
space
RAID 3: HA
Separate parity disk
• Advantages:
– Very fast reads
– High efficiency: low ratio of parity/data
• Disadvantages:
– Slow random
I/O performance
– Only one I/O
at a time
RAID 5
Interleaved parity
• Advantages:
– Very fast reads
– High efficiency: low ratio of parity/data
• Disadvantage:
– Slower writes
– Complex
controller
RAID 1+0
Combine mirroring and striping
– Striping across a set of disks
– Mirroring of the entire set onto another set
The end

AWS Certified Security Specialty Master Cheat Sheet
100% (1)
AWS Certified Security Specialty Master Cheat Sheet
159 pages
Cluster Computing
No ratings yet
Cluster Computing
23 pages
SC-200 Microsoft Security Operations Analyst
100% (1)
SC-200 Microsoft Security Operations Analyst
83 pages
Vcs and Oracle Ha
No ratings yet
Vcs and Oracle Ha
168 pages
Cluster Computing
100% (6)
Cluster Computing
28 pages
Operting System Book
100% (2)
Operting System Book
49 pages
How To Become Cloud Engineer: Cloud Computing 101 AWS Services AWS Career Guidance AWS Study Resources
No ratings yet
How To Become Cloud Engineer: Cloud Computing 101 AWS Services AWS Career Guidance AWS Study Resources
39 pages
CS9211-Computer Architecture Question
No ratings yet
CS9211-Computer Architecture Question
7 pages
CC Module-1, 2& 3 Questions
No ratings yet
CC Module-1, 2& 3 Questions
4 pages
Cluster Computing: Definition and Architecture of A Cluster
No ratings yet
Cluster Computing: Definition and Architecture of A Cluster
7 pages
Cluster 2
No ratings yet
Cluster 2
26 pages
Presented By: Veena.K.P Mca S5 Roll No:28
No ratings yet
Presented By: Veena.K.P Mca S5 Roll No:28
35 pages
Cluster Computing: Definition and Architecture of A Cluster: Pravin Ganore Comments
No ratings yet
Cluster Computing: Definition and Architecture of A Cluster: Pravin Ganore Comments
6 pages
Cluster Computing: The Promise of Supercomputing To The Average PC User ?
No ratings yet
Cluster Computing: The Promise of Supercomputing To The Average PC User ?
57 pages
RT2021 Chap5
100% (1)
RT2021 Chap5
34 pages
Parallel and Cluster Computing
No ratings yet
Parallel and Cluster Computing
31 pages
AWS Certified SysOps Administrator Associate - Sample Questions
No ratings yet
AWS Certified SysOps Administrator Associate - Sample Questions
11 pages
Unit 2. Distributed Os and Issue
No ratings yet
Unit 2. Distributed Os and Issue
1 page
Introduction To Distributed Systems
No ratings yet
Introduction To Distributed Systems
36 pages
Low Cost Supercomputing: Parallel Processing On Linux Clusters
No ratings yet
Low Cost Supercomputing: Parallel Processing On Linux Clusters
43 pages
Low Cost Supercomputing: Parallel Processing On Linux Clusters
No ratings yet
Low Cost Supercomputing: Parallel Processing On Linux Clusters
43 pages
Cluster and Grid Computing
No ratings yet
Cluster and Grid Computing
37 pages
CLUSTER COMPUTING Report
No ratings yet
CLUSTER COMPUTING Report
15 pages
Clustering Documentation
No ratings yet
Clustering Documentation
15 pages
What Are Clusters?: High Cost of Traditional' High Performance Computing
No ratings yet
What Are Clusters?: High Cost of Traditional' High Performance Computing
24 pages
Client/Server Computing: Operating Systems: Internals and Design Principles, 6/E
No ratings yet
Client/Server Computing: Operating Systems: Internals and Design Principles, 6/E
79 pages
Chapter 2 - Operating Systems
No ratings yet
Chapter 2 - Operating Systems
12 pages
A Comparison of Provisioning Systems For Beowulf Clusters
No ratings yet
A Comparison of Provisioning Systems For Beowulf Clusters
10 pages
Distributed Systems Introduction
No ratings yet
Distributed Systems Introduction
21 pages
Cluster Vs Distributed
No ratings yet
Cluster Vs Distributed
2 pages
G. B. Pant of Institute & Technology: Comparison of Parallel Processing Via HPC Cluster Vs Non Parallel Processor
No ratings yet
G. B. Pant of Institute & Technology: Comparison of Parallel Processing Via HPC Cluster Vs Non Parallel Processor
22 pages
Advance Computing Technology (170704)
No ratings yet
Advance Computing Technology (170704)
106 pages
Cluster Computing4
No ratings yet
Cluster Computing4
43 pages
Rehan Khan Roll No - 38
No ratings yet
Rehan Khan Roll No - 38
23 pages
Cluster Computing
No ratings yet
Cluster Computing
57 pages
CCunit 1
No ratings yet
CCunit 1
54 pages
CS4513 Distributed Computer Systems
No ratings yet
CS4513 Distributed Computer Systems
32 pages
Cluster
No ratings yet
Cluster
55 pages
Distributed CS571
No ratings yet
Distributed CS571
36 pages
ECommerce Infrastructure
No ratings yet
ECommerce Infrastructure
33 pages
Amity University, Rajasthan: " Cluster Computing "
No ratings yet
Amity University, Rajasthan: " Cluster Computing "
3 pages
04 - Computer Clusters
No ratings yet
04 - Computer Clusters
66 pages
Unit-Ii PPT
No ratings yet
Unit-Ii PPT
43 pages
Cluster Computing: A Paper Presentation On
No ratings yet
Cluster Computing: A Paper Presentation On
16 pages
Cluster Computer
No ratings yet
Cluster Computer
22 pages
Classification of Distributed Computing Systems
No ratings yet
Classification of Distributed Computing Systems
14 pages
UECS3223-Lecture 1-Server Technology
No ratings yet
UECS3223-Lecture 1-Server Technology
67 pages
CC Unit 1
No ratings yet
CC Unit 1
18 pages
Chapter One
No ratings yet
Chapter One
42 pages
MCA Cluster Computing
No ratings yet
MCA Cluster Computing
24 pages
1 Cluster Computing
No ratings yet
1 Cluster Computing
42 pages
Cluster Computing: DATE: 28 November 2013
No ratings yet
Cluster Computing: DATE: 28 November 2013
32 pages
Vcs and Oracle Ha
No ratings yet
Vcs and Oracle Ha
167 pages
Lecture Clusters PDF
No ratings yet
Lecture Clusters PDF
168 pages
Beowulf Cluster
No ratings yet
Beowulf Cluster
60 pages
Tema 1
No ratings yet
Tema 1
59 pages
Lecture 2 On Distributed Systems
No ratings yet
Lecture 2 On Distributed Systems
45 pages
ADSU1VFTVF25
No ratings yet
ADSU1VFTVF25
118 pages
Distributed System
No ratings yet
Distributed System
62 pages
Building A Supercomputer
No ratings yet
Building A Supercomputer
34 pages
Chapter 2 - Operating Systems
No ratings yet
Chapter 2 - Operating Systems
12 pages
Ira Pramanick
No ratings yet
Ira Pramanick
24 pages
CC Notes
No ratings yet
CC Notes
30 pages
Clustering Tech Overview
No ratings yet
Clustering Tech Overview
48 pages
OS Assignment-1 - 22-23
No ratings yet
OS Assignment-1 - 22-23
12 pages
Byte Python Concurrent and Parallel Programming V2
No ratings yet
Byte Python Concurrent and Parallel Programming V2
38 pages
POSIX Threads Explained by Daniel Robbins
No ratings yet
POSIX Threads Explained by Daniel Robbins
9 pages
OS Lab Manual PPP
No ratings yet
OS Lab Manual PPP
41 pages
Concurrent Process and Programming: Processs and Threads Processes
No ratings yet
Concurrent Process and Programming: Processs and Threads Processes
11 pages
CS609 Quiz 3 Merged File Learning With Me
No ratings yet
CS609 Quiz 3 Merged File Learning With Me
126 pages
Software Design of Real-Time Systems
No ratings yet
Software Design of Real-Time Systems
57 pages
Concurrent Processes
No ratings yet
Concurrent Processes
52 pages
2 Param
No ratings yet
2 Param
7 pages
Final OS Lab Manual 2021-22 (Winter)
No ratings yet
Final OS Lab Manual 2021-22 (Winter)
40 pages
Dronacharya College of Engg Gr. Noida Operating System Subject Code: TCS-601 Cse/It Vi
No ratings yet
Dronacharya College of Engg Gr. Noida Operating System Subject Code: TCS-601 Cse/It Vi
4 pages
Programming Assignment 1 2
No ratings yet
Programming Assignment 1 2
7 pages
OS Syllabus
No ratings yet
OS Syllabus
3 pages
Difference Between Multitasking Multithreading and Multi20difference20between20multitasking20multithreading20and20multiprocessing
No ratings yet
Difference Between Multitasking Multithreading and Multi20difference20between20multitasking20multithreading20and20multiprocessing
3 pages
Slide 02
No ratings yet
Slide 02
101 pages
DPDK Locks Optimizations and New Locks APIs
No ratings yet
DPDK Locks Optimizations and New Locks APIs
16 pages
0) Unit 2 Master
No ratings yet
0) Unit 2 Master
26 pages
Lab 1
No ratings yet
Lab 1
18 pages
Module 3 - CPU Scheduling
No ratings yet
Module 3 - CPU Scheduling
42 pages
Pipelining With Numerical
No ratings yet
Pipelining With Numerical
55 pages
OpenMP Matrix
No ratings yet
OpenMP Matrix
6 pages
Unit 5
No ratings yet
Unit 5
29 pages
Recon2015 06 Sophia D Antoine Exploiting Out of Order Execution
No ratings yet
Recon2015 06 Sophia D Antoine Exploiting Out of Order Execution
46 pages
Aos 6
No ratings yet
Aos 6
36 pages
PAR Final Lab Sol 2023 24Q1
No ratings yet
PAR Final Lab Sol 2023 24Q1
3 pages
Design Patterns Concurrency Pattern
No ratings yet
Design Patterns Concurrency Pattern
11 pages
PDC Assignment 3
No ratings yet
PDC Assignment 3
3 pages
All My IT Tech Posts
From Everand
All My IT Tech Posts
Stephen Edwards
No ratings yet
Computer Science: Learn about Algorithms, Cybersecurity, Databases, Operating Systems, and Web Design
From Everand
Computer Science: Learn about Algorithms, Cybersecurity, Databases, Operating Systems, and Web Design
Jonathan Rigdon
No ratings yet
Information Technology HandBook
From Everand
Information Technology HandBook
Duong Tran
3/5 (1)

22 Clusters Slides

Uploaded by

22 Clusters Slides

Uploaded by

Distributed Systems

Fully fault tolerant system will offer

Cluster: group of standard, autonomous servers

approach single system image

(at least not in one package)

• Many custom efforts

• Low cost, high speed networking

• Nodes are dedicated to the cluster

• Interconnect network isolated from external network

• Global process ID provided

– Network device drivers

– File system (file sharing is generally not critical)

• API for sending/receiving messages

• Present a view of virtual processing elements

– Mass installation is a core part of the system

– Front-end central server + compute & storage nodes

– Rolls: collection of packages

• Virtually no IPC needed

• Coordinator dispatches jobs

– Sony Pictures’ Imageworks:

Planned outage management

• Successive requests automatically go to the

• Bookmarks will usually tag a specific site

Forwards request via load balancing

Cisco LocalDirector, Altheon, F5 Big-IP

• Special assignments can be made per port

Six nines 99.9999% 30 seconds

Fault Tolerant 99.999% 5 minutes

Fault Resilient 99.99% 53 minutes

High Availability 99.9% 8.3 hours

Normal 99-99.5% 44-87 hours

Costly and inefficient

• If one server fails

• Component load balancing

• Failover support for applications

Middle tier: distributed operations

Bottom tier: OS and drivers

• Need to distinguish system faults from network faults

• Ideally, use a network with a bounded response time

– Works well if applications do not generate

– Disk access must be synchronized

– Each system has its own storage resources

– No need to deal with DLMs

– If a machine A needs resources on B, A

– Storage Area Network (SAN)

– System Area Network (SAN)

switch A heartbeat switch B

Fibre Fibre Storage Area

Switch A heartbeat switch B

You might also like