What Is GPFS
What Is GPFS
GPFS is available on AIX since 1998 (on Linux since 2001, on Windows since 2008).
It provides concurrent high-speed file access on multiple nodes of cluster. In 2015
IBM rebranded GPFS as IBM Spectrum Scale.
----------------------------------------------
Cluster:
A Spectrum Scale cluster is formed by a collection of nodes that share access to
the file systems defined in the cluster. mmcrcluster, mmstatup -a, mmshutdown -a
Node:
A node is any server that has the Spectrum Scale product installed on a physical
machine, or on a virtual machine. Mmaddnode ,mmdelnode, mmlsnode
Quorum nodes:
During cluster creation some nodes can be designated as quorum nodes. Maintaining
quorum in a GPFS cluster means that a majority of the nodes designated as quorum
nodes are able to successfully communicate. In a three quorum node configuration
two nodes have to be communicating for cluster operations to continue. When one
node is isolated by a network failure, it stops all file system operations until
communications are restored, so no data is corrupted by a lack of coordination.
(The exact calculation of quorum is, one plus half of the defined quorum nodes.)
(show quorum nodes: mmgetstate)
Cluster manager:
The cluster manager node monitors disk lease expiration, state of the quorum,
detects failures and starts recovery. In overall it has the responsibility for
correct operation of the nodes and the cluster. It is chosen through an election
held among the quorum nodes. (Starting in GPFS 3.5 it is possible to define by
command as well.) (show cluster manager: mmlsmgr)
Storage pool
It ss a collection of NSDs and with this featue some disks can be grouped together
(for example based on type of storage, or vendor...) (check storage pool: mmlspool)
Failure group
In GPFS you can replicate (mirror) any files or the entire file system. A
replication factor of two in GPFS means that each block of a replicated file is in
at least two failure groups. A failure group is defined by the administrator and
contains one or more NSDs. Each file system can contain one or more failure groups
which are defined by the administrator and can be changed at any time. So when a
file system is fully replicated any single failure group can fail and the data
remains online.
(check failure groups: mmlsdisk)