Performance of the IBM General Parallel File System
Terry Jones, Alice Koniges, and R. Kim Yates, Lawrence Livermore National Laboratory
Presented by Michael Black, CMSC714, Fall 2005
Purpose of paper
IBM has developed GPFS. How well does it really perform? What bandwidth and scalability does it deliver? What problems remain for future development?
Presentation Outline
What is the General Parallel File System? How does it work? How well does it work?
What is the GPFS?
Parallel file system: hundreds of computers (nodes)
massive scalability!
nodes can:
run applications
manage RAID disks
each node has access to all files on all disks
Features of GPFS
Single files can span several disks on several nodes
All processes can write output to the same file
No need for hundreds of output files
Processes can write to different parts of same file at same time
High bandwidth for concurrent access
File access uses standard Unix POSIX calls
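A minimal sketch of what this buys the application: every MPI rank writes its own region of one shared file using only ordinary POSIX calls. The file path and block size below are made up for illustration; this is not code from the paper.

```c
/* Sketch: N ranks write disjoint regions of one shared GPFS file
 * using standard POSIX calls; no per-process output files needed.
 * Path and sizes are illustrative only. */
#include <fcntl.h>
#include <mpi.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);
    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    const size_t block = 1 << 20;             /* 1 MB per rank (assumed) */
    char *buf = malloc(block);
    memset(buf, 'A' + (rank % 26), block);

    /* Hypothetical GPFS mount point. */
    int fd = open("/gpfs/scratch/output.dat", O_CREAT | O_WRONLY, 0644);

    /* Each rank writes at its own offset; GPFS byte-range tokens let
     * these writes proceed concurrently. */
    pwrite(fd, buf, block, (off_t)rank * block);

    close(fd);
    free(buf);
    MPI_Finalize();
    return 0;
}
```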
Comparable File Systems
Intel PFS:
Nonstandard interface
Poor performance on concurrent access to the same file
SGI XFS:
Standard interface, good performance, but only works on shared memory architectures
Platform for GPFS
IBM's RS/6000 SP:
scales to thousands of processors
autonomous nodes, each running a Unix kernel
proprietary interconnect, the Switch:
provides 83 MB/s one-way
allows all nodes to communicate in pairs simultaneously
uniform access time
How is GPFS Implemented?
Set of services located on some nodes
services provided by mmfsd - a GPFS daemon
mmfsd provides:
file system access (for mounting GPFS)
metanode service (file permissions & attributes)
stripe group manager (info on the disks)
token manager server (synchronizes file access)
configuration manager (ensures the last two services are running)
Layers on each node:
Application (or disk)
mmfsd daemon
allows the node to mount the file system
performs reads/writes on that node's disks
Virtual Shared Disk
handles reads/writes to remote disks
contains the pagepool, a ~50 MB disk cache
allows write-behind: the application need not wait for the actual disk write
IP layer
Consistency
Maintained by the token manager server
mmfsd determines whether the node has write access to the file
if not, it must acquire a token:
gets the list of token holders from the token manager
talks to a node holding the token and tries to acquire it
tokens guard bytes in files, not whole files
allows consistent concurrent access to different parts of same file
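The byte-range idea is analogous to POSIX advisory record locks, sketched below. This is only an analogy for what a token guards, not GPFS's internal token protocol; the file name and byte range are made up.

```c
/* Analogy only: lock just the bytes this process will write, the way a
 * GPFS byte-range token covers a region rather than the whole file. */
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

int lock_range(int fd, off_t start, off_t len)
{
    struct flock fl = {
        .l_type   = F_WRLCK,    /* exclusive write lock   */
        .l_whence = SEEK_SET,
        .l_start  = start,      /* first byte of region   */
        .l_len    = len         /* region length in bytes */
    };
    return fcntl(fd, F_SETLKW, &fl);   /* block until granted */
}

int main(void)
{
    int fd = open("shared.dat", O_CREAT | O_RDWR, 0644);
    if (fd < 0) { perror("open"); return 1; }

    /* Processes locking disjoint ranges never wait on each other;
     * overlapping ranges serialize, much like token handoff. */
    if (lock_range(fd, 0, 4096) == 0)
        printf("locked bytes [0, 4096)\n");

    close(fd);
    return 0;
}
```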
Steps in remote write
1. Application calls mmfsd with a pointer to its buffer
2. mmfsd acquires the write token
3. mmfsd checks the metanode to see where the disk is
4. mmfsd copies the data to the pagepool (the application can now continue)
5. VSD copies from the pagepool to IP and breaks the data into packets
6. Data is copied through the Switch to the VSD receive buffer
7. VSD server reassembles the data into a buddy buffer
8. VSD releases the receive buffer and writes to the disk device driver
9. VSD releases the buddy buffer and sends an ack to the client
10. Client releases the pagepool
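One client-visible consequence of step 4 (write-behind via the pagepool) is that write() can return long before the data reaches a disk. A rough way to observe this is to time write() against a following fsync(); the file path and transfer size below are illustrative only.

```c
/* Sketch: write() returns once data is buffered (write-behind);
 * fsync() waits for it to actually reach the disks. */
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <time.h>
#include <unistd.h>

static double now(void)
{
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return ts.tv_sec + ts.tv_nsec * 1e-9;
}

int main(void)
{
    const size_t n = 8 << 20;                 /* 8 MB, illustrative */
    char *buf = malloc(n);
    memset(buf, 0x5a, n);

    int fd = open("/gpfs/scratch/wb.dat", O_CREAT | O_WRONLY | O_TRUNC, 0644);

    double t0 = now();
    write(fd, buf, n);                        /* may return after buffering */
    double t1 = now();
    fsync(fd);                                /* forces data out to disk    */
    double t2 = now();

    printf("write(): %.3f s  fsync(): %.3f s\n", t1 - t0, t2 - t1);
    close(fd);
    free(buf);
    return 0;
}
```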
Architecture issues with GPFS v1.2
clients can send faster than server can save to disk
exponential backoff used to slow down client
data copied twice in client
memory -> pagepool
pagepool -> IP buffer
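The slides don't show the throttling code, but the general exponential-backoff pattern they refer to looks roughly like the sketch below. The delays, retry count, and try_send() stub are assumptions for illustration, not GPFS's actual parameters or API.

```c
/* Generic exponential backoff: double the wait after each refused
 * attempt, up to a cap. Illustrates the idea only. */
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>

/* Stand-in for "hand one block to the VSD server"; here it just
 * fails randomly so the example runs on its own. */
static int try_send(const void *buf, size_t len)
{
    (void)buf; (void)len;
    return (rand() % 4 == 0) ? 0 : -1;        /* ~25% acceptance */
}

int send_with_backoff(const void *buf, size_t len)
{
    useconds_t delay = 1000;                  /* start at 1 ms (assumed)  */
    const useconds_t max_delay = 512000;      /* cap at 512 ms (assumed)  */

    for (int attempt = 0; attempt < 20; attempt++) {
        if (try_send(buf, len) == 0)
            return 0;                         /* server accepted the data */
        usleep(delay);                        /* server busy: back off    */
        if (delay < max_delay)
            delay *= 2;
    }
    return -1;                                /* give up                  */
}

int main(void)
{
    char block[4096] = {0};
    printf("result: %d\n", send_with_backoff(block, sizeof block));
    return 0;
}
```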
Analyzing performance: How much bandwidth is needed?
Rule of thumb:
At peak, 1 byte every 500 flops --> 7.3 GB/s on RS/6000
Rule of thumb:
Store half of memory every hour on average
Should ideally take 5 minutes --> 4.4 GB/s on RS/6000
Also must take varying I/O access patterns and reliability into account
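As a rough sanity check, working backward from the two quoted targets (the machine's peak flop rate and memory size are inferred from the slide's own numbers, not stated on it):

\[
F_{\text{peak}} \approx 500~\tfrac{\text{flops}}{\text{byte}} \times 7.3~\tfrac{\text{GB}}{\text{s}} \approx 3.7~\text{Tflop/s},
\qquad
M \approx 2 \times 4.4~\tfrac{\text{GB}}{\text{s}} \times 300~\text{s} \approx 2.6~\text{TB}
\]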
Experimental objectives
Looking at multiple tasks writing to 1 file: How does throughput vary based on:
# of clients
amount of data being transferred
How does GPFS scale? How do I/O access characteristics affect performance?
large sequential writes to the same file
small interleaved writes to the same file
Methodology
Benchmark: ileave_or_random
written in C, uses MPI
Measuring throughput:
time for all processes to write fixed amount of data to single file
Effect of I/O access patterns:
the benchmark can be configured for highly sequential (segmented) or highly interleaved (strided) access patterns
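A hedged sketch of the two access patterns the benchmark exercises; the actual ileave_or_random source is not reproduced in the slides, and the block size, block count, and file name below are assumptions. MPI is used only for coordination and timing; the writes themselves use POSIX calls, matching the slides' description.

```c
/* Sketch: P ranks each write nblocks blocks of bsize bytes into one
 * shared file, either segmented (each rank owns a contiguous slab)
 * or strided (blocks from different ranks interleave). */
#include <fcntl.h>
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);
    int rank, nprocs;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

    const size_t bsize = 256 * 1024;        /* 256 KB blocks (assumed)   */
    const int nblocks  = 64;                /* blocks per rank (assumed) */
    const int strided  = (argc > 1);        /* any argument => strided   */

    char *buf = malloc(bsize);
    memset(buf, 'A' + rank % 26, bsize);

    int fd = open("bench.dat", O_CREAT | O_WRONLY, 0644);

    MPI_Barrier(MPI_COMM_WORLD);
    double t0 = MPI_Wtime();
    for (int b = 0; b < nblocks; b++) {
        /* segmented: rank's blocks are contiguous in the file;
         * strided: blocks from different ranks interleave.       */
        off_t off = strided
            ? ((off_t)b * nprocs + rank) * (off_t)bsize
            : ((off_t)rank * nblocks + b) * (off_t)bsize;
        pwrite(fd, buf, bsize, off);
    }
    close(fd);
    MPI_Barrier(MPI_COMM_WORLD);
    double elapsed = MPI_Wtime() - t0;

    if (rank == 0) {
        double gb = (double)bsize * nblocks * nprocs / 1e9;
        printf("%s: %.2f GB in %.2f s = %.2f GB/s\n",
               strided ? "strided" : "segmented", gb, elapsed, gb / elapsed);
    }
    free(buf);
    MPI_Finalize();
    return 0;
}
```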
Test: Vary client:server ratio
Optimal client:server ratio is 4:1
why? when too low, the server starves; when too high, the server's buffers overflow
Test: Vary transfer block size
block size makes no difference
# of clients makes no difference up to 256
Test: Round-robin file access
GPFS performs poorly with strided access if block size < 256 KB
reflects token-management overhead
Test: Scalability with constant client:server ratio
Linear up to 58 servers
Conclusions
Good throughput as long as the client:server ratio stays below about 6:1
could be increased if data flow is improved
Programs must use segmented access patterns
suggested improvement: allow token management to be turned off (user must manage consistency)