CC - Unit-5

This chapter discusses storage systems for big data in cloud computing. It covers the evolution of storage technologies and increasing data volumes. Key storage concepts reviewed include data and storage models, database management systems, and file systems. Distributed file systems like Network File System (NFS) and parallel file systems (PFS) are described. Requirements of cloud applications in terms of scalability, availability and consistency are also summarized.


Chapter 8 – Storage Systems

Contents
 Big data.
 Evolution of storage systems.
 Storage and data models.
 Database management systems.
 Network File System.
 General Parallel File System.
 Google File System.
 Apache Hadoop.
 Chubby.
 Online transaction processing.
 NoSQL databases.
 Bigtable.
 Megastore.

Dan C. Marinescu Cloud Computing: Theory and Practice. Chapter 8 2


Data storage on a cloud
 Storage and processing on the cloud are intimately tied to one another.
 Most cloud applications process very large amounts of data. Effective data
replication and storage management strategies are critical to the
computations performed on the cloud.
 Strategies to reduce the access time and to support real-time multimedia
access are necessary to satisfy the requirements of content delivery.
 Sensors feed a continuous stream of data to cloud applications.
 An ever-increasing number of cloud-based services collect detailed data
about their services and information about the users of these services.
The service providers use the clouds to analyze the data.
 Humongous amounts of data - in 2013:
 Internet video will generate over 18 EB/month.
 Global mobile data traffic will reach 2 EB/month.
(1 EB = 10^18 bytes, 1 PB = 10^15 bytes, 1 TB = 10^12 bytes, 1 GB = 10^9 bytes)

Big data
 New concept  reflects the fact that many applications use data
sets that cannot be stored and processed using local resources.
 Applications in genomics, structural biology, high energy physics,
astronomy, meteorology, and the study of the environment carry
out complex analysis of data sets often of the order of TBs
(terabytes). Examples:
 In 2010, the four main detectors at the Large Hadron Collider (LHC)
produced 13 PB of data.
 The Sloan Digital Sky Survey (SDSS) collects about 200 GB of data
per night.
 Big data is a three-dimensional phenomenon (volume, velocity, variety):
 Increased volume of data.
 Requires increased processing speed to process more data and
produce more results.
 Involves a diversity of data sources and data types.

Evolution of storage technology
 The capacity to store information, in units of 730 MB (1 CD-ROM):
 1986 - 2.6 EB  less than 1 CD-ROM/person.
 1993 - 15.8 EB  4 CD-ROM/person.
 2000 - 54.5 EB  12 CD-ROM/person.
 2007 - 295.0 EB  61 CD-ROM/person.
 Hard disk drives (HDD) - during the 1980-2003 period:
 Storage density has increased by four orders of magnitude, from about
0.01 Gb/in^2 to about 100 Gb/in^2.
 Prices have fallen by five orders of magnitude, to about 1 cent/MB.
 HDD densities are projected to climb to 1,800 Gb/in^2 by 2016, up from 744
Gb/in^2 in 2011.
 Dynamic Random Access Memory (DRAM) - during the period 1990-2003:
 The density increased from about 1 Gb/in^2 in 1990 to 100 Gb/in^2.
 The cost has tumbled from about $80/MB to less than $1/MB.

Storage and data models
A storage model  describes the layout of a data structure in
physical storage - a local disk, removable media, or storage
accessible via the network.
 A data model  captures the most important logical aspects of a
data structure in a database.
 Two abstract models of storage are used.
 Cell storage  assumes that the storage consists of cells of the same
size and that each object fits exactly in one cell. This model reflects the
physical organization of several storage media; the primary memory of a
computer is organized as an array of memory cells and a secondary
storage device, e.g., a disk, is organized in sectors or blocks read and
written as a unit.
 Journal storage  system that keeps track of the changes that will be
made in a journal (usually a circular log in a dedicated area of the file
system) before committing them to the main file system. In the event of a
system crash or power failure, such file systems are quicker to bring
back online and less likely to become corrupted.
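The journaling discipline above - record the change first, apply it second, replay after a crash - can be sketched as follows; this is a minimal illustration, with in-memory structures standing in for the on-disk circular log and file system:

```python
# Sketch of journal storage: every change is recorded in a journal before
# it is committed to the main store, so after a crash the journal can be
# replayed to finish (or redo) interrupted updates. In-memory stand-ins
# replace the on-disk circular log and file system.

class JournaledStore:
    def __init__(self):
        self.log = []      # the journal
        self.store = {}    # the main file system state

    def write(self, key, value):
        self.log.append((key, value))   # 1. log the intended change
        self.store[key] = value         # 2. commit it to the main store
        self.log.clear()                # 3. truncate the journal afterwards

    def recover(self, crashed_log):
        # Replay journal entries found after a crash; redoing an entry
        # that was already applied is harmless (the write is idempotent).
        for key, value in crashed_log:
            self.store[key] = value

store = JournaledStore()
store.write("a", 1)
store.recover([("b", 2)])   # simulate replaying a log left by a crash
print(store.store)          # {'a': 1, 'b': 2}
```

Because the journal entry always reaches stable storage before the main update, recovery never has to guess which updates were in flight.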
Figure: on the left, a Write of item A to memory cell M followed by a Read of A from M along a timeline illustrates read/write coherence; on the right, a current Read/Write ordered on the timeline relative to the previous and next Read/Write illustrates before-or-after atomicity.
 Read/Write coherence  the result of a Read of memory cell M should be
the same as the most recent Write to that cell.
 Before-or-after atomicity  the result of every Read or Write is the same
as if that Read or Write occurred either completely before or completely
after any other Read or Write.
Read/write coherence and before-or-after atomicity are two highly desirable
properties of any storage model and in particular of cell storage.

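Before-or-after atomicity can be sketched with a lock that serializes access to a cell; a minimal illustration (not from the text, names illustrative):

```python
import threading

# Sketch: a memory cell whose Read/Write operations are made atomic with a
# lock, so every operation appears to occur entirely before or entirely
# after any other (before-or-after atomicity). Read/write coherence then
# follows: a read returns the most recently written value.

class Cell:
    def __init__(self, value=None):
        self._value = value
        self._lock = threading.Lock()

    def write(self, value):
        with self._lock:
            self._value = value

    def read(self):
        with self._lock:
            return self._value

m = Cell()
m.write("A")
print(m.read())  # A
```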
Data Base Management System (DBMS)
 Database  a collection of logically-related records.
 Data Base Management System (DBMS)  the software that
controls the access to the database.
 Query language  a dedicated programming language used to
develop database applications.
 Most cloud applications do not interact directly with file systems,
but through a DBMS.
 Database models  reflect the limitations of the hardware available
at the time and the requirements of the most popular applications of
each period.
 navigational model of the 1960s.
 relational model of the 1970s.
 object-oriented model of the 1980s.
 NoSQL model of the first decade of the 2000s.

Requirements of cloud applications
 Most cloud applications are data-intensive and test the limitations of
the existing infrastructure. Requirements:
 Rapid application development and short time to market.
 Low latency.
 Scalability.
 High availability.
 Consistent view of the data.
 These requirements cannot be satisfied simultaneously by existing
database models; e.g., relational databases are easy to use for
application development but do not scale well.
 The NoSQL model is useful when the structure of the data does not
require a relational model and the amount of data is very large.
 Does not support SQL as a query language.
 May not guarantee the ACID (Atomicity, Consistency, Isolation, Durability)
properties of traditional databases; it usually guarantees the eventual
consistency for transactions limited to a single data item.
Logical and physical organization of a file
 File  a linear array of cells stored on a persistent storage device.
Viewed by an application as a collection of logical records; the file is
stored on a physical device as a set of physical records, or blocks,
of size dictated by the physical media.
 File pointer identifies a cell used as a starting point for a read or
write operation.
 The logical organization of a file  reflects the data model, the view
of the data from the perspective of the application.
 The physical organization of a file  reflects the storage model and
describes the manner in which the file is stored on a given storage medium.
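The mapping from the logical view (a file pointer, i.e., a byte offset) to the physical view (a block on the device) is simple arithmetic; a minimal sketch, with an illustrative 4 KB block size:

```python
# Sketch: translate a logical file offset (the file pointer) into the
# physical record (block) holding it. The block size is dictated by the
# physical media; 4096 bytes is an illustrative choice.

BLOCK_SIZE = 4096

def block_of(offset):
    """Return (block number, byte offset within that block)."""
    return offset // BLOCK_SIZE, offset % BLOCK_SIZE

print(block_of(10_000))  # (2, 1808): third block, byte 1808 within it
```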

File systems
 File system  collection of directories; each directory provides information
about a set of files.
 Traditional – Unix File System.
 Distributed file systems.
 Network File Systems (NFS) - very popular, have been used for some time, but do
not scale well and have reliability problems; an NFS server could be a single point of
failure.
 Storage Area Networks (SAN) - allow cloud servers to deal with non-disruptive
changes in the storage configuration. The storage in a SAN can be pooled and
then allocated based on the needs of the servers. A SAN-based implementation
of a file system can be expensive, as each node must have a Fibre Channel
adapter to connect to the network.
 Parallel File Systems (PFS) - scalable, capable of distributing files across a
large number of nodes, with a global naming space. Several I/O nodes serve
data to all computational nodes; a PFS also includes a metadata server that
contains information about the data stored in the I/O nodes. The
interconnection network of a PFS could be a SAN.
Unix File System (UFS)
 The layered design provides flexibility.
 The layered design allows UFS to separate the concerns for the physical
file structure from the logical one.
The vnode layer allows UFS to treat local and remote file access
uniformly.
 The hierarchical design supports scalability, reflected by the file
naming convention. It allows grouping of files in directories, supports
multiple levels of directories, and collections of directories and files,
the so-called file systems.
 The metadata supports a systematic design philosophy of the file
system and device-independence.
 Metadata includes: file owner, access rights, creation time, time of the
last modification, file size, the structure of the file and the persistent
storage device cells where data is stored.
 The inodes contain information about individual files and directories.
The inodes are kept on persistent media together with the data.
UFS layering (figure): from top to bottom, the symbolic path name layer, the absolute path name layer, the path name layer, the file name layer, the inode layer, the file layer, and the block layer. The layers above the inode layer define the logical file structure (a sequence of logical records); the inode layer and below define the physical file structure (a set of blocks).
Network File System (NFS)
 Design objectives:
 Provide the same semantics as a local Unix File System (UFS) to ensure
compatibility with existing applications.
 Facilitate easy integration into existing UFS.
 Ensure that the system will be widely used; thus, support clients running on
different operating systems.
 Accept a modest performance degradation due to remote access over a network
with a bandwidth of several Mbps.
 NFS is based on the client-server paradigm. The client runs on the
local host while the server is at the site of the remote file system;
they interact by means of Remote Procedure Calls (RPC).
 A remote file is uniquely identified by a file handle (fh) rather than a
file descriptor. The file handle is a 32-byte internal name - a
combination of the file system identification, an inode number, and a
generation number.
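As an illustration, the three components can be packed into a 32-byte handle; a minimal sketch (the 8-byte field widths and zero padding are assumptions, not the actual NFS wire format):

```python
import struct

# Sketch: build a 32-byte file handle from the file-system id, the inode
# number, and the generation number. The 8-byte big-endian field widths
# and the zero padding are illustrative, not the real NFS encoding.

def make_fh(fsid, inode, generation):
    packed = struct.pack(">QQQ", fsid, inode, generation)  # 24 bytes
    return packed.ljust(32, b"\0")                         # pad to 32 bytes

fh = make_fh(fsid=1, inode=4242, generation=7)
print(len(fh))  # 32
```

The generation number lets the server detect stale handles: if an inode is reused for a new file, the generation changes and old handles stop matching.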

The NFS client-server interaction (figure): on the local host, an application calls the file system API; the vnode layer directs the call either to the local file system or to the NFS client, whose stub communicates over the network with the NFS server's stub on the remote host; the server then invokes the remote file system.
The vnode layer implements file operations in a uniform manner, regardless of
whether the file is local or remote.
An operation targeting a local file is directed to the local file system, while one for a
remote file involves NFS; an NFS client packages the relevant information about
the target and the NFS server passes it to the vnode layer on the remote host
which, in turn, directs it to the remote file system.
 The API of the UNIX file system and the corresponding RPCs issued
by an NFS client to the NFS server:
 fd  file descriptor.
 fh  file handle.
 fname  file name.
 dname  directory name.
 dfh  the directory where the file handle can be found.
 count  the number of bytes to be transferred.
 buf  the buffer to transfer the data to/from.
 device  the device where the file system is located.

Comparison of distributed file systems

Application API  NFS client RPC  NFS server action:
 OPEN(fname,flags,mode)  LOOKUP(dfh,fname) or CREATE(dfh,fname,mode)  look up fname in directory dfh and return fh (the file handle) and file attributes, or create a new file.
 CLOSE(fh)  no RPC  remove fh from the open-file table of the process.
 READ(fd,buf,count)  READ(fh,offset,count)  read data from file fh at offset and of length count and return it.
 WRITE(fd,buf,count)  WRITE(fh,offset,count,buf)  write count bytes of data to file fh at the location given by offset.
 SEEK(fd,buf,whence)  no RPC  update the file pointer in the open-file table of the process.
 FSYNCH(fd)  write all cached data to persistent storage  write data.
 CHMOD(fd,mode)  SETATTR(fh,mode)  update the inode information.
 RENAME(fromfname,tofname)  RENAME(dfh,fromfname,tofh,tofname)  rename the file.
 STAT(fname)  GETATTR(fh)  get metadata.
 MKDIR(dname)  MKDIR(dfh,dname,attr) and RMDIR(dname)  RMDIR(dfh,dname)  create/delete a directory.
 LINK(fname,linkname)  LOOKUP(dfh,fname), READLINK(fh), LINK(dfh,fname)  create a link.
 MOUNT(fsname,device)  LOOKUP(dfh,fname)  check the pathname and the sender's IP address and return the fh of the exported root directory.
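The OPEN mapping can be sketched as a client stub that issues LOOKUP and keeps the returned handle; a minimal illustration (the server state, directory names, and handle values are invented for the example):

```python
# Sketch: the client side of OPEN. The client sends LOOKUP(dfh, fname);
# the server resolves fname in directory dfh and returns the file handle,
# which the client then uses for READ/WRITE. Directory contents and handle
# values below are invented for the example.

class NFSServer:
    def __init__(self):
        self.directories = {"root-dfh": {"notes.txt": "fh-17"}}

    def lookup(self, dfh, fname):          # server side of LOOKUP(dfh, fname)
        return self.directories[dfh].get(fname)

def nfs_open(server, dfh, fname):
    fh = server.lookup(dfh, fname)         # the RPC; the client gets a handle,
    if fh is None:                         # not a local file descriptor
        raise FileNotFoundError(fname)
    return fh

server = NFSServer()
print(nfs_open(server, "root-dfh", "notes.txt"))  # fh-17
```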

General Parallel File System (GPFS)
 Parallel I/O implies concurrent execution of multiple input/output
operations. Support for parallel I/O is essential for the performance of
many applications.
 Concurrency control is a critical issue for parallel file systems. Several
semantics for handling shared access are possible. For example,
when the clients share the file pointer, successive reads issued by
multiple clients advance the file pointer; another semantics is to allow
each client to have its own file pointer.
 GPFS.
 Developed at IBM in the early 2000s as a successor of the TigerShark
multimedia file system.
 Designed for optimal performance of large clusters; it can support a file
system of up to 4 PB consisting of up to 4,096 disks of 1 TB each.
 Maximum file size is (2^63 - 1) bytes.
 A file consists of blocks of equal size, ranging from 16 KB to 1 MB,
striped across several disks.
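Round-robin striping of blocks across disks can be sketched in a line; the disk count below is an illustrative assumption:

```python
# Sketch: file blocks of equal size striped across several disks
# round-robin, so consecutive blocks live on different disks and can be
# read in parallel. The number of disks is an illustrative assumption.

NUM_DISKS = 4

def disk_for_block(block_index):
    return block_index % NUM_DISKS

print([disk_for_block(b) for b in range(6)])  # [0, 1, 2, 3, 0, 1]
```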
GPFS configuration (figure): I/O servers attached to several LANs (LAN1-LAN4) access a shared pool of disks through a SAN.
GPFS reliability

 To recover from system failures, GPFS records all metadata updates


in a write-ahead log file.
 Write-ahead  updates are written to persistent storage only after the
log records have been written.
 The log files are maintained by each I/O node for each file system it
mounts; any I/O node can initiate recovery on behalf of a failed node.
 Data striping allows concurrent access and improves performance, but
can have unpleasant side-effects. When a single disk fails, a large
number of files are affected.
 The system uses RAID devices with the stripes equal to the block size
and dual-attached RAID controllers.
 To further improve the fault tolerance of the system, GPFS data files
as well as metadata are replicated on two different physical disks.

GPFS distributed locking
 In GPFS, consistency and synchronization are ensured by a
distributed locking mechanism. A central lock manager grants lock
tokens to local lock managers running in each I/O node. Lock tokens
are also used by the cache management system.
 Lock granularity has important implications on the performance.
GPFS uses a variety of techniques for different types of data.
 Byte-range tokens  used for read and write operations to data files as
follows: the first node attempting to write to a file acquires a token
covering the entire file; this node is allowed to carry out all reads and
writes to the file without any need for permission until a second node
attempts to write to the same file; then, the range of the token given to
the first node is restricted.
 Data shipping  an alternative to byte-range locking that allows fine-grained
data sharing. In this mode the file blocks are controlled by the I/O nodes
in a round-robin manner. A node forwards a read or write operation to
the node controlling the target block, the only one allowed to access the
file.
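The byte-range token protocol can be sketched with a toy token manager; the whole-file first grant and the shrink-on-conflict step follow the slide, while the single in-process manager and the names are simplifications of the distributed mechanism:

```python
# Sketch of byte-range tokens: the first node to write gets a token for
# the whole file; when a second node requests a conflicting range, the
# earlier token is restricted so the ranges no longer overlap. A single
# in-process manager stands in for the distributed token machinery, and
# the shrink rule below handles only the simple "new range starts inside
# the old one" case.

class TokenManager:
    def __init__(self, file_size):
        self.file_size = file_size
        self.tokens = {}                    # node -> (start, end) byte range

    def acquire(self, node, start, end):
        if not self.tokens:
            # first writer: the token covers the entire file
            self.tokens[node] = (0, self.file_size)
        else:
            # shrink any token that overlaps the requested range
            for other, (s, e) in list(self.tokens.items()):
                if s < end and start < e:
                    self.tokens[other] = (s, start)
            self.tokens[node] = (start, end)
        return self.tokens[node]

mgr = TokenManager(file_size=1000)
print(mgr.acquire("node1", 0, 100))    # (0, 1000): whole file
print(mgr.acquire("node2", 500, 600))  # (500, 600); node1 shrinks to (0, 500)
```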
Google File System (GFS)
 GFS  developed in the late 1990s; uses thousands of storage
systems built from inexpensive commodity components to provide
petabytes of storage to a large user community with diverse needs.
 Design considerations.
 Scalability and reliability are critical features of the system; they must be
considered from the beginning, rather than at some stage of the design.
 The vast majority of files range in size from a few GB to hundreds of TB.
 The most common operation is to append to an existing file; random write
operations to a file are extremely infrequent.
 Sequential read operations are the norm.
 The users process the data in bulk and are less concerned with the
response time.
 The consistency model should be relaxed to simplify the system
implementation but without placing an additional burden on the application
developers.
GFS – design decisions
 Segment a file in large chunks.
 Implement an atomic file append operation allowing multiple
applications operating concurrently to append to the same file.
 Build the cluster around a high-bandwidth rather than low-latency
interconnection network. Separate the flow of control from the data
flow. Pipeline data transfer over TCP connections. Exploit network
topology by sending data to the closest node in the network.
 Eliminate caching at the client site. Caching increases the overhead
for maintaining consistency among cached copies.
 Ensure consistency by channeling critical file operations through a
master, a component of the cluster which controls the entire system.
 Minimize the involvement of the master in file access operations to
avoid hot-spot contention and to ensure scalability.
 Support efficient checkpointing and fast recovery mechanisms.
 Support an efficient garbage collection mechanism.
GFS chunks
 GFS files are collections of fixed-size segments called chunks.
 The chunk size is 64 MB; this choice is motivated by the desire to
optimize the performance for large files and to reduce the amount
of metadata maintained by the system.
 A large chunk size increases the likelihood that multiple operations
will be directed to the same chunk; thus, it reduces the number of
requests to locate the chunk and, at the same time, allows the
application to maintain a persistent network connection with the
server where the chunk is located.
 A chunk consists of 64-KB blocks, and each block has a 32-bit
checksum.
 Chunks are stored on Linux files systems and are replicated on
multiple sites; a user may change the number of the replicas, from
the standard value of three, to any desired value.
 At the time of file creation each chunk is assigned a unique chunk
handle.
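The layout arithmetic implied above - 64-MB chunks, each made of 64-KB checksummed blocks - can be sketched directly:

```python
# Sketch: map a byte offset within a GFS file to its chunk index and,
# within that chunk, to the 64-KB block whose checksum guards the data.
# The sizes are the ones given on the slide.

CHUNK_SIZE = 64 * 1024 * 1024   # 64 MB per chunk
GFS_BLOCK = 64 * 1024           # 64 KB per block, each with a 32-bit checksum

def locate_chunk(offset):
    chunk_index = offset // CHUNK_SIZE
    block_index = (offset % CHUNK_SIZE) // GFS_BLOCK
    return chunk_index, block_index

print(locate_chunk(200 * 1024 * 1024))  # (3, 128): 4th chunk, 129th block
```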
The architecture of a GFS cluster (figure): the application sends a file name and chunk index to the master and receives the chunk handle and chunk location; chunk data then flows directly between the application and the chunk servers, each running on top of a Linux file system; the master exchanges instructions and state information with the chunk servers.
 The master maintains state information about all system components;
it controls a number of chunk servers. A chunk server runs under Linux; it
uses metadata provided by the master to communicate directly with the
application. The data and the control paths are shown separately in the
figure, data paths with thick lines and the control paths with thin lines.
Arrows show the flow of control between the application, the master and
the chunk servers.