UNIT 5 Storage Systems
UNIT 5 Storage Systems
Unmanaged cloud storage means the storage is preconfigured for the customer. The
customer can neither format, nor install his own file system or change drive properties.
Managed cloud storage offers online storage space on-demand. The managed cloud
storage system appears to the user to be a raw disk that the user can partition and
format.
Explore our latest online courses and learn new skills at your own pace. Enroll and
become a certified expert to boost your career.
The cloud storage system stores multiple copies of data on multiple servers, at multiple
locations. If one system fails, then it is required only to change the pointer to the
location, where the object is stored.
To aggregate the storage assets into cloud storage systems, the cloud provider can use
storage virtualization software known as StorageGRID. It creates a virtualization layer
that fetches storage from different storage devices into a single management system. It
can also manage data from CIFS and NFS file systems over the Internet. The following
diagram shows how StorageGRID virtualizes the storage into storage clouds:
Virtual Storage Containers
The virtual storage containers offer high performance cloud storage systems. Logical
Unit Number (LUN) of device, files and other objects are created in virtual storage
containers. Following diagram shows a virtual storage container, defining a cloud storage
domain:
Challenges
Storing the data in cloud is not that simple task. Apart from its flexibility and
convenience, it also has several challenges faced by the customers. The customers must
be able to:
storage models
Unmanaged cloud storage means the storage is preconfigured for the customer. The
customer can neither format, nor install his own file system or change drive properties.
Managed cloud storage offers online storage space on-demand. The managed cloud
storage system appears to the user to be a raw disk that the user can partition and
format.
The cloud storage system stores multiple copies of data on multiple servers, at multiple
locations. If one system fails, then it is required only to change the pointer to the
location, where the object is stored.
To aggregate the storage assets into cloud storage systems, the cloud provider can use
storage virtualization software known as StorageGRID. It creates a virtualization layer
that fetches storage from different storage devices into a single management system. It
can also manage data from CIFS and NFS file systems over the Internet. The following
diagram shows how StorageGRID virtualizes the storage into storage clouds:
Virtual Storage Containers
The virtual storage containers offer high performance cloud storage systems. Logical
Unit Number (LUN) of device, files and other objects are created in virtual storage
containers. Following diagram shows a virtual storage container, defining a cloud storage
domain:
B. File system
Definition of a file system − A file system is a software component that
manages files and directories on a storage device. It provides a way for
applications and users to access and organize files. A file system also manages
space allocation, file naming, and file permissions.
Types of file system − Types of file systems include local file systems that are
used on a single computer such as NTFS, FAT32, and HFS+, and network file
systems that allow files to be accessed over a network such as NFS, CIFS, and
AFS.
Fault Tolerance
Fault tolerance is the ability of a system to continue operating in the presence of
hardware or software failures. In a distributed file system, fault tolerance is crucial
to ensure that data remains available and accessible in the event of failures.
Challenges in achieving fault tolerance in distributed file systems include issues
with data replication, network partitioning, and failure detection. Techniques such
as replication, redundancy, and fault detection can be used to manage fault
tolerance in distributed file systems.
Security
Security in distributed file systems refers to the ability of a system to protect data
from unauthorized access or modification. This includes ensuring that data is
encrypted, authenticated, and authorized based on user roles and permissions.
Challenges in achieving security in distributed file systems include issues with
data privacy, integrity, and authentication. Techniques such as encryption, access
control, and firewalls can be used to manage security in distributed file systems.
Synchronization Techniques
Synchronization methods are used to ensure that data remains consistent and up-
to-date across multiple copies in a distributed file system. Examples of
synchronization methods include locking, versioning, and time-stamping.
Consensus algorithms are used to achieve agreement among multiple nodes in a
distributed file system. Examples of consensus algorithms include the Paxos
algorithm and the Raft algorithm. These algorithms are used to ensure that data
remains consistent and available even in the presence of network failures or node
crashes.
Conclusion
In conclusion, a distributed file system provides a way for users to access and store files
across multiple machines in a networked environment. It offers advantages such as
scalability, fault tolerance, and support for data-intensive applications. However, it also
presents challenges such as data consistency, fault tolerance, and security. To design a
distributed file system, one must consider the architecture, data access mechanisms,
and synchronization techniques. Examples of distributed file systems include Google File
System (GFS) and Hadoop Distributed File System (HDFS), each with its own advantages
and disadvantages. Overall, distributed file systems play a crucial role in managing and
processing large amounts of data in today's interconnected world.