0% found this document useful (0 votes)
14 views14 pages

Unit-Iii CC

Unit III of the Cloud Computing course covers data storage and cloud computing, detailing various storage systems like DAS, SAN, and NAS, as well as cloud storage management and file systems. It discusses the benefits of cloud storage, including cost-effectiveness and security, and introduces cloud data management interfaces and provisioning methods. Additionally, it highlights applications utilizing cloud storage and different cloud services such as IaaS, PaaS, and SaaS.

Uploaded by

eshak3778
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views14 pages

Unit-Iii CC

Unit III of the Cloud Computing course covers data storage and cloud computing, detailing various storage systems like DAS, SAN, and NAS, as well as cloud storage management and file systems. It discusses the benefits of cloud storage, including cost-effectiveness and security, and introduces cloud data management interfaces and provisioning methods. Additionally, it highlights applications utilizing cloud storage and different cloud services such as IaaS, PaaS, and SaaS.

Uploaded by

eshak3778
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

III BSC(CS) CLOUD COMPUTING UNIT-III

UNIT - III
Data Storage and Cloud Computing : Data Storage – Cloud Storage – Cloud
Storage from LANs to WANs – Cloud Computing Services : Cloud Services –
Cloud Computing at Work
1)Data Storage in Cloud Computing:
1.1)Introduction to enterprise data storage:
Cloud Storage is a service that allows to save data on offsite storage system
managed by third-party and is made accessible by a web services API.
The various types of storage subsystems are:
● Direct Attached Storage (DAS)
● Storage Area Network (SAN)
● Network Attached Storage (NAS)
DAS: Direct Attached Storage DAS is the basic storage system providing
block-level storage and used for building SAN and NAS. The performance of
SAN and NAS depends on DAS. Performance of DAS will always be high,
because it is directly connected to the system.
SAN: Storage Area Network When multiple hosts want to connect a single
storage device, then SAN is used. SAN provides block-level storage and
simultaneous access is not permitted and hence it is suitable for clustering
environment. SAN technologies are FC (Fibre Channel), iSCSI (Internet SCSI)
and AoE (ATA over Ethernet).
NAS: Network Attached Storage For fi le-level storage, NAS is used. SAN and
DAS act as base system for NAS. NAS is also called as ‘File Server’. The main
advantages of NAS are that multiple hosts can share a single volume at the same
time, whereas when using SAN or DAS only one client can access the volume
at a time.
1.2)DATA STORAGE MANAGEMENT:
• Storage Resource Management (SRM) tools include configuration tools,
provisioning tools and measurement tools.
Configuration tools handle the set-up of storage resources. These tools help to
organize and manage RAID devices by assigning groups, defining levels or
assigning spare drives.

MP 1
III BSC(CS) CLOUD COMPUTING UNIT-III

● Provisioning tools define and control access to storage resources for


preventing a network user from being able to use any other user’s storage.
● Measurement tools analyse performance based on behavioural information
about a storage device. An administrator can use that information for future
capacity and upgrade planning.
Storage Management Process:
• Data storage management tools must rely on policies which governs the
usage of storage devices and its procedures. Storage management
encompasses three areas—change management, performance and
capacity planning and tiering (tiered storage).
1.3)FILE SYSTEMS:
• A file system is a structure used in computer to store data on a hard disk.
When we install a new hard disk, we need to partition and format it using
a fi le system before storing data. There are three file systems in use in
Windows OS; they are NTFS, FAT32 and rarely-used FAT.
FAT File System:
• FAT system was first devised in the so-called computer environment in
the early years. FAT was planned for systems with very small RAM and
small disks. It required much less system resources compared to other fi
le systems like UNIX.
NTFS:
• NTFS is much simpler than FAT. While fi les are used, the system areas
can be customized, enlarged, or moved as required. NTFS has much more
security incorporated. NTFS is not apt for small-sized disks.
Cloud File System:
In cloud file systems, the considerations are:
• It must sustain basic file system functionality.
• It should be an open source.
• It should be grown-up enough that users will at least think about trusting
their data to it.
• It should be shared, i.e., available over a network.
• It should be paralleling scalable.

MP 2
III BSC(CS) CLOUD COMPUTING UNIT-III

● It should provide honest data protection, still on commodity hardware with


only internal storage.
Ghost File System :
• Ghost cloud file system is used in Amazon Web Services (AWS). It gives
high redundant elastic mountable, cost-effective and standards-based file
system.
Gluster File System:
• The Gluster File System is a distributed file system that provides
scalable, high-performance, and reliable storage in cloud computing
environments. It is open-source and particularly well-suited for handling
large amounts of unstructured data, making it a popular choice for
applications like media streaming, data analytics, and large-scale file
sharing .
Hadoop File System:
• The Hadoop Distributed File System (HDFS) is a key component of the
Hadoop ecosystem and serves as the primary storage system for big data
processing in cloud computing.
• HDFS is designed to store and manage vast amounts of data in a
distributed and fault-tolerant manner across clusters of commodity
hardware.
XtreemFS:
• The XtreemFS (eXtreme Trusted Elastic File System) is a distributed
file system specifically designed for cloud computing and high-
performance computing (HPC) environments.
• It provides scalable, fault-tolerant, and secure storage for large-scale
distributed systems. XtreemFS is particularly suited for dynamic and
elastic cloud environments where storage requirements can change
frequently.
CloudFS:
• CloudFS (Cloud File System) is a distributed file system designed
specifically for cloud computing environments.
• It enables efficient, scalable, and reliable storage and access to data over
a network of distributed servers, typically in private, public, or hybrid
cloud architectures.

MP 3
III BSC(CS) CLOUD COMPUTING UNIT-III

1.4)CLOUD DATA STORES:


• A data store is a data repository where data are stored as objects. Data
store includes data repositories, flat files that can store data.
Data stores can be of different types:
• Relational databases (Examples: MySQL, PostgreSQL, Microsoft SQL
Server, Oracle Database) Object-oriented databases
• Operational data stores
• Schema-less data stores, e.g. Apache Cassandra or Dynamo Paper files
• Data files (spread sheets, flat files, etc)
• Distributed Data Store
Examples :Google’s BigTable, Amazon’s Dynamo and Windows Azure
Storage.
Types of Data Stores:
BigTable
• BigTable is a compressed, high performance and proprietary data storage
system construct on Google File System, Chubby Lock Service, SSTable
and a small number of other Google technologies.
Dynamo: A Distributed Storage System
• Dynamo is a vastly offered, proprietary key-value structured storage
system or a dispersed data store. It can act as databases and also
distributed hash tables (DHTs). It is used with parts of Amazon web
services such as Amazon S3.
1.5)USING GRIDS FOR DATA STORAGE:
Grid Storage for Grid Computing:
• It provides users and applications to use shared pool of resources. The
compute grid connects computers both desktops and servers and storage
across an organization.
• Grid computing allows sharing of computing and data resources for
multiple workloads and enables collaboration both within and across
organizations.

MP 4
III BSC(CS) CLOUD COMPUTING UNIT-III

Grid Oriented Storage (GOS):


• Grid Oriented Storage (GOS) is a dedicated data storage architecture
connected directly to a computational grid.
• It supports and acts as a data bank and reservoirs for data, which can be
shared among multiple grid clients.
2)CLOUD STORAGE:
The benefits of cloud storage are:
• Unpredictable storage growth
• Cost and complexity of conventional storage
• Security
Cloud storage is data storage hosted remotely using data storage devices in
WWW and maintained by the third party (service provider).
Cloud storage is a part of cloud computing. It is deployed using WAN
infrastructure which includes hardware components such as switches and
routers.
Cloud storage can be deployed in many ways. For example:
• Local data (desktop/laptop) can be backed up to cloud storage.
• A virtual disk can be ‘sync’ to the cloud and distributed.
• The cloud can be used as a reservoir for storing data.
2.1)DATA MANAGEMENT FOR CLOUD STORAGE:
Cloud storage will lose its abstraction and its benefits such as simplicity,
heterogeneity and good performance, if complex management services are
added.
Cloud storage should incorporate new services according to change of time.

MP 5
III BSC(CS) CLOUD COMPUTING UNIT-III

• For cloud storage, a standard document is placed by SNIA Storage


Industry Resource Domain Model (SIRDM). It states the importance of
simplicity for cloud storage.
• Figure 12.1 shows the SIRDM model which uses CDMI standards.
SIRDM model adopts three metadata: system consisting of storage
metadata, data metadata and user metadata. By using these metadata,
cloud storage interface can offer services without adding unnecessary
complexity in managing the data.
• Storage system and data system metadata are used to meet the
requirements of the data and the simplicity required is maintained.
• User metadata is used by the cloud to find the data objects and containers.
Storage system metadata is used by the cloud to offer basic storage
functions like assigning, modifying and access control.
• Data system metadata is used by the cloud to offer data as a service based
on user requirements and controls the operation based on that data.
Cloud Data Management Interface (CDMI):
• To create, retrieve, update and delete objects in a cloud the cloud data
management interface (CDMI) is used.
The functions in CDMI are:
• Cloud storage offerings are discovered by clients
• Management of containers and the data
• Sync metadata with containers an objects

MP 6
III BSC(CS) CLOUD COMPUTING UNIT-III

Cloud Storage Requirements:


Multi-tenancy
• In a multi-tenancy model, resources provided are pooled, so that it may
be shared by multiple customers based on their needs. Due to the
elasticity property in cloud computing, shared pool of storage model
makes the provider cost effective and billing is made easy.
Security
• Secure cloud storage requires a secure transmission channel and methods.
Securing data can be done using encryption, authentication and
authorization.
Secure Transmission Channel :
The four primary methods used to secure network communications are as
follows:
1. Transport Layer Security (TLS) and Secure Sockets Layer (SSL)
2. Hypertext Transfer Protocol Secure (HTTPS)
3. Private Networks
4. Virtual Private Networks (VPNs)
Performance
• Cloud storage performance can be categorized into two: speed and
latency. Factors that affect cloud storage performance are: available
network bandwidth, types of systems available in provider’s end, method
adopted for compression and caching.
Quality of Service (QoS) :
• Quality of service (QoS) refers to levels of performance and effi ciency of
the system that they can provide.
Data Protection and Availability:
• Physical site security
• Protection against power loss
• Protection against loss of network access
• Data redundancy

MP 7
III BSC(CS) CLOUD COMPUTING UNIT-III

Metering and Billing


• Metering and billing in cloud storage are done based on: data uploaded,
data downloaded, data stored and depends on requests and types of
request.
2.2)PROVISIONING CLOUD STORAGE:
• Cloud means sharing third party resources via the Internet. This sharing
can be done on need basis and there is no need to invest any
infrastructure at consumers end.
• By adopting Cloud Data Management Interface (CDMI), standard
service providers can implement the method for metering the storage and
data usage of consumers.
2.3)DATA-INTENSIVE TECHNOLOGIES FOR CLOUD COMPUTING:
Processing Approach
● The principle mechanism used for collection of the data and programs or
algorithms to perform the computation
● Programming model used
● Reliability and availability
● Scalability of both hardware and software
System Architecture:
Map reduce:
• MapReduce is a programming model and an associated implementation
for processing and generating large datasets that can be parallelized
across a distributed cluster of computers.
• The core idea of MapReduce is to divide a large dataset into smaller
chunks, process each chunk in parallel (mapping), and then combine the
results (reducing). This model is particularly effective for tasks like
sorting, filtering, and aggregating large amounts of data.
HPCC:
• HPCC stands for High-Performance Computing Cluster. It consists of a
collection of powerful servers or nodes that work together in parallel to
process large datasets, perform complex computations, or analyze big
data.

MP 8
III BSC(CS) CLOUD COMPUTING UNIT-III

• In a cloud computing context, an HPCC typically refers to a cluster of


virtual machines or containers hosted on cloud infrastructure that is
configured to run high-performance computing tasks.
3)Cloud Storage from LANs to WANs:
• Cloud storage represents a significant shift in how data is stored,
accessed, and managed, and it plays a key role in the evolution of
network architecture.
• Traditionally, organizations used Local Area Networks (LANs) to store
and manage data within their own premises, utilizing file servers,
network-attached storage (NAS), or storage area networks (SAN).
• With the advent of cloud computing, however, organizations have moved
to more centralized, off-site models of storage, using Wide Area
Networks (WANs) to access cloud storage resources.
DISTRIBUTED DATA STORAGE:
i) Amazon Dynamo
• Amazon DynamoDB is a fully managed NoSQL database service
offered by Amazon Web Services . It is designed to handle large-scale,
low-latency, and high-throughput applications with high availability and
reliability.
• DynamoDB is widely used for applications that require consistent
performance at scale, such as mobile apps, web apps, gaming platforms,
IoT devices, and more.

• DynamoDB is named after the original Dynamo system created by


Amazon, which was designed to be a highly available, distributed key-
value store.
CouchDB:
• Apache CouchDB is an open-source NoSQL database that uses a
document-oriented storage model.
• CouchDB stores data in JSON format and offers powerful features for
replication and synchronization, which are important in cloud computing
scenarios.
• In cloud computing, CouchDB can be used to build highly available,
distributed systems that require efficient storage and management of large
volumes of semi-structured data.

MP 9
III BSC(CS) CLOUD COMPUTING UNIT-III

ThruDB :
• It is a specialized database engine designed for high-performance data
storage and processing.
• It is often associated with cloud computing environments due to its
ability to scale efficiently, manage large datasets, and optimize data
access across distributed systems.
• ThruDB's features make it suitable for applications that require low-
latency, high-throughput, and scalability, which are all critical factors
in cloud environments.
APPLICATIONS UTILIZING CLOUD STORAGE:
Online File Storage:
i)DropBox
Dropbox is a popular cloud storage service that allows users to store files
online and access them from any device connected to the internet.
It is part of the broader field of cloud computing, which refers to the delivery
of computing resources and services—such as storage, computing power,
and software—over the internet, often referred to as "the cloud."
Box.net
Box.net(now simply known as Box) is a cloud-based storage and
collaboration platform that focuses on business and enterprise users. It
enables organizations to store, manage, and share files online.
Livemesh:
• Live Mesh was designed to provide users with an integrated solution to
sync and manage files across multiple devices and share them via the
cloud.
• The key features of Live Mesh, particularly in the context of cloud
computing, revolved around its ability to offer cross-platform file
synchronization, remote desktop capabilities, and cloud storage.
Oosah:
• Oosah was an early cloud storage and file-sharing platform.
• Oosah aimed to provide users with a simple and easy way to store,
organize, and share their media files in the cloud.

MP 10
III BSC(CS) CLOUD COMPUTING UNIT-III

• It was marketed primarily as a way to store digital content and provide


online backup for photos, music, and videos.
Jungle Disk:
It is a cloud storage and backup service that was initially launched in 2006. It
provided a way for individuals and businesses to securely store and back up
data in the cloud.
Jungle Disk made use of Amazon Web Services (AWS) to offer scalable,
reliable, and secure cloud storage.
Cloud Storage Companies:
• Box cloud storage
• Amazon cloud
• SugarSync online backup
• Hubic online storage
• Google cloud drive
4)CLOUD SERVICES:
• Infrastructure as a Service (IaaS)
• Platform as a Service (PaaS)
• Software as a Service (SaaS)
OTHER CLOUDS SERVICES:
Storage as a Service (STaaS):
• Storage as a service is a methodology wherein large organizations leases
their accessible storage space to a small scale business or individuals.
Database as a Service (DaaS):
• DBaaS inherits the benefits of SaaS and permits IT businesses to use the
hardware and programs services without being installed in their
premises.
• DBaaS can be characterized as ‘an organized service, suggested on a
pay-per-usage basis, that presents on-demand access to a database for the
storage of application data’.
Information as a Service (INaaS):
• For large IT organizations, data related to stakeholders and others should
be accurate.

MP 11
III BSC(CS) CLOUD COMPUTING UNIT-III

• Due to heterogeneous data formats, batch processing and application


integration charges, there is an obstacle in the system to get
uninterrupted information.
5)CLOUD COMPUTING AT WORK:
CLOUD SERVICE DEVELOPMENT TOOL:
Application Development Using IDE:
• Integrated Development Environments (IDEs) comprise of source codes,
automation experts and a debugger.
• Cloud IDEs, for example, koding.com, Cloud9 or eXo are all free of
subscription as they are in the cloud and it allows for developers to
conveniently edit and code anytime, anywhere.
Application Using Databases:
• Most applications today need some sort of database connectivity/usage.
• When you evolve and deploy in the cloud, the easiest thing to do is
ensure that your database is consigned as a service in the cloud, so that
you regularly observe Xeround’s FREE cloud database.
Deploying Application:
• CloudBees boasts an IaaS solution called Run@Cloud, which presents all
services needed to be established in the cloud, free of charge.
MANAGEMENT/ADMINISTRATIVE SERVICES:
Collaborating Event Management:
i)Event Planning and Workflow Management
• A successful event begins well in advance of its due date. Most event
management applications encompass robust task designing modules
similar to what you would find in higher-end or lower-end task
administration applications.
Advance Registration:
• Larger events need registration of participants. For this purpose, most
event management apps encompass a web-based registration module,
where attendees can signal up for the event.
• Attendee data is stored into a web module and that data is then retained
on the application provider’s cloud servers.

MP 12
III BSC(CS) CLOUD COMPUTING UNIT-III

• In the entire management program, the registration module plays a vital


role. Ensure that it does everything required to manage and does so in
available means.
Payment Processing:
• The key role of the event management is collecting fee from onsite
applicants. Event management programs bind fee processing into the
registration module, accepting fee through credit card, PayPal, or
anything other procedures.
Contact Management:
• This is a kind of service that numerous event managers offer. Using the
expert database of the event managers, communication services can be
provided to help attendees get in touch with one another.
• At the least, event management application should publish a guide book
or directions to the venue, which can then be supplied as part of the
welcome package.
Budget Management :
• Running an event is a costly and convoluted undertaking. The general
allowance comprises of hundreds of individual estimations.
• For that purpose, event management application should encompass a
robust accounting or allowance management module, to track both costs
and income.
Post-event Reporting and Analysis:
• When the event is over, the job is not rather finished yet. It is essential to
look back at the whole event and analyse how successful it was.
• For this purpose, most event management applications encompass some
kind of post-event analysis. Some apps even let us monitor attendee
reviews, which can provide precious feedback.
ii)Collaborating Contact Management:
CRM: CRM software not only stores clientele communication data, it also
stores and analyses all data pertaining to a specific clientele and evaluates them
to assist you to communicate better with that clientele.

MP 13
III BSC(CS) CLOUD COMPUTING UNIT-III

iii)Collaborating Project Management:


• Engineering and Construction PM Cloud
• Information Technology PM Cloud
• Research and Development PM Cloud
• Government PM Cloud
• Education PM Cloud

MP 14

You might also like