Chapter 8 - Virtualization and Cloud Computing Part 1
Chapter 8 - Virtualization and Cloud Computing Part 1
• The virtualization layer is the software responsible for hosting and managing
all VMs. The virtualization layer is a hypervisor running directly on the
hardware.
Hardware
Characteristics of Virtualization
Virtualization has three characteristics that support the scalability and
operating efficiency required for big data environments:
Partitioning: In virtualization, many applications and operating systems are
supported in a single physical system by partitioning the available resources.
Isolation: Each virtual machine is isolated from its host physical system and
other virtualized machines. Because of this isolation, if one virtual instance
crashes, the other virtual machines and the host system aren’t affected. In
addition, data isn’t shared between one virtual instance and another.
Encapsulation: A virtual machine can be represented as a single file, so you
can identify it easily based on the services it provides.
Benefits of Virtualization
Sharing of resources helps cost reduction
Isolation:Virtual machines are isolated from each other as if they are
physically separated
Encapsulation: Virtual machines encapsulate a complete computing
environment
Hardware Independence: Virtual machines run independently of underlying
hardware
Portability: Virtual machines can be migrated between different hosts.
Application of Virtualization for Big Data
Big data application virtualization
Application infrastructure virtualization provides an efficient way to manage
applications in context with customer demand.
The application is encapsulated in a way that removes its dependencies from the
underlying physical computer system.
This helps to improve the overall manageability and portability of the application.
Application of Virtualization for Big Data
Big data application virtualization
In addition, the application infrastructure virtualization software typically allows
for codifying business and technical usage policies to make sure that each of your
applications leverages virtual and physical resources in a predictable way.
Efficiencies are gained because you can more easily distribute IT resources
according to the relative business value of your applications.
Application infrastructure virtualization used in combination with server
virtualization can help to ensure that business service-level agreements are met.
Server virtualization monitors CPU and memory usage, but does not account for
variations in business priority when allocating resources.
Application of Virtualization for Big Data
Big data network virtualization
Network virtualization provides an efficient way to use networking as a pool of
connection resources.
Instead of relying on the physical network for managing traffic, you can create
multiple virtual networks all utilizing the same physical implementation.
This can be useful if you need to define a network for data gathering with a certain
set of performance characteristics and capacity and another network for
applications with different performance and capacity.
Virtualizing the network helps reduce these bottlenecks and improve the
capability to manage the large distributed data required for big data analysis.
Application of Virtualization for Big Data
Big data processor and memory virtualization
Processor virtualization helps to optimize the processor and maximize
performance. Memory virtualization decouples memory from the servers.
In big data analysis, you may have repeated queries of large data sets and the
creation of advanced analytic algorithms, all designed to look for patterns and
trends that are not yet understood.
These advanced analytics can require lots of processing power (CPU) and memory
(RAM).
For some of these computations, it can take a long time without sufficient CPU
and memory resources.
Application of Virtualization for Big Data
Big data and storage virtualization
Data virtualization can be used to create a platform for dynamic linked data
services. This allows data to be easily searched and linked through a unified
reference source.
As a result, data virtualization provides an abstract service that delivers data in a
consistent form regardless of the underlying physical database.
In addition, data virtualization exposes cached data to all applications to improve
performance.
Storage virtualization combines physical storage resources so that they are more
effectively shared. This reduces the cost of storage and makes it easier to manage
data stores required for big data analysis.
Cloud Computing and Big Data
One of the vital issues that organizations face with the storage and management of
Big Data is the huge amount of investment to get the required hardware setup and
software packages.
Elasticity
Resource Pooling
Self-Service
Low Cost
Fault Tolerance
Cloud Computing and Big Data
Community Cloud
Hybrid Cloud
Cloud Computing and Big Data
• Public Cloud (End-User Level Cloud): A cloud that is owned and managed by a
company than the one (which can be either an individual user or a company) using it
is known as a public cloud.
• Private Cloud (Enterprise Level Cloud): The cloud that remains entirely in the
ownership of the organization using it is known as a private cloud.
Cloud Computing and Big Data
• Community Cloud: Community cloud is a type of cloud that is shared among various
organizations with a common tie.
• Hybrid Cloud: The cloud environment in which various internal or external service
providers offer services to many organizations is known as a hybrid cloud.
Cloud Computing and Big Data
Big Data cloud providers have been gearing up to bring the most advanced technologies at
competitive prices in the market.
Some providers are established, whereas some of them are relatively new to the field of
cloud services. Some of these providers are rendering services that are relevant to Big Data
analytics only. Some such providers are as follows:
Amazon
Google
Windows Azure
SaaS, PaaS and Iaas applications
Benefits of Big data analysis in Cloud.
Improved analysis
With the advancement of Cloud technology, big data analysis has become more
improved causing better results. Hence, companies prefer to perform big data
analysis in the Cloud. Moreover, Cloud helps to integrate data from numerous
sources.
Benefits of Big data analysis in Cloud.
Simplified Infrastructure
Big Data analysis is a tremendous strenuous job on infrastructure as the data comes
in large volumes with varying speeds, and types which traditional infrastructures
usually cannot keep up with.
capacity
scalability
security
privacy
Big Data & Virtualization: Concept familiarization and relation between them
accelerate the insight of Big Data and Virtualization with a laconic concept and
significant overview, https://www.ijedr.org/papers/IJEDR1803107.pdf