EMC ScaleIO Basic Architecture (h14344)
June 2015
EMC WHITE PAPER
CONTENTS
EXECUTIVE SUMMARY
AUDIENCE
TERMINOLOGY TABLE
SCALEIO SYSTEM ARCHITECTURE BASICS
  ScaleIO Data Client – SDC
  ScaleIO Data Server – SDS
IO FLOW
CACHE
  XtremCache
  Write buffering
IO TYPES
  Read Hits (RH)
  Read Misses (RM)
  Writes
PROTECTION DOMAINS
STORAGE POOLS
FAULT SET
SNAPSHOTS
QUALITY OF SERVICE (QOS)
THROTTLING
SCALEIO MANAGEMENT
CONCLUSIONS
EXECUTIVE SUMMARY
This document is designed to help users understand the basic concepts and the architecture of ScaleIO.
EMC ScaleIO® is software that creates a server-based SAN from local application server storage (local or network
storage devices). ScaleIO delivers flexible, scalable performance and capacity on demand. ScaleIO integrates
storage and compute resources, scaling to thousands of servers (also called nodes). As an alternative to traditional
SAN infrastructures, ScaleIO combines hard disk drives (HDD), solid state disk (SSD), and Peripheral Component
Interconnect Express (PCIe) flash cards to create a virtual pool of block storage with varying performance tiers.
As opposed to traditional Fibre Channel SANs, ScaleIO has no requirement for a Fibre Channel fabric between the
servers and the storage. This further reduces the cost and complexity of the solution. In addition, ScaleIO is
hardware-agnostic and supports either physical or virtual application servers. It creates a software-defined storage
environment that allows users to exploit the unused local storage capacity in any server. ScaleIO provides a
scalable, high performance, fault tolerant distributed shared storage system.
AUDIENCE
This white paper is intended for customers, partners and employees interested in understanding the concepts,
basic architecture and components of a ScaleIO system.
TERMINOLOGY TABLE
HDD: Hard disk drive. A traditional magnetic device that stores digitally encoded data.
SSD: Solid state disk. Has no moving parts and uses flash memory to store data persistently.
PCIe: Peripheral Component Interconnect Express, a high-speed serial computer expansion bus.
SDS: ScaleIO Data Server. Contributes local storage space to an aggregated pool of storage within the ScaleIO virtual SAN.
SDC: ScaleIO Data Client. A lightweight device driver that exposes ScaleIO shared block volumes to applications.
MDM: ScaleIO Meta Data Manager. Manages, configures, and monitors the ScaleIO system.
Hyperconverged: A converged configuration or converged infrastructure (CI) where the application runs in the same layer as the storage and compute.
LAN: Local Area Network providing interconnectivity within a limited (local) area. ScaleIO supports all network speeds, including 100 Mb, 1 Gb, 10 Gb, 40 Gb, and InfiniBand (IB).
OpenStack: A free, open-source cloud computing software platform.
CTQ: Command Tag Queueing. Reorders IOs to optimize drive seeks and improve the IOPS of the drive.
RTO: Recovery time objective. The time for a system/application to be restored, e.g. from backup, after a failure or disruption.
DRAM: Dynamic RAM. A type of random access memory used in servers for caching, etc.
Protection Domain: A logical container for SDSs. Each SDS can belong to only one Protection Domain.
Storage Pool: A logical cross-SDS group of drives within a single Protection Domain. Usually used to group drives that share common characteristics, e.g. a SAS pool, an SSD pool, etc.
IOC: Basic IO controller. Passes IO through without any additional features or functions.
ROC: RAID-on-Chip. Adds features such as write buffering, RAID calculations, etc.
Figure 1 - ScaleIO SDC
Users may modify the default ScaleIO configuration parameters to allow two SDCs to access the same data. This
feature enables support for clustered applications such as Oracle RAC.
SCALEIO CONFIGURATIONS
There are three standard configurations for ScaleIO implementations, all providing flexibility and scalability. They
are as follows and are discussed in the following sections:
Converged configuration
Two-layer configuration
Mixed configuration
Converged Configuration
In a converged configuration, the SDCs and SDSs reside on the same servers. The applications perform IO operations via the local SDC. All servers contribute some or all of their local storage to
the ScaleIO system via their local SDS. Components communicate over the Local Area Network (LAN).
Two-layer Configuration
There is no ScaleIO requirement to implement a Converged configuration, as shown above, where the SDCs and
SDSs reside on the same servers.
In certain situations, customers prefer to have the SDS separated from an SDC and installed on a different server.
This type of configuration is called a two-layer configuration where the SDCs are configured on one group of servers,
and the SDSs are configured on another distinct group of servers, shown in Figure 4.
The applications that run on the first group of servers issue IO requests to their local SDC. The second group,
running SDSs, contributes the servers’ local storage to the virtual SAN. Both groups communicate over a local area
network. Applications run in one layer, while storage resides in another.
This deployment is similar to a traditional external storage system such as VNX and VMAX, but without the Fibre
Channel layer.
Figure 4 - Two-layer Configuration
Mixed Configuration
ScaleIO is very flexible and allows any combination of the two configurations. When two-layer and converged
configurations coexist, this is called a mixed configuration. ScaleIO has no restriction on when configuration changes
can be made. A mixed configuration is common as a transient state when moving from a two-layer configuration to a
converged configuration.
When a new group of servers is added as SDS servers, ScaleIO automatically rearranges, optimizes and
rebalances the data in the background without any downtime. ScaleIO deployments can be changed or grown
quickly and easily, supporting hundreds of nodes.
META DATA MANAGER – MDM
The Meta Data Manager manages the ScaleIO system. The MDM contains all the metadata required for system
operation, such as configuration changes. The MDM also provides monitoring capabilities to assist users with most
system management tasks.
The MDM manages the metadata, SDCs, SDSs, device mappings, volumes, snapshots, system capacity (including
device allocation and/or release of capacity), RAID protection, errors and failures, and system rebuild tasks
including rebalancing. In addition, the MDM responds to all user commands and queries. In a normal IO flow, the
MDM is not part of the data path and user data does not pass through the MDM. Therefore, the MDM is never a
performance bottleneck for IO operations.
The MDM uses an Active/Passive methodology with a tiebreaker component, where the primary node is active and
the secondary is passive. The data repository is stored on both the Active and the Passive nodes.
Currently, an MDM can manage up to 1024 servers. When several MDMs are present, an SDC may be managed by
several MDMs, whereas an SDS can belong to only one MDM.
The MDM is extremely lightweight and has an asynchronous (or lazy) interaction with the SDCs and SDSs. The MDM
daemon produces a heartbeat, with updates performed every few seconds. If the MDM does not detect the
heartbeat from an SDS, it initiates a forward rebuild.
All ScaleIO commands are asynchronous, with one exception: for consistency reasons, the unmap command is
synchronous, and the user must wait for its completion before continuing.
Each SDC holds mapping information that is lightweight and efficient, so it can be stored in memory. For every 8
PB of storage, the SDC requires roughly 2 MB of RAM. Mapping information may change without the client being
notified; this is the nature of a lazy, loosely coupled approach.
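As a rough illustration of this footprint, the rule of thumb above (about 2 MB of RAM per 8 PB of mapped capacity) can be applied to any system size; the sketch below simply assumes linear scaling:

    # Rough SDC mapping-memory estimate, assuming the ~2 MB per 8 PB rule of
    # thumb quoted above scales linearly with mapped capacity.
    MB_PER_8_PB = 2.0

    def sdc_mapping_ram_mb(capacity_pb: float) -> float:
        """Approximate RAM (MB) an SDC needs to map capacity_pb petabytes."""
        return MB_PER_8_PB * (capacity_pb / 8.0)

    if __name__ == "__main__":
        for pb in (1, 8, 16):
            print(f"{pb} PB -> ~{sdc_mapping_ram_mb(pb):.2f} MB of SDC mapping RAM")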
Figure 6 - ScaleIO volume layout
Rebuilds
ScaleIO systems automatically rebuild a failed drive or failed server. For example, if SDS1 crashes, ScaleIO
rebuilds its 1 MB chunks by copying them from their mirrors. This process is called a forward rebuild. It is a many-to-many
copy operation, which is what makes the rebuild such a quick operation.
Upon completion of the forward rebuild operation, the system is fully protected and optimized. Better still, while
this operation is in progress, all of the data is accessible to applications so that users experience no outage or
disruption in service.
The backward rebuild option is used when a node goes down for only a short period of time. This option is managed
by the MDM, which determines whether updating the mirrored volumes will be faster than rebuilding the data on the
downed server. The MDM collects all tracked changes from all SDSs and is therefore equipped to make the best
decision between the forward and backward rebuild methods.
ScaleIO always reserves space on servers for the case of an unplanned outage, when rebuilds require
unused disk space. To ensure data protection during server failures, ScaleIO reserves 10% of the capacity by
default and does not allow it to be used for volume allocation.
To ensure full system protection in the event of a node failure, users must ensure that the spare capacity is at
least equal to the capacity of the largest node, or the largest Fault Set. If all nodes contain equal capacity, it is
recommended to set the spare capacity to at least 1/N of the total capacity (where N is the number of SDS nodes).
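The sizing rule above can be expressed as a short calculation; the node capacities below are invented for the example:

    # Spare-capacity sizing sketch based on the guidance above: reserve at
    # least the capacity of the largest node (with equal nodes this equals
    # 1/N of the total). The example node sizes are made up.
    def recommended_spare_tb(node_capacities_tb):
        """Return the minimum spare capacity (TB) to survive one node failure."""
        return max(node_capacities_tb)

    if __name__ == "__main__":
        nodes_tb = [24, 24, 24, 48]          # hypothetical SDS node capacities
        spare = recommended_spare_tb(nodes_tb)
        total = sum(nodes_tb)
        print(f"Reserve at least {spare} TB "
              f"({spare / total:.0%} of the {total} TB total) as spare capacity")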
Rebalance
One of ScaleIO’s greatest benefits is its elasticity. Adding or removing devices and/or servers in a ScaleIO
configuration triggers an automatic migration and rebalance of data across the available devices.
During this process, the data is migrated and then rebalanced across the servers. The rebalance process simplifies
ScaleIO management, eliminating long refresh cycles, and making the environment more dynamic and flexible. If
more compute power or storage is required, simply add more devices or more servers.
IO FLOW
IOs from the application are serviced by the SDC that runs on the same server as the application. The SDC fulfills the
IO request regardless of where any particular block physically resides.
When the IO is a Write, the SDC sends the IO to the SDS where the primary copy is located. The primary SDS writes
the IO to its local drive and, in parallel, sends another IO to the secondary mirror. Only after an
acknowledgment is received from the secondary SDS does the primary SDS acknowledge the IO to the SDC.
A Read IO from the application will trigger the SDC to issue the IO to the SDS with the Primary chunk.
In terms of resources consumed, one host Write IO will generate two IOs over both the network and back-end drives.
A read will generate one network IO and one back-end IO to the drives. For example, if the application is issuing an
8 KB Write, the network and drives will get 2x8 KB IOs. For an 8 KB Read, there will be only one 8 KB IO on the
network and drives.
Note: The IO flow does not require any MDM or any other central management point. For this reason, ScaleIO is able
to scale linearly in terms of performance.
Every SDC knows how to direct an IO operation to the destination SDS. Because ScaleIO volume chunks are evenly
distributed across drives and nodes, the workload is always well balanced. There is no flooding or
broadcasting. This is extremely efficient parallelism that eliminates single points of failure. Since there is no central
point of routing, all of this happens in a distributed manner. The SDC has all the intelligence needed to route every
request, preventing unnecessary network traffic and redundant SDS resource usage.
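The mirrored write path described above can be summarized in a conceptual sketch. This is illustrative logic only, not ScaleIO code, and the class and function names are invented for the example:

    # Conceptual sketch of the mirrored write path: the SDC sends the write to
    # the primary SDS, which writes locally and forwards to the secondary in
    # parallel; the SDC is acknowledged only after both copies are persisted.
    from concurrent.futures import ThreadPoolExecutor

    class Sds:
        def __init__(self, name):
            self.name = name
            self.blocks = {}                      # stand-in for the local drive

        def write_local(self, offset, data):
            self.blocks[offset] = data            # "persist" the chunk
            return True                           # local write acknowledged

    def sdc_write(primary: Sds, secondary: Sds, offset: int, data: bytes) -> bool:
        """Return True once both copies are acknowledged (2 network + 2 drive IOs)."""
        with ThreadPoolExecutor(max_workers=2) as pool:
            local_ack = pool.submit(primary.write_local, offset, data)
            mirror_ack = pool.submit(secondary.write_local, offset, data)
            # Primary acknowledges the SDC only after the secondary's ack arrives.
            return local_ack.result() and mirror_ack.result()

    if __name__ == "__main__":
        ok = sdc_write(Sds("sds-1"), Sds("sds-2"), offset=0, data=b"\x00" * 8192)
        print("write acknowledged to application:", ok)

Because the local write and the mirror write proceed in parallel, the host write latency is governed by the slower of the two copies rather than by their sum.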
CACHE
Cache is a critical aspect of storage performance. At present, ScaleIO uses a RAM cache for Read Hits. The ScaleIO cache
keeps recently-accessed data readily available. IOs read from cache have a lower response time than IOs serviced
by the drives, including Flash drives.
Another benefit of caching is that it reduces the drive workload, which in many cases is the performance bottleneck
in the system.
Cache in ScaleIO is managed by the SDS. This is a simple and clean implementation that does not require cache
coherency management, which would have been required had the cache been managed by the SDC.
Figure 10 - Cache warming up
Note: Both the MD and UD use Least Recently Used (LRU) algorithms to make sure “old” data is
evicted from cache first.
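The LRU behavior referenced in the note can be illustrated with a minimal, generic sketch; this is not ScaleIO's cache code, and the page-based structure is assumed for the example:

    # Minimal LRU read-cache sketch illustrating the eviction policy noted
    # above; generic example code, not ScaleIO's implementation.
    from collections import OrderedDict

    class LruReadCache:
        def __init__(self, capacity_pages: int):
            self.capacity = capacity_pages
            self.pages = OrderedDict()            # page_id -> data, oldest first

        def get(self, page_id):
            if page_id not in self.pages:
                return None                       # read miss: caller goes to the drive
            self.pages.move_to_end(page_id)       # mark as most recently used
            return self.pages[page_id]            # read hit served from RAM

        def put(self, page_id, data):
            self.pages[page_id] = data
            self.pages.move_to_end(page_id)
            if len(self.pages) > self.capacity:
                self.pages.popitem(last=False)    # evict the least recently used page

    if __name__ == "__main__":
        cache = LruReadCache(capacity_pages=2)
        cache.put(1, b"a"); cache.put(2, b"b")
        cache.get(1)                              # page 1 becomes most recently used
        cache.put(3, b"c")                        # evicts page 2 ("old" data goes first)
        print(sorted(cache.pages))                # [1, 3]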
XtremCache
ScaleIO is equipped to use another caching option: XtremCache (formerly named XtremSW). This is a software layer
located under the SDS. XtremCache allows a Flash drive of any type and size to be used as additional system Read
cache (writes are buffered only for Reads after Writes). Compared to RAID controller caching solutions,
XtremCache also allows the use of host PCIe Flash cards/drives, which can deliver an order of magnitude better
performance than regular enterprise SAS solid state disks (SSDs).
Write caching is achieved by the RAID controller, as explained in the next section.
Write buffering
Writes are only buffered in the host memory for Read-after-Write caching. One way to achieve Write buffering is to
use RAID controllers (e.g. LSI, PMC, etc.) that have battery backup for write buffering. It is important that the DRAM
buffer be protected against sudden power outages to avoid any data loss.
RAID controllers also have an option to extend their cache onto Flash drives configured in the system (e.g.
LSI CacheCade and PMC MaxCache). This allows increasing the cache from the 1-4 GB of controller DRAM to 512 GB; up to 2 TB of
Flash cache can be managed by the RAID controller. This cache is used for both Reads and Writes.
The main effect of write buffering is on Write response time, which is much lower when the IO is
acknowledged from DRAM/Flash rather than from an HDD.
An added benefit of buffering writes in a RAID controller is that it enables elevator reordering, sometimes
referred to as Command Tag Queuing (CTQ). Elevator reordering increases the maximum IOPS of an HDD,
and can even reduce the drive’s load because rewrites to the same address locations are absorbed in the buffer.
Note: Apart from the rewrite effect and CTQ, write buffering does not affect the maximum sustained random
write throughput.
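Elevator reordering itself is straightforward to illustrate. The sketch below simply services a queue of buffered IOs in address order from the current head position; it is a generic example, not a description of any particular RAID controller:

    # Generic elevator (CTQ-style) reordering sketch: buffered IOs are serviced
    # in LBA order from the current head position, reducing seek distance.
    def elevator_order(queued_lbas, head_position):
        """Return the queued LBAs in a single ascending sweep from the head."""
        ahead = sorted(lba for lba in queued_lbas if lba >= head_position)
        behind = sorted(lba for lba in queued_lbas if lba < head_position)
        return ahead + behind                     # sweep up, then wrap to the start

    if __name__ == "__main__":
        queue = [8200, 120, 5000, 9100, 64]
        print(elevator_order(queue, head_position=4000))  # [5000, 8200, 9100, 64, 120]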
For Flash-only configurations, it is usually recommended to use a pass-through IOC instead of a ROC (RAID-on-Chip)
controller, since writes are acknowledged from the Flash drives regardless.
IO TYPES
There are three types of IO operations in a ScaleIO system:
Read Hit
Read Miss
Write
Each IO type and size behaves differently since they exercise different components inside the ScaleIO system.
It is important to consider that sequential reads are not counted separately: any IO serviced from the host read
cache is counted as a Read Hit, and any other read is counted as a Read Miss.
Note: There is minimal pre-fetch as part of the ScaleIO cache code. For example, a sequential read
of 512B will bring 4 KB into cache. A best practice recommendation is to use the Read-ahead
feature in the Cache controller only for HDD drives. This allows pre-fetching IOs to increase the
performance when using HDD drives. With Flash drives, this feature is not necessary and not
recommended.
Writes
A Write is a Write IO operation to the ScaleIO system. Apart from the write buffering cases described in the above
section, there is little difference between the various write types, e.g. Sequential and Random.
PROTECTION DOMAINS
A Protection Domain is a set of SDSs. Each SDS belongs to one (and only one) Protection Domain. Thus, by
definition, each Protection Domain is a unique set of SDSs.
The ScaleIO Data Client (SDC) is not part of the Protection Domain. An SDC residing on the same server as an SDS
that belongs to Protection Domain X can also access data in Protection Domain Y.
The recommended number of nodes in a Protection Domain is 100. Users can add Protection Domains during
installation. In addition, Protection Domains can be modified post-installation with all the management clients
(except for OpenStack).
STORAGE POOLS
Storage Pools allow the generation of different performance tiers in the ScaleIO system. A Storage Pool is a set of
physical storage devices in a Protection Domain. Each storage device belongs to one (and only one) Storage Pool.
When a Protection Domain is generated, it has one Storage Pool by default.
Storage Pools are mostly used to group drives based on drive types and drive speeds, e.g. SSD and HDD.
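The membership rules above (each SDS belongs to exactly one Protection Domain, each device to exactly one Storage Pool) can be captured in a small data-model sketch; the class, pool, and device names are illustrative only:

    # Illustrative data model for the membership rules described above: an SDS
    # belongs to exactly one Protection Domain, and each device is registered
    # in a single Storage Pool within that domain (one default pool at creation).
    class ProtectionDomain:
        def __init__(self, name):
            self.name = name
            self.sdss = []
            self.storage_pools = {"default": []}

        def add_sds(self, sds_name):
            self.sdss.append(sds_name)

        def add_device(self, device, pool="default"):
            self.storage_pools.setdefault(pool, []).append(device)

    if __name__ == "__main__":
        pd = ProtectionDomain("pd-1")
        pd.add_sds("sds-1"); pd.add_sds("sds-2")
        pd.add_device("sds-1:/dev/sdb", pool="ssd_pool")   # tier by drive type
        pd.add_device("sds-2:/dev/sdc", pool="hdd_pool")
        print(pd.storage_pools)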
FAULT SET
In many cases, data centers are designed such that a unit of failure may consist of more than a single node. An
example use case is a rack that contains several SDSs, where the customer wants to protect the environment from a
situation in which the whole rack fails or is lost to a power outage or some other disaster.
A Fault Set prevents mirrored chunks from residing in the same Fault Set. A minimum of 3 Fault Sets is required per
Protection Domain. Deploying Fault Sets prevents both copies of data from being written to SDSs in the same
Fault Set, which ensures that one copy of the data remains available in the event that an entire Fault Set fails.
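A simplified placement check conveys the idea: the two copies of a chunk must land on SDSs in different Fault Sets. The following sketch is illustrative only and does not reflect ScaleIO's actual placement algorithm:

    # Simplified mirror-placement sketch: never place both copies of a chunk on
    # SDSs in the same Fault Set. The SDS-to-rack mapping is hypothetical.
    import itertools

    SDS_FAULT_SET = {
        "sds-1": "rack-A", "sds-2": "rack-A",
        "sds-3": "rack-B", "sds-4": "rack-C",
    }

    def valid_mirror_pairs(sds_fault_set):
        """Yield (primary, secondary) pairs whose Fault Sets differ."""
        for primary, secondary in itertools.permutations(sds_fault_set, 2):
            if sds_fault_set[primary] != sds_fault_set[secondary]:
                yield primary, secondary

    if __name__ == "__main__":
        pairs = list(valid_mirror_pairs(SDS_FAULT_SET))
        assert ("sds-1", "sds-2") not in pairs     # same rack, so not allowed
        print(f"{len(pairs)} placements keep the two copies in different Fault Sets")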
Figure 14 - Fault set data distribution
Figure 15 - Example Configuration; Fault Sets, Storage Pools and Protection Domain
SNAPSHOTS
The ScaleIO storage system enables users to take snapshots of existing volumes, up to 31 per volume. The
snapshots are thinly provisioned and are extremely quick. Once a snapshot is generated, it becomes a new
unmapped volume in the system. Users manipulate snapshots in the same manner as any other volume exposed to
the ScaleIO storage system.
Figure 16 - Snapshot operations
The structure in Figure 16 relates to all the snapshots resulting from one volume and is referred to as a VTree (or
Volume Tree). It is a tree with the source volume as its root, whose descendants are either snapshots of the
volume itself or snapshots of those snapshots.
Each volume therefore has a construct called a VTree, which holds the volume and all snapshots associated with it.
The limit on a VTree is 32 members: 1 is taken by the original volume and the remaining 31 are available for
snapshots.
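A minimal sketch of the VTree limit follows, using the 32-member cap described above (one source volume plus up to 31 snapshots); the structure and names are illustrative:

    # Illustrative VTree sketch: a source volume plus its snapshots form one
    # tree, capped at 32 members (so up to 31 snapshots per source volume).
    class VTree:
        MAX_MEMBERS = 32

        def __init__(self, source_volume):
            self.members = {source_volume: None}   # volume -> parent (root has none)

        def snapshot(self, parent, name):
            if parent not in self.members:
                raise ValueError(f"{parent} is not in this VTree")
            if len(self.members) >= self.MAX_MEMBERS:
                raise RuntimeError("VTree limit of 32 members reached")
            self.members[name] = parent            # snapshot of a volume or a snapshot
            return name

    if __name__ == "__main__":
        tree = VTree("vol1")
        snap1 = tree.snapshot("vol1", "vol1.snap1")
        tree.snapshot(snap1, "vol1.snap1.snap1")   # snapshots of snapshots stay in the tree
        print(len(tree.members) - 1, "snapshots used of 31 available")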
THROTTLING
ScaleIO allows users to change (or throttle) certain parameters in order to set higher priorities for some operations
over others. The most common use case is to slow down a rebuild/rebalance operation which can help reduce the
impact on host IOs.
Rebalance and rebuild operations can be throttled separately.
Rebalance/Rebuild throttling parameters: Setting these parameters allows users to set the rebalance/rebuild I/O
priority policy for a Storage Pool. It determines the priority policy that will be imposed to favor application I/O over
rebalance/rebuild I/O.
No Limit: No limit on rebalance/rebuild I/Os. This option helps complete the rebuild/rebalance as soon as possible, but
may have an impact on the host applications.
Limit Concurrent I/O: Limits the number of concurrent rebalance/rebuild I/Os per SDS device (a conceptual sketch of this policy follows the list).
Favor Application I/O: Limits rebalance/rebuild in both bandwidth and concurrent I/Os.
Dynamic Bandwidth Throttling: Limits rebalance/rebuild bandwidth and concurrent I/Os according to device
I/O thresholds. This option helps increase the rebalance/rebuild rate when the host application workload
is low.
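As an illustration, the Limit Concurrent I/O policy can be thought of as a per-device counter of in-flight background IOs; the sketch below is conceptual only and does not represent the product's scheduler:

    # Conceptual sketch of the "Limit Concurrent I/O" policy described above:
    # rebuild/rebalance IOs are admitted only while the per-device count of
    # in-flight background IOs stays under the configured limit.
    class ConcurrentIoLimiter:
        def __init__(self, max_concurrent_per_device: int):
            self.limit = max_concurrent_per_device
            self.in_flight = {}                    # device -> active background IOs

        def try_start(self, device: str) -> bool:
            if self.in_flight.get(device, 0) >= self.limit:
                return False                       # defer: favor application I/O
            self.in_flight[device] = self.in_flight.get(device, 0) + 1
            return True

        def finish(self, device: str) -> None:
            self.in_flight[device] -= 1

    if __name__ == "__main__":
        limiter = ConcurrentIoLimiter(max_concurrent_per_device=1)
        print(limiter.try_start("sds-1:/dev/sdb"))  # True: first background IO admitted
        print(limiter.try_start("sds-1:/dev/sdb"))  # False: limit reached, IO deferred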
SCALEIO MANAGEMENT
Users manage ScaleIO in various ways including the CLI, the REST API, the vSphere plug-in for ESX, and the ScaleIO
GUI. Other tools including ViPR-C and ViPR SRM are integrated and capable of managing a ScaleIO system.
The ScaleIO command line interface or, scli, allows users to log into a ScaleIO system to create, manage and
monitor various system components including protection domains, the MDM, SDS, SDC, storage pools, volumes
and more.
The scli "--help" command provides information on syntax and usage for all ScaleIO commands.
The REST API for ScaleIO is serviced from the ScaleIO Gateway (which includes the REST gateway).
The ScaleIO Gateway connects to a single MDM and services requests by querying the MDM, and reformatting the
answers it receives from the MDM in a RESTful manner, back to a REST client. Every ScaleIO scli command is also
available in the ScaleIO REST API. Responses returned by the Gateway are formatted in JSON format. The API is
available as part of the ScaleIO Gateway package. If the ScaleIO Installation Manager was used to install ScaleIO,
the Gateway has already been installed and configured with the MDM details.
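As an illustration of how a REST client might interact with the Gateway, the sketch below authenticates and issues a query. The Gateway address, credentials, endpoint paths and response handling are assumptions made for the example and should be verified against the ScaleIO REST API documentation for your release:

    # Illustrative REST client sketch. The Gateway address, credentials, and
    # endpoint paths are assumptions for the example; consult the ScaleIO REST
    # API documentation for the exact URIs and authentication flow in your release.
    import requests

    GATEWAY = "https://scaleio-gateway.example.com"          # hypothetical address

    def query_system(username: str, password: str):
        # Authenticate against the Gateway (login path assumed for illustration).
        token = requests.get(f"{GATEWAY}/api/login",
                             auth=(username, password), verify=False).json()
        # Reuse the returned token as the password for subsequent queries.
        resp = requests.get(f"{GATEWAY}/api/types/System/instances",
                            auth=(username, token), verify=False)
        return resp.json()                                    # JSON-formatted answer

    if __name__ == "__main__":
        print(query_system("admin", "password"))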
VMware provides a plug-in that allows users to view and provision ScaleIO components. The plug-in communicates
with the MDM and the vSphere server enabling users to view components and perform many
configuration/provisioning tasks right from within the VMware environment.
To use the plug-in, it must be registered in your vCenter. For more information, refer to the ScaleIO Installation
Guide at https://support.emc.com/docu59356_ScaleIO-Installation-Guide-1.32.pdf.
An EMC ScaleIO GUI is available for Windows, Linux and vSphere. The GUI allows installation, monitoring and
management of a ScaleIO system. Figure 17 displays the GUI dashboard providing a complete overview of the
current system state.
For more detailed information on the management tools available for ScaleIO, refer to the ScaleIO User Guide on the
ScaleIO Product Page at https://community.emc.com/docs/DOC-45035.
CONCLUSIONS
ScaleIO is software-defined storage that delivers a full suite of storage services and uses commodity hardware built
on off-the-shelf components and products. ScaleIO has no vendor-specific hardware dependencies, is able to run
on any commodity server, is supported on nearly any operating system and/or hypervisor, and can leverage existing
and future datacenter infrastructures.
Leading use case scenarios include cloud-based platforms built on ScaleIO to provide consumer and enterprise
applications to support banking, billing and much more. Managed Service Providers have implemented ScaleIO in
order to eliminate vendor lock-in, and grow with a solution that allows the use of any flash or HDD for storage, and
with any kind of server.
In short, ScaleIO simplifies data center operations, making them flexible and efficient.