Unit 5 Scaling

This document discusses scaling in cloud computing, focusing on resource provisioning, load balancing, and security. It outlines various scaling strategies, including proactive, reactive, and combinational scaling, as well as the concept of auto scaling and its implementation in cloud environments. The document emphasizes the importance of cloud elasticity, cost optimization, and the need for effective scaling techniques to manage variable traffic loads efficiently.

Uploaded by

yadavrajani3396


RESOURCE PROVISIONING, LOAD BALANCING AND SECURITY

UNIT 5 SCALING

Structure:-

5.1 Introduction
5.2 Objectives
5.3 Scaling primitives
5.4 Scaling Strategies
5.4.1 Proactive Scaling
5.4.2 Reactive Scaling
5.4.3 Combinational Scaling
5.5 Auto Scaling in Cloud
5.6 Types of Scaling
5.6.1 Vertical Scaling or Scaling Up
5.6.2 Horizontal Scaling or Scaling Out

5.1 INTRODUCTION

Scalability in cloud computing refers to the flexibility of allocating IT
resources according to demand. Applications running on cloud instances
experience variable traffic loads, and hence the need for scaling arises. Such
applications may need different kinds of resources, such as CPU allocation,
memory expansion, storage and networking. Virtual machines are one of the best
ways to address these requirements. Each virtual machine is equipped with a
minimum configuration of CPU, memory and storage, and can be reconfigured as
and when required to perform better under the target load. Managing such
on-demand configuration by hand is often difficult, so auto scaling techniques
play an important role.

In this unit we will focus on the various methods and algorithms used in the
process of scaling. We will discuss the types of scaling, their usage and a
few examples. We will also discuss how scaling techniques save cost and manual
effort in highly dynamic situations. The suitability of scaling techniques in
different scenarios is also discussed in detail.

Scaling relies on the elastic property of the cloud, which is introduced in
Section 5.3.

5.2 OBJECTIVES

After going through this unit you should be able to:
➔ describe scaling and its advantages;

➔ understand the different scaling techniques;

➔ learn about the scaling up and scaling down approaches;

➔ understand the basics of auto scaling; and

➔ compare proactive and reactive scaling.

5.3 SCALING PRIMITIVES

The basic purpose of scaling is to let an application use as much cloud
computing infrastructure as it requires. Cloud resources are added or removed
according to the current need of the application. The property of increasing
or reducing resources in the cloud is referred to as cloud elasticity, and the
process is known as scaling; scaling exploits the elastic property of the
cloud. The scalability of the cloud architecture is achieved using
virtualization (see Unit 3: Resource Virtualization). Virtualization uses
virtual machines (VMs) to enhance (up scaling) and reduce (down scaling)
computing power. Scaling gives businesses the opportunity to grow onto a more
secure, available and need based computing and storage facility in the cloud.
Scaling also helps small to medium enterprises optimize the cost of highly
resource bound applications.
The key advantages of cloud scaling are: -

1. Minimum cost: The user pays a minimum cost for using the extra
hardware after upscaling. Buying hardware of the same scale can cost
much more than what the user pays, and the user bears no maintenance
or other overheads. Further, when the resources are no longer
required, they can be returned to the service provider, which saves
cost.

2. Ease of use: Cloud upscaling and downscaling can be done in just a
few minutes (sometimes dynamically) using the service provider's
application interface.

3. Flexibility: Users have the flexibility to enable or disable certain
VMs themselves for upscaling and downscaling, which saves the
configuration and installation time that separately purchased hardware
would need.

4. Recovery: The cloud environment itself reduces the chance of disaster
and improves the recovery of information stored in the cloud.


The scalability of clouds aims to optimize the utilization of resources under
varying workload conditions, avoiding both under provisioning and over
provisioning. In non-cloud environments, resource utilization is a major
concern because one has no control over scaling. Various methods exist in the
literature for scaling in traditional environments. In general, a peak is
forecast and the infrastructure is set up in advance accordingly. Such scaling
experiences high latency and requires manual monitoring. The drawback of this
setup is serious: the estimate of the maximum load can be wrong in either
direction, producing systems that are either over-configured or poorly
configured.
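The cost of a wrong peak forecast can be illustrated with a small sketch. The workload numbers and the function name `provisioning_gap` below are hypothetical and chosen only for illustration; the sketch simply adds up the capacity that sits idle and the demand that cannot be served when the capacity is fixed in advance.

```python
def provisioning_gap(workload, capacity):
    """Return (wasted, unmet) capacity units summed over all time steps."""
    # Capacity that sits idle whenever the load is below the fixed capacity.
    wasted = sum(max(0, capacity - w) for w in workload)
    # Demand that cannot be served whenever the load exceeds the capacity.
    unmet = sum(max(0, w - capacity) for w in workload)
    return wasted, unmet


# Hypothetical hourly workload, in arbitrary capacity units.
workload = [20, 35, 80, 120, 60, 30]

print(provisioning_gap(workload, capacity=100))  # peak estimated too low
print(provisioning_gap(workload, capacity=150))  # peak estimated too high
```

With the lower estimate some demand goes unserved, and with the higher estimate nothing is unserved but far more capacity is wasted.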

In the case of clouds, virtual environments are used for resource allocation.
Virtual machines make clouds elastic: they can be configured in real time
according to the workload of the applications. In such scenarios, downtime is
minimized and scaling is easy to achieve.

[Figure 1. Manual scaling in traditional environments (cost and workload plotted against time, with checkpoints)]

[Figure 2. Semi-automatic scaling in cloud environments (cost and workload plotted against time, with checkpoints)]

Scaling also saves the cost of setting up hardware for short-lived peaks or
dips in load. In general, most cloud service providers offer scaling as a
process for free and charge only for the additional resources used. Scaling is
a common service on almost all cloud platforms. Users also save money when
resource usage declines, because they can scale down.

5.4 SCALING STRATEGIES

Let us now see what the strategies for scaling are, how one can achieve scaling
in a cloud environment and what its types are. In general, scaling is
categorized by how the decision to scale is taken. The three main strategies
for scaling are discussed below.

5.4.1 Proactive Scaling

Consider a scenario in which a huge surge in traffic is expected on one of the
applications in the cloud. Here, proactive scaling is used to cater for the
load. Proactive scaling can also be scheduled in advance according to the
expected traffic and demand. It requires an understanding of the traffic flow
in advance so that resources are used to the maximum; wrong estimates
generally lead to poor resource management. Prior knowledge of the load helps
in provisioning the cloud better, so end users experience minimum lag when the
sudden load arrives. The figure below shows resource provisioning as the load
increases with time.

[Figure: resource provisioning under proactive scaling (Load against Time of Day)]
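Proactive scaling can be sketched as a schedule computed from a traffic forecast. Everything in this sketch is an assumption made for illustration: the forecast values, the per-node capacity of 300 requests per second, the 10% headroom and the helper name `nodes_for_forecast` are not taken from any particular cloud platform.

```python
import math


def nodes_for_forecast(forecast_rps, rps_per_node, headroom=0.1):
    """Nodes to provision ahead of time for a forecast load, with headroom."""
    return math.ceil(forecast_rps * (1 + headroom) / rps_per_node)


# Hypothetical forecast: hour of day -> expected requests per second.
forecast = {9: 400, 12: 1800, 18: 2500, 23: 300}

# The schedule is fixed before the traffic arrives, so capacity is ready in time.
schedule = {hour: nodes_for_forecast(rps, rps_per_node=300)
            for hour, rps in forecast.items()}
print(schedule)
```

If the forecast is wrong, the schedule is wrong too, which is the weakness of a purely proactive strategy.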

5.4.2 Reactive Scaling

Reactive scaling monitors the workload and lets the system follow workload
changes smoothly at minimum cost. It allows users to scale computing resources
up or down rapidly. In simple words, when a resource such as CPU or RAM
reaches its highest utilization, the service provider adds more of that
resource to the environment. Auto scaling works on the policies that users or
resource managers define for traffic and scaling. One major concern with
reactive scaling is a quick change in load: users experience lag while the
infrastructure is being scaled.
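A reactive policy can be sketched as a simple threshold check on a monitored metric. The thresholds of 80% and 30% CPU utilization and the function name `reactive_decision` are assumptions for illustration only; real policies are defined by the user or the resource manager.

```python
def reactive_decision(cpu_utilization, scale_up_at=0.80, scale_down_at=0.30):
    """Return +1 (add capacity), -1 (remove capacity) or 0 (no change)."""
    if cpu_utilization >= scale_up_at:
        return 1
    if cpu_utilization <= scale_down_at:
        return -1
    return 0


# The decision is taken only after the load has already changed.
for observed in (0.92, 0.55, 0.12):
    print(observed, reactive_decision(observed))
```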

[Figure: resource provisioning under reactive scaling (Load against Time of Day)]

5.4.3 Combinational Scaling

Till now we have seen need based and forecast based scaling techniques for
scaling. However, for better performance and a low cool down period we can
also combine both the reactive and proactive scaling strategies where we
have some prior knowledge of traffic. This helps us in scheduling timely
scaling strategies for the expected load. On the other hand, we also have the
provision of load based scaling apart from the predicted load on the
application. This way both the problems of sudden and expected traffic surges
are addressed.

Given below is the comparison between proactive and reactive scaling
strategies.

| Parameters     | Proactive Scaling                                            | Reactive Scaling                                               |
|----------------|--------------------------------------------------------------|----------------------------------------------------------------|
| Suitability    | For applications increasing loads in expected/ known manner  | For applications increasing loads in unexpected/ unknown manner |
| Working        | User sets the threshold but a downtime is required.          | User defined threshold values optimize the resources           |
| Cost Reduction | Medium cost reduction                                        | Medium cost reduction                                          |
| Implementation | A few steps required                                         | Fixed number of steps required                                 |
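Combinational scaling can be sketched as taking the scheduled (proactive) capacity as a baseline and letting a load based (reactive) rule raise it when the observed load is higher than predicted. The function name `combined_target` and all numbers are hypothetical.

```python
def combined_target(scheduled_nodes, current_nodes, rps_per_node,
                    scale_up_threshold, step=1):
    """Node count that satisfies both the schedule and the observed load."""
    target = scheduled_nodes  # proactive part: what the forecast asked for
    if rps_per_node > scale_up_threshold:
        # Reactive part: the observed load is above the threshold, so add nodes.
        target = max(target, current_nodes + step)
    return target


print(combined_target(6, 6, 250, 300))   # load as predicted, keep the schedule
print(combined_target(6, 6, 350, 300))   # unexpected surge, add one node
print(combined_target(10, 6, 350, 300))  # schedule already asks for more
```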

Check your Progress 1

1) Explain the importance of scaling in cloud computing?

…………………………………………………………………………
…………………………………………………………………………
…………………………………………………………………………

2) How is proactive scaling achieved through virtualization?

3) Write differences between combinational and reactive scaling.

…………………………………………………………………………………………

5.5 AUTO SCALING IN CLOUD

One of the potential risks in scaling a cloud infrastructure is the magnitude
of scaling. If we scale down too far, throughput and latency suffer; high
latency degrades the user experience and causes dissatisfaction. On the other
hand, if we scale up too far, resources are no longer optimized and the cost
is high, which defeats the whole purpose of cost optimization.

In a cloud, auto scaling can be achieved using user defined policies, machine
health checks and schedules. Parameters such as request count, CPU usage and
latency are the key inputs for decision making in auto scaling. A policy here
is the set of instructions the cloud follows in a particular scenario (for
scaling up or scaling down). Auto scaling in the cloud is done on the basis of
the following parameters.


1. The number of instances by which to scale.

2. Whether that number is absolute or a percentage (of the current capacity).

Auto scaling also requires a cooldown period before services resume after a
scaling action. No two scaling actions are triggered concurrently, so as to
maintain integrity. The cooldown period gives each scaling action a specified
time interval to take effect in the system and prevents integrity issues in
the cloud environment.
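The cooldown rule can be sketched as a small gate that rejects any scaling request arriving before the previous action has had time to take effect. The class name `CooldownGate` and the 300 second interval are assumptions for illustration, and time is passed in explicitly so the behaviour is easy to follow.

```python
class CooldownGate:
    """Allows a scaling action only when no cooldown is in progress."""

    def __init__(self, cooldown_seconds):
        self.cooldown_seconds = cooldown_seconds
        self.last_action_at = None  # no scaling action has happened yet

    def try_scale(self, now):
        """Return True and start a new cooldown if scaling is allowed at `now`."""
        cooling_down = (self.last_action_at is not None
                        and now - self.last_action_at < self.cooldown_seconds)
        if cooling_down:
            return False
        self.last_action_at = now
        return True


gate = CooldownGate(cooldown_seconds=300)
print(gate.try_scale(now=0))    # first request is accepted
print(gate.try_scale(now=120))  # rejected, still cooling down
print(gate.try_scale(now=300))  # accepted, the cooldown is over
```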

[Figure 4. Automatic scaling in cloud environments (cost and workload plotted against time)]

Consider a more specific scenario in which the resource requirement is high
for a known duration, e.g. on holidays and weekends. Here scheduled scaling
can be performed: the time and the scale (magnitude or threshold) of scaling
are defined in advance to meet the specific requirements, based on previous
knowledge of the traffic. The threshold level is also an important parameter
in auto scaling: a low threshold results in underutilization of cloud
resources, and a high threshold results in higher latency in the cloud.

If the incoming requests per second per node drop below the scale-down
threshold after nodes are added in a scale-up, a scale-down is triggered,
which may in turn trigger another scale-up. This alternation of scale-up and
scale-down is known as the ping-pong effect. To avoid both underscaling and
overscaling, load testing is recommended so that the service level agreements
(SLAs) are met. In addition, the scaling process is required to satisfy the
following properties.

1. After a scale-up, the number of incoming requests per second per node >
threshold of scale down.
2. After a scale-down, the number of incoming requests per second per node <
threshold of scale up.

Satisfying both properties reduces the chance of a ping-pong effect.
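The two properties can be checked before a scaling step is applied. The sketch below assumes the total load is shared equally by all nodes; the function names and the example values (scale-up threshold 290, scale-down threshold 145) are hypothetical.

```python
def scale_up_is_stable(total_rps, nodes, step, t_down):
    """After adding `step` nodes, per-node RPS must stay above the scale-down threshold."""
    return total_rps / (nodes + step) > t_down


def scale_down_is_stable(total_rps, nodes, step, t_up):
    """After removing `step` nodes, per-node RPS must stay below the scale-up threshold.

    Assumes step < nodes, so at least one node remains.
    """
    return total_rps / (nodes - step) < t_up


print(scale_up_is_stable(1200, nodes=4, step=2, t_down=145))   # 200 per node
print(scale_up_is_stable(1200, nodes=4, step=5, t_down=145))   # about 133 per node
print(scale_down_is_stable(500, nodes=4, step=2, t_up=290))    # 250 per node
print(scale_down_is_stable(500, nodes=4, step=3, t_up=290))    # 500 per node
```

A step that fails its check would immediately trigger the opposite action, which is exactly the ping-pong effect.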

Now we know what scaling is and how it affects the applications hosted on the
cloud. Let us now discuss how auto scaling can be performed in fixed amounts
as well as in percentage of the current capacity.

Fixed amount autoscaling

As discussed earlier, auto scaling can be achieved by scaling by a fixed
number of instances. The detailed algorithm for fixed amount autoscaling is
given below. The algorithm works for both scaling up and scaling down, and
takes the inputs U and D for the two respectively.

--------------------------------------------------------------------------------------------
Algorithm : 1
--------------------------------------------------------------------------------------------
Input : SLA specific application
Parameters:
N_min - minimum number of nodes
D - scale down value
U - scale up value
T_U - scale up threshold
T_D - scale down threshold

Let T (SLA) return the maximum incoming requests per second (RPS) per node
for the specific SLA.

T_U ← 0.90 x T (SLA)

T_D ← 0.50 x T_U

Let N_c and RPS_n represent the current number of nodes and incoming
requests per second per node respectively.

L1: /* scale up (if RPS_n > T_U) */

Repeat:
N_(c_old) ← N_c
N_c ← N_c + U
RPS_n ← RPS_n x N_(c_old) / N_c
Until RPS_n ≤ T_U

L2: /* scale down (if RPS_n < T_D) */

Repeat:
N_(c_old) ← N_c
N_c ← max(N_min, N_c - D)
RPS_n ← RPS_n x N_(c_old) / N_c
Until RPS_n ≥ T_D or N_c = N_min
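A minimal Python sketch of fixed amount autoscaling is given here. It is an illustration under stated assumptions, not a definitive implementation: it scales up while RPS_n is above T_U and scales down while RPS_n is below T_D, it uses `while` loops so that nothing happens when the condition is already satisfied, and the SLA limit of 400 requests per second per node is a hypothetical value.

```python
def scale_up_fixed(rps_n, n_c, u, t_u):
    """Add u nodes at a time while per-node RPS is above t_u."""
    while rps_n > t_u:
        n_old = n_c
        n_c = n_c + u
        rps_n = rps_n * n_old / n_c  # same total load, shared by more nodes
    return n_c, rps_n


def scale_down_fixed(rps_n, n_c, d, t_d, n_min):
    """Remove d nodes at a time while per-node RPS is below t_d, never below n_min."""
    while rps_n < t_d and n_c > n_min:
        n_old = n_c
        n_c = max(n_min, n_c - d)
        rps_n = rps_n * n_old / n_c  # same total load, shared by fewer nodes
    return n_c, rps_n


t_sla = 400        # hypothetical maximum RPS per node allowed by the SLA
t_u = 0.90 * t_sla  # scale up threshold
t_d = 0.50 * t_u    # scale down threshold

print(scale_up_fixed(rps_n=450, n_c=4, u=2, t_u=t_u))
print(scale_down_fixed(rps_n=100, n_c=10, d=2, t_d=t_d, n_min=1))
```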

Now, let us discuss how this algorithm works in detail. Let the parameter
values be U = 2, D = 2, T_U = 120 and T_D = 150. Suppose that in the beginning
RPS = 450 and N_c = 4. When RPS increases to 1800 and RPS_n reaches T_U, an
autoscaling request is generated that adds U = 2 nodes. Table 1 lists all the
parameters for the scale-up.

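| Nodes (current) | Nodes (added) | RPS (required) | RPS_n | Total nodes | New RPS_n |
|-----------------|---------------|----------------|-------|-------------|-----------|
| 4               | 0             | 450            | 112.5 | 4           |           |
|                 | 2             | 1800           |       | 6           | 300       |
|                 | 2             | 2510           |       | 8           | 313.75    |
|                 | 2             | 3300           |       | 10          | 330.00    |
|                 | 2             | 4120           |       | 12          | 343.33    |
|                 | 2             | 5000           |       | 14          | 357.14    |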

Similarly, in the case of scaling down, let RPS = 8000 and N_c = 19 initially.
When RPS reduces to 6200 and RPS_n reaches T_D, an autoscaling request is
initiated that deletes D = 2 nodes. Table 2 lists all the parameters for the
scale-down.

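| Nodes (current) | Nodes (reduced) | RPS (required) | RPS_n  | Total nodes | New RPS_n |
|-----------------|-----------------|----------------|--------|-------------|-----------|
| 19              | 0               | 8000           | 421.05 | 19          |           |
|                 | 2               | 6200           |        | 17          | 364.7     |
|                 | 2               | 4850           |        | 15          | 323.33    |
|                 | 2               | 3500           |        | 13          | 269.23    |
|                 | 2               | 2650           |        | 11          | 240.90    |
|                 | 2               | 1900           |        | 9           | 211.11    |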
The tables above show the stepwise increase and decrease in cloud capacity
with respect to the change in load on the application (requests per second per
node).
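The "New RPS_n" column of the tables above is simply the total RPS divided by the total number of nodes after the step. The short sketch below recomputes the scale-up rows; the function name `rps_per_node` is introduced only for this illustration.

```python
def rps_per_node(total_rps, total_nodes):
    """Per-node load after scaling: total RPS shared equally by all nodes."""
    return total_rps / total_nodes


# (RPS required, total nodes after adding 2) taken from the scale-up table.
scale_up_rows = [(1800, 6), (2510, 8), (3300, 10), (4120, 12), (5000, 14)]

for total_rps, total_nodes in scale_up_rows:
    print(total_rps, total_nodes, round(rps_per_node(total_rps, total_nodes), 2))
```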

Percentage Scaling:

In the previous section we discussed how scaling up or down is carried out by
a fixed number of nodes. Alternatively, we can scale up or down by a
percentage of the current capacity. This is a more natural way of scaling, as
we are already running at some capacity.

The algorithm given below is used to determine the scale up and scale down
thresholds for this kind of autoscaling.

-----------------------------------------------------------------------------------------------
Algorithm : 2
-----------------------------------------------------------------------------------------------
Input : SLA specific application
Parameters:
N_min - minimum number of nodes
D - scale down value
U - scale up value
T_U - scale up threshold
T_D - scale down threshold

Let T (SLA) return the maximum requests per second (RPS) per node for the
specific SLA.

T_U ← 0.90 x T (SLA)

T_D ← 0.50 x T_U

Let N_c and RPS_n represent the current number of nodes and incoming
requests per second per node respectively.

L1: /* scale up (if RPS_n > T_U) */

Repeat:
N_(c_old) ← N_c
N_c ← N_c + max(1, N_c x U/100)
RPS_n ← RPS_n x N_(c_old) / N_c
Until RPS_n ≤ T_U

L2: /* scale down (if RPS_n < T_D) */

Repeat:
N_(c_old) ← N_c
N_c ← max(N_min, N_c - max(1, N_c x D/100))
RPS_n ← RPS_n x N_(c_old) / N_c
Until RPS_n ≥ T_D or N_c = N_min
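Percentage scaling can be sketched in Python in the same way as the fixed amount case. Two assumptions are made here and are not stated in the algorithm: the percentage step is rounded down to a whole number of nodes, and `while` loops are used so that nothing happens when the condition is already satisfied. The function names are hypothetical.

```python
import math


def percent_step(n_c, percent):
    """Whole nodes to add or remove: percent of current capacity, at least 1.

    Rounding down is an assumption made for this sketch.
    """
    return max(1, math.floor(n_c * percent / 100))


def scale_up_percent(rps_n, n_c, u, t_u):
    """Add u percent of the current nodes at a time while RPS_n is above t_u."""
    while rps_n > t_u:
        n_old = n_c
        n_c = n_c + percent_step(n_c, u)
        rps_n = rps_n * n_old / n_c
    return n_c, rps_n


def scale_down_percent(rps_n, n_c, d, t_d, n_min):
    """Remove d percent of the current nodes at a time while RPS_n is below t_d."""
    while rps_n < t_d and n_c > n_min:
        n_old = n_c
        n_c = max(n_min, n_c - percent_step(n_c, d))
        rps_n = rps_n * n_old / n_c
    return n_c, rps_n


print(scale_up_percent(rps_n=400, n_c=6, u=1, t_u=290))
print(scale_down_percent(rps_n=200, n_c=19, d=8, t_d=230, n_min=1))
```

With small clusters the percentage step is often below one node, so the `max(1, ...)` term makes the algorithm behave like fixed amount scaling with a step of 1.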

Let us now understand the working of this algorithm with an example. Let
N_min = 1, and at the beginning RPS = 500 and N_c = 6. Now the demand rises,
RPS reaches 1540 and RPS_n reaches T_U. An upscaling is requested that adds
1 node, i.e. max(1, 6 x 1/100) with U = 1.

Similarly, in the case of scaling down, initially RPS = 5000 and N_c = 19.
RPS then reduces to 4140 and RPS_n reaches T_D, which requests a scale-down
and hence deletes 1 node, i.e. max(1, 19 x 8/100) rounded down to a whole
node, with D = 8. The detailed example is given in Table 3, which shows
upscaling with D = 8, U = 1, N_min = 1, T_D = 230 and T_U = 290.

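| Nodes (current) | Nodes (added) | RPS (required) | RPS_n | Total nodes | New RPS_n |
|-----------------|---------------|----------------|-------|-------------|-----------|
| 6               | 0             | 500            | 83.33 | 6           |           |
|                 | 1             | 1695           |       | 7           | 242.14    |
|                 | 1             | 2190           |       | 8           | 273.75    |
|                 | 1             | 2600           |       | 9           | 288.88    |
|                 | 1             | 3430           |       | 10          | 343.00    |
|                 | 1             | 3940           |       | 11          | 358.18    |
|                 | 1             | 4420           |       | 12          | 368.33    |
|                 | 1             | 4960           |       | 13          | 381.53    |
|                 | 1             | 5500           |       | 14          | 392.85    |
|                 | 1             | 5950           |       | 15          | 396.6     |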

The scaling down with the same algorithm is detailed in the table below.

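| Nodes (current) | Nodes (reduced) | RPS (required) | RPS_n  | Total nodes | New RPS_n |
|-----------------|-----------------|----------------|--------|-------------|-----------|
| 19              |                 | 5000           | 263.15 | 19          |           |
|                 | 1               | 3920           |        | 18          | 217.77    |
|                 | 1               | 3510           |        | 17          | 206.47    |
|                 | 1               | 3200           |        | 16          | 200       |
|                 | 1               | 2850           |        | 15          | 190       |
|                 | 1               | 2600           |        | 14          | 185.71    |
|                 | 1               | 2360           |        | 13          | 181.53    |
|                 | 1               | 2060           |        | 12          | 171.66    |
|                 | 1               | 1810           |        | 11          | 164.5     |
|                 | 1               | 1500           |        | 10          | 150       |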

Comparing Algorithms 1 and 2, it is clear that the threshold values T_U and
T_D are on the higher side in the case of Algorithm 2. In this scenario the
hardware is utilized more and the cloud has a smaller footprint.

Check your Progress 2

1) Explain the concept of fixed amount auto scaling.
…………………………………………………………………………
…………………………………………………………………………
…………………………………………………………………………

2) In Algorithm 1 for fixed amount auto scaling, calculate the values in table
if U = 3.
…………………………………………………………………………
…………………………………………………………………………
…………………………………………………………………………

3) What is a cool down period?

…………………………………………………………………………………………

5.6 TYPES OF SCALING

Let us now discuss the types of scaling, i.e. how the cloud infrastructure is
viewed when capacity is enhanced or reduced. In general, we scale the cloud
vertically or horizontally, either by provisioning more resources to existing
machines or by installing more machines.

5.6.1 Vertical scaling or scaling up

Vertical scaling in the cloud refers to either scaling up, i.e. enhancing the
computing resources, or scaling down, i.e. reducing the computing resources,
for an application. In vertical scaling, the number of VMs stays constant but
the quantity of resources allocated to each of them is increased or
decreased. No infrastructure is added and the application code is not
changed. Vertical scaling is limited by the capacity of the physical machine
or server running in the cloud. If the hardware of an existing cloud
environment has to be upgraded, this can be achieved with minimum changes.

[Figure: vertical scaling. Server A (2 CPUs) is replaced by server B (4 CPUs).]

An IT resource (a virtual server with two CPUs) is scaled up by replacing it with a more
powerful IT resource with increased capacity for data storage (a physical server with four CPUs).

5.6.2 Horizontal scaling or scaling out

In horizontal scaling, additional resources are added to the cloud environment
to meet user requirements for high availability. Here, resources are added or
removed as VMs. This includes adding storage disks, new servers for more CPUs,
or additional RAM, all of which work together like a single system. Horizontal
scaling requires a minimum downtime. This type of scaling allows one to run
distributed applications more efficiently.

[Figure: horizontal scaling. As demand grows, virtual servers B and C are added alongside A on the pooled physical servers.]

An IT resource (Virtual Server A) is scaled out by adding more of the same IT resources (Virtual Servers B and C).

Another way of maximizing resource utilization is diagonal scaling, which
combines the ideas of vertical and horizontal scaling. Here, a resource is
scaled up vertically until it hits the physical resource capacity, and
afterwards new resources are added as in horizontal scaling. The newly added
resources can in turn be scaled vertically.
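Diagonal scaling can be sketched as a simple capacity plan: grow one VM until the host limit is reached, then add further VMs. The function name `diagonal_plan`, the use of CPU counts as the only resource and the host limit of 8 CPUs are assumptions for illustration.

```python
def diagonal_plan(required_cpus, host_cpu_limit):
    """Return per-VM CPU sizes: scale one VM up to the host limit, then add VMs."""
    if required_cpus <= host_cpu_limit:
        return [required_cpus]  # vertical scaling alone is enough
    full_vms, remainder = divmod(required_cpus, host_cpu_limit)
    plan = [host_cpu_limit] * full_vms  # horizontal scaling: VMs at the host limit
    if remainder:
        plan.append(remainder)  # the newest VM can still be scaled up later
    return plan


print(diagonal_plan(6, host_cpu_limit=8))   # fits in one VM
print(diagonal_plan(20, host_cpu_limit=8))  # needs three VMs
```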

SUMMARY

In this unit we have covered the various types of scaling, the scaling
strategies and their use in real situations. Cloud service providers such as
Amazon AWS and Microsoft Azure, and IT giants like Google, offer scaling
services based on the application requirements. These services are a great
help to entrepreneurs who run small to medium businesses and seek IT
infrastructure support. We have also discussed the advantages of cloud scaling
for business applications.

SOLUTION/ANSWERS

Answers to CYPs 1.

1. Explain the importance of scaling in cloud computing: Clouds are used
extensively for serving applications and in other scenarios where the cost and
installation time of infrastructure or capacity scaling would otherwise be high.
Scaling helps in achieving an infrastructure optimized for the current and expected
load of the applications, with minimum cost and setup time. Scaling also helps in
reducing the recovery time if a disaster happens. (For details see Section 5.3.)

2. How is proactive scaling achieved through virtualization: Proactive scaling is
the process of forecasting the load on the cloud infrastructure and managing it in
advance. Precise forecasting of the requirement is the key to success here. The
preparation for the estimated traffic or requirements is done using virtualization.
With virtualization, resources can be assigned to the required machine in very
little time, and the machine can be scaled up to its hardware limits. Virtualization
helps in achieving a low cool down period and serving requests instantly. (For
details, refer to Unit 3: Resource Virtualization.)

3. Write differences between combinational and reactive scaling: The reactive
scaling technique responds only to the actual variation of load on the application,
whereas combinational scaling works for both expected and actual traffic. A good
estimate of the load improves the performance of combinational scaling.

Answers to CYPs 2.

1) Explain the concept of fixed amount auto scaling: Fixed amount scaling is a
simple approach to scaling in a cloud environment. Here the resources are scaled
up or down by a user defined number of nodes. In fixed amount scaling, resource
utilization is not optimized. It can happen that a single small node would solve
the resource crunch but the user defined number is much higher, leading to
underutilized resources. Therefore, scaling by a percentage is a better technique
for optimal resource usage.

2) In Algorithm 1 for fixed amount auto scaling, calculate the values in table if U = 3:
For the given U = 3, the following calculations are made.

| Nodes (current) | Nodes (added) | RPS (required) | RPS_n | Total nodes | New RPS_n |
|-----------------|---------------|----------------|-------|-------------|-----------|
| 4               | 0             | 450            | 112.5 | 4           |           |
|                 | 3             | 1800           |       | 7           | 257.14    |
|                 | 3             | 2510           |       | 10          | 251       |
|                 | 3             | 3300           |       | 13          | 253.84    |
|                 | 3             | 4120           |       | 16          | 257.50    |
|                 | 3             | 5000           |       | 19          | 263.15    |

3) What is a cool down period: When auto scaling takes place in the cloud, a small
time interval (pause) prevents the next auto scaling event from being triggered.
This helps in maintaining the integrity of the cloud environment for applications.
Once the cool down period is over, the next auto scaling event can be accepted.

From Everand
CCSP - Certified Cloud Security Professional Exam Success
SUJAN
No ratings yet
