Load Balancer: What? Why?

A load balancer distributes network or application traffic across multiple servers to improve scalability, availability, and resilience. It sits in front of web servers and uses algorithms to route traffic to servers efficiently. During deployments, it allows routing traffic to servers with new code while others are updated, preventing downtime. It also monitors server health and isolates unhealthy servers, improving overall application availability.


Load Balancer

Contents

● What?
○ Concept
● Why?
○ Deployments (Scalability)
○ “Load Balancing” (Resilience)
○ Higher Availability (Resilience)
○ DDoS (Security)
○ Traffic Compression (Performance)
○ SSL Overhead (Performance)
● How?
○ Reverse proxy
○ Health checks
○ Scale itself
○ Algorithm
● Types?
○ Application (Discussion limited to)
○ DNS
○ Network
● Setup?
○ AWS
■ Listeners
■ Routing
■ Target Group
Script

Scaler pushes a wide variety of updates to the platform every day, be they features or bug fixes.
You are a developer at Scaler Verse. If you do not know what Scaler Verse is, please
check out this link (https://www.scaler.com/scaler-verse/). You want to add a new property to
Scaler Verse. Assume that all the non-technical work like legal, accounting, and marketing is
complete, and we now want users to learn about the new property via a dedicated landing
page. The landing page should allow users to join the waitlist for the property at a nominal
fee of 500. You took your time, built the entire page, and it passed review with flying colours. The
next step is to deploy the landing page on the actual Scaler server.

For now, we will assume that Scaler is running on one server only. You log in to the
server and copy the new code to it, either with git or some other tool.
Independent of the language (Node.js, Django, Ruby, etc.) being used, the running application
must be restarted for the new changes to take effect. Given how huge the platform is, the
application takes a considerable amount of time to boot up, i.e., to load itself into memory.
During that loading time, your application is briefly unavailable, and users receive 502
responses from Scaler.

As traffic scales, you realise that the server cannot withstand the load and is
intermittently going down. To fix this, you work day and night to optimise the code, and you
increase the memory, storage, and processing power of the server to cater to the ever-increasing demand.

Scaler wants to increase the number of users on the platform. To that end, they hold a
competition, and you are informed that traffic may increase by 10x.

DEAD END!!! What to do now?

Of course, you talk to some of your fellow engineers or read something online and find the
solution: install a LOAD BALANCER. It can help you tackle all of the above-mentioned
issues while giving a significant performance boost and saving you a panic attack.

BTW: The architecture assumed above is known as a Single Server Architecture.


Load Balancer
Before diving into what a Load Balancer is and what its features and advantages are, let’s first
think about what our own approach or algorithm for tackling the above issues would be.

What if there were 2 hosts for our application? When we wanted to deploy a newer version of our
application, we could deploy it one by one on both hosts while keeping at least one of
them active at all times. A piece of code would be responsible for managing the traffic based on
specific circumstances; for example, if one of the 2 hosts is not responding for some reason, be
it an application failure or a memory failure, the system could notify us immediately.

The deployment algorithm would follow these steps:


1. For each host of our application:
   a. Mark the host as inactive.
   b. Deploy the new code.
   c. Restart the application.
   d. Perform some basic sanity checks.
   e. Mark the host as active.
2. If a host is marked as inactive, do not serve traffic to that host; conversely, if a
   host stops responding to traffic, mark it as inactive.
3. Only serve traffic from active hosts.

BTW: This piece of code is a Load Balancer.
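
To make the algorithm concrete, here is a minimal Python sketch of the rolling deployment loop. The load balancer object and the helpers (deploy_code, restart_app, sanity_check), as well as the host names, are hypothetical placeholders for whatever your infrastructure actually provides.

# Rolling deployment sketch: take one host out of rotation at a time,
# update it, verify it, and put it back. All helpers are hypothetical.

HOSTS = ["host-1", "host-2"]

def rolling_deploy(lb, version):
    for host in HOSTS:
        lb.mark_inactive(host)         # stop routing traffic to this host
        deploy_code(host, version)     # copy the new code (git pull, rsync, ...)
        restart_app(host)              # restart so the changes take effect
        if sanity_check(host):         # basic smoke tests against the host
            lb.mark_active(host)       # resume routing traffic to it
        else:
            raise RuntimeError(f"{host} failed sanity checks; aborting rollout")

Because hosts are updated strictly one by one, at least one host stays active (and keeps serving traffic) at every moment of the rollout.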

A load balancer is a device that distributes network or application traffic across several servers.
It helps scale horizontally across an ever-increasing number of servers.
Concept
A Load Balancer is a physical/virtual device used to balance the network load across
web servers. A load balancer can sit inside the data centers (DCs) for internal load balancing, but it is
usually placed facing the internet to balance the load across the web servers in the data centers.

From the above diagram, it is clear that the entry point of our application is now the load
balancer and not a server. Now that we have a fleet of servers, we can efficiently
manage and distribute the load or traffic across them.

The entire traffic, irrespective of whether it comes from a handheld device or a desktop system,
arrives directly at the load balancer. The load balancer then sends the traffic to one of the
available servers, based on certain decision-making algorithms. The response from that server
is received by the load balancer and relayed straight back to the client. Of course, we
can make certain adjustments, like a separate load balancer for handheld devices, but for the
sake of simplicity, let’s first discuss the above architecture in detail. Assume for now that the
number of EC2 servers (hosts) is unlimited and that they can be added or removed on
demand.

Deployments
Deployments work more or less the way we discussed before. When we are about to deploy
a new application version, we loop through all of the available hosts, and for each host we
detach it from the load balancer, deploy the new version on it, restart the required
services, and then reattach the host to the load balancer. This beautiful invisible
dance helps push newer updates quickly, reliably, and without downtime.

Resilience
Since the entire traffic flows through the load balancer, what if there were a low-latency
monitoring mechanism that notifies us when traffic increases or decreases? If
we are notified early enough, we can add or remove hosts so that the load stays distributed
evenly across all of them. That is the “Load Balancing”!

The above approach is, however, a bit dangerous, because by then the traffic has already reached the
load balancer. Remember, the LB in our architecture is configured to just blindly push the traffic to one
of the available servers. By the time we add more hosts for our application, it might be
too late. To curb this, we define an upper threshold above which we want our hosts to be
increased and a lower bound below which hosts are slowly removed, as sketched below.
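
A tiny sketch of that threshold logic in Python, assuming a hypothetical average-utilisation metric and hypothetical add_host/remove_host helpers; the bounds themselves are illustrative only.

UPPER = 0.70   # add a host when average utilisation crosses 70% (illustrative)
LOWER = 0.30   # remove a host when it falls below 30% (illustrative)

def autoscale(hosts, avg_utilisation):
    # Hypothetical helpers: add_host/remove_host would call your provider's API.
    if avg_utilisation > UPPER:
        add_host(hosts)       # scale out before the fleet is overwhelmed
    elif avg_utilisation < LOWER and len(hosts) > 1:
        remove_host(hosts)    # scale in gradually, never below one host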

This architecture also takes care of the scenario where some of our hosts go down. If
one or more hosts are down, the LB has a mechanism known as a “Health Check”, by which it
pings the available hosts at regular intervals. If a host is
unreachable, it is marked as inactive or unhealthy and stops receiving traffic.
What if our application has some kind of failover that brings the host back online? Right, the LB
pings inactive servers as well, so it can mark them active again once they are ready to receive
traffic.

Security
What if the LB were intelligent enough to detect whether a received request is malicious or
valid and genuine? What if the LB could automatically reject a request it
recognises as an attack vector?

This is where the security aspect of the LB comes into play. Since the LB is the entry point for all
requests, it reduces the chance of the servers’ IP addresses getting leaked, thereby reducing the
chance of a direct attack on a server. And if the LB is intelligent enough, it can indeed detect and
discard malicious requests.

Performance
Since the LB is the entry point of the traffic, SSL/TLS encryption and decryption are usually
configured directly on the LB to take that overhead off the servers. Yes, right! The traffic
between the users and the LB is encrypted, but the traffic between the LB and the servers is plain and
transmitted directly.
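
A bare-bones Python sketch of TLS termination, assuming a hypothetical certificate/key pair for the LB and a single hypothetical plaintext backend; a real LB would of course juggle many backends and close connections cleanly.

import socket
import ssl
import threading

CERT, KEY = "lb.crt", "lb.key"    # hypothetical certificate/key for the LB
BACKEND = ("10.0.0.12", 8080)     # hypothetical internal app server (plain HTTP)

ctx = ssl.SSLContext(ssl.PROTOCOL_TLS_SERVER)
ctx.load_cert_chain(CERT, KEY)

def pipe(src, dst):
    # Copy bytes one way until the sender closes its end.
    while chunk := src.recv(4096):
        dst.sendall(chunk)

with socket.create_server(("0.0.0.0", 443)) as server:
    with ctx.wrap_socket(server, server_side=True) as tls_server:
        while True:
            client, _ = tls_server.accept()               # encrypted leg: user <-> LB
            backend = socket.create_connection(BACKEND)   # plaintext leg: LB <-> server
            threading.Thread(target=pipe, args=(client, backend), daemon=True).start()
            threading.Thread(target=pipe, args=(backend, client), daemon=True).start()
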
But how does the LB itself work?

Reverse Proxy
Strictly speaking, a reverse proxy is a type of proxy server. Unlike a traditional proxy server,
which is used to protect clients, a reverse proxy is used to protect servers. A reverse proxy is a
server that accepts a request from a client, forwards the request to one of many other
servers, and returns the result from the server that processed the request to the client as if the
proxy server had processed the request itself. The client communicates directly only with the
reverse proxy server and does not know that some other server processed its request.

So, the LB is working as a reverse proxy for us. This also fixes a security issue: if the
response were sent directly from the server, it would expose a wide variety of information
about that server in the response headers. Reverse proxying the request ensures that the
information in the response headers is that of the reverse proxy (the LB in this case) and not of the
server running the application.
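
A minimal reverse proxy sketch using only the Python standard library; the backend address is a hypothetical internal server, and only GET is handled to keep it short.

from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.request import urlopen

BACKEND = "http://10.0.0.12:8080"  # hypothetical internal application server

class ReverseProxy(BaseHTTPRequestHandler):
    def do_GET(self):
        # Forward the client's request to the backend and read its response.
        with urlopen(BACKEND + self.path) as upstream:
            body = upstream.read()
            status = upstream.status
        # The headers we send are the proxy's own, not the backend's, so
        # the client learns nothing about the server that did the work.
        self.send_response(status)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

HTTPServer(("0.0.0.0", 80), ReverseProxy).serve_forever()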

Health Checks
Depending on the provider, the LB checks whether a host is active or not. “Active” and “inactive”
are generic terms; providers generally use their own names for them. For instance,
AWS uses “healthy” and “unhealthy” respectively.

The LB pings each server on a configured port (e.g., 8080), requesting a configured path (e.g., /) over
a configured protocol (e.g., HTTP) at a regular interval (e.g., 30 s). The response timeout (e.g., 5 s) is
the maximum time the LB will wait for each request before terminating it and marking it as failed. A
number of consecutive failed requests equal to the unhealthy threshold (e.g., 2) is required before a host
is marked inactive or unhealthy. Along the same lines, a number of consecutive successful requests equal
to the healthy threshold (e.g., 2) is necessary to declare a target host active or healthy.
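
A Python sketch of that loop using the example values above (port 8080, path /, HTTP, 30 s interval, 5 s timeout, both thresholds 2); the lb object with mark_active/mark_inactive is a hypothetical stand-in for the balancer's routing table.

import time
from urllib.request import urlopen

INTERVAL, TIMEOUT, THRESHOLD = 30, 5, 2
streaks = {}  # host -> consecutive successes (positive) or failures (negative)

def check(host):
    try:
        with urlopen(f"http://{host}:8080/", timeout=TIMEOUT) as resp:
            return 200 <= resp.status < 400
    except OSError:
        return False  # timed out or unreachable: a failed check

def update_streak(streak, ok):
    # Successes count up from zero; failures count down. A flip resets the run.
    if ok:
        return streak + 1 if streak >= 0 else 1
    return streak - 1 if streak <= 0 else -1

def health_check_loop(hosts, lb):
    while True:
        for host in hosts:
            streaks[host] = update_streak(streaks.get(host, 0), check(host))
            if streaks[host] >= THRESHOLD:
                lb.mark_active(host)      # healthy threshold reached
            elif streaks[host] <= -THRESHOLD:
                lb.mark_inactive(host)    # unhealthy threshold reached
        time.sleep(INTERVAL)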

Scale itself
Think of a case where the LB is bombarded with such an insane number of requests that the LB itself goes
down! Remember, the LB is the entry point of our application, and if it goes down, we are doomed!

Depending on the provider, different mechanisms are put in place to ensure that the LB is
always ready to scale itself up and down as and when necessary. For instance, in AWS, the load
balancer has a traffic capacity measured in Gbps and can scale itself automatically, up to a maximum
of 100 nodes.

Algorithms
The algorithm most widely used to efficiently distribute load is Round Robin. You may
already be familiar with the Round Robin algorithm, but let’s cover it with the help of an
architectural diagram.

Round Robin
Round-robin load balancing is the simplest and most commonly used load balancing algorithm.
Client requests are distributed to application servers in simple rotation. For example, if you have
three application servers, the first client request is sent to the first application server in the list,
the second client request to the second, the third client request to the third, the fourth back to the
first, and so on.
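
In code, round robin is just a pointer cycling through the server list; here is a minimal Python sketch with made-up server names.

from itertools import cycle

class RoundRobinBalancer:
    def __init__(self, servers):
        self._rotation = cycle(servers)   # endless rotation over the server list

    def next_server(self):
        return next(self._rotation)

lb = RoundRobinBalancer(["app-1", "app-2", "app-3"])
print([lb.next_server() for _ in range(4)])   # ['app-1', 'app-2', 'app-3', 'app-1']
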
Concepts involved in setting up a Load Balancer

Listeners

Listeners for a load balancer are the combinations of protocol and port on which the LB will
accept incoming requests. For example, in the configuration shown above, only HTTPS connections on
port 443 and HTTP connections on port 80 will be accepted by the LB; connections on anything else
will simply be discarded. We also define the destination for these requests: in the same configuration,
HTTPS and HTTP requests will be forwarded to HTTP port 8080 on the host.
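
On AWS, a listener like this can be created with boto3; the sketch below shows the HTTPS listener, and every ARN is a placeholder for your own resources.

import boto3

elbv2 = boto3.client("elbv2")

# HTTPS listener on 443, forwarding to a target group whose hosts
# receive plain HTTP on 8080 (the target group defines that port).
elbv2.create_listener(
    LoadBalancerArn="arn:aws:elasticloadbalancing:...:loadbalancer/...",   # placeholder
    Protocol="HTTPS",
    Port=443,
    Certificates=[{"CertificateArn": "arn:aws:acm:...:certificate/..."}],  # placeholder
    DefaultActions=[{
        "Type": "forward",
        "TargetGroupArn": "arn:aws:elasticloadbalancing:...:targetgroup/...",  # placeholder
    }],
)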

Routing
A load balancer (in some cases) also allows us to forward a request to a defined destination
based on certain conditions. For example, let’s say I would like to forward all admin
requests to an internal fleet of servers that can serve them. To achieve this, we can
create a rule stating that if the request is an admin request (path matching), it is forwarded to an
internal admin server.
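
A boto3 sketch of such a path-based rule, reusing the elbv2 client from the listener sketch above; the ARNs are again placeholders.

# Requests whose path matches /admin/* go to the internal admin target
# group; everything else falls through to the listener's default action.
elbv2.create_rule(
    ListenerArn="arn:aws:elasticloadbalancing:...:listener/...",  # placeholder
    Priority=10,
    Conditions=[{"Field": "path-pattern", "Values": ["/admin/*"]}],
    Actions=[{
        "Type": "forward",
        "TargetGroupArn": "arn:aws:elasticloadbalancing:...:targetgroup/admin...",  # placeholder
    }],
)
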
Target Group
Each target group is used to route requests to one or more registered targets. When we create
a listener rule, we specify a target group and conditions. When a rule condition is met, traffic
is forwarded to the corresponding target group. We can create different target groups for
different types of requests: for example, one target group for general requests and other
target groups for requests to your application’s microservices.
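
A boto3 sketch of creating a target group, wiring in the health check settings discussed earlier; the name and VpcId are placeholders, and the elbv2 client is the one from the earlier sketches.

elbv2.create_target_group(
    Name="scaler-web",                 # hypothetical name
    Protocol="HTTP",
    Port=8080,                         # hosts receive traffic on this port
    VpcId="vpc-...",                   # placeholder
    HealthCheckProtocol="HTTP",
    HealthCheckPath="/",
    HealthCheckIntervalSeconds=30,
    HealthCheckTimeoutSeconds=5,
    HealthyThresholdCount=2,
    UnhealthyThresholdCount=2,
)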

Further Reading and Questions


1. How is traffic across different regions regulated? At a very high level, how would a
   deployment execute in this case? (“Global Server Load Balancer”)
2. Let’s say a user is logged in to their account on host A. What if the next request from
   the same user goes to another host B? What kinds of inconsistencies can arise in this
   case? (Session stickiness concept)
3. What are dynamic and static content? How do you think the load balancer architecture
   works in this scenario? (Static content hosting)
