0% found this document useful (0 votes)

21 views22 pages

L5 LargeScaleWebApps

The document discusses large-scale web application architecture, focusing on scale-out strategies for web servers and storage systems. It highlights the importance of load balancing, stateless servers, and cloud computing as solutions for managing scalability and resource efficiency. Additionally, it covers serverless architectures and content distribution networks as modern approaches to optimize web applications.

Uploaded by

pulivenuu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views22 pages

L5 LargeScaleWebApps

Uploaded by

pulivenuu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 22

Large-Scale

Web
Prof.P.V.Sudha,
Professor & Head,Applications
Dept. of CSE, UCEOU,
Director, CoE in AIML, Osmania University
Web Application
Architecture Web Server
Web A
/ pplication Storage
Browser server System

HTTP

LA
Intern

N
et

2
Large-Scale: Scale-Out
Architecture Web
Web Servers Storage
Browser System

HTTP

LA
Intern

N
et

3
Scale-out architecture

● Expand capacity by adding more instances

● Contrast: Scale-up architecture - Switch to a bigger
instance
○ Quickly hit limits on how big of single instances you can build

● Benefits of scale-out
○ Can scale to fit needs: Just add or remove instances
○ Natural redundancy make tolerating failures easier: One instance
dies others keep working

● Challenge: Need to manage multiple instances and

distribute work to them
Scale out web servers: Which server do you
use?
● Browsers want to speak HTTP to a web server - TCP/IP
connect

● Use load balancing to distribute incoming HTTP requests

across many front-end web servers

● HTTP redirection (e.g. HotMail):

○ Front-end machine accepts initial connections
○ Redirects them among an array of back-end machines

● DNS (Domain Name System) load balancing:

○ Specify multiple targets for a given name
○ Handles geographically distributed system
○ DNS servers rotate among those targets
CS142 Lecture Notes - Large-Scale Web
Apps
Load-balancing switch ("Layer 4-7
Switch")
● Special load balancer network switch
○ Incoming packets pass through load balancer switch between Internet
and web servers
○ Load balancer directs TCP connection request to one of the many web
servers
○ Load balancer will send all packets for that connection to the same
server.

● In some cases the switches are smart enough to inspect

session cookies, so that the same session always goes to
the same server.
● Stateless servers make load balancing easier (different
requests from the same user can be handled by different
nginx ("Engine X")
● Super efficient web server (i.e. speaks HTTP)
○ Handles 10s of thousands of HTTP connections

● Uses:
○ Load balancing - Forward requests to collection of front-end web
servers
○ Handles front-end web servers coming and going (dynamic pools
of server)
■ Fault tolerant - web server dies the load balance just quits
using it
○ Handles some simple request - static files, etc.
○ DOS mitigation - request rate limits

●
Scale-out assumption: any web server
will do
● Stateless servers make load balancing easier
○ Different requests from the same user can be handled by different
servers
○ Requires database to be shared across web servers

● What about session state?

○ Accessed on every request so needs to be fast (memcache?)

● WebSockets bind browsers and web server

○ Can not load balance each request
Scale-out storage system
● Traditionally Web applications have started off using
relational databases
● A single database instance doesn't scale very far.
● Data sharding - Spread database over scale-out
instances
○ Each piece is called data shard
○ Can tolerate failures by replication - place more than one copy of
data (3 is common)
● Applications must partition data among multiple
independent databases, which adds complexity.
○ Facebook initial model: One database instance per university
○ In 2009: Facebook had 4000 MySQL servers - Use hash function to
Memcache: main-memory caching
system
● Key-value store (both keys and values are arbitrary
blobs)
● Used to cache results of recent database queries
● Much faster than databases:
○ 500-microsecond access time, vs. 10's of milliseconds
● Example: Facebook had 2000 memcache servers by
2009
○Writes must still go to the DBMS, so no performance improvement
for them
○Cache misses still hurt performance
○Must manage consistency in software (e.g., flush relevant
Scale-out web Database
Server
architecture Database
Server
Web Server Database
Server
Web Server Database
Server
Web Server
Database
Server
Web Server

Intern Load Balancer Web Server

Memcache
et
Web Server Memcache
Web Server
Memcache

Web Server Memcache

Building this architecture is
hard
● Large capital and time cost in buying and installing
equipment

● Must become expert in datacenter management

● Figuring out the right number of different components

hard

○ Depends on load demand

Scaling issues were hard for early
web app
● Startup: Initially, can't afford expensive systems for
managing large scale.
● But, application can suddenly become very popular
("flash crowd"); can be disastrous if application can
not scale quickly.

● Many of the early web apps either lived or died by the

ability to scale

○ Friendster vs. Facebook

Virtualization - Virtual and Physical
machines
Virtual Machines Physical
Images (Disk Machines
Images) server server server
Load Balancer
Virtualization server server server
Web Server layer
server server server
Database Load balancer 1
Server server server server
Memcache Web Server 100
server server server
Database 50

Memcache 20 server server server

Cloud Computing
● Idea: Use servers housed and managed by
someone else
○ Use Internet to access them

● Virtualization is a key enabler Load balancer 1

Web Server 100

Specify your compute, storage, communication
needs: Cloud provider does the rest Database 50

Memcache 20
● Examples:
Amazon EC2
Microsoft Azure
Google Cloud
Many others
Cloud Computing Advantages
● Key: Pay for the resources you use
○ No upfront capital cost
○ Need 1000s machines right now? Possible
○ Perfect fit for startups:
■ 1998 software startup: First purchase: server machines
■ 2012 software startup: No server machines

● Typically billing is on resources:

○ CPU core time, memory bytes, storage bytes, network bytes

● Runs extremely efficiently

○ Buy equipment in large quantities, get volume discounts
○ Hirer a few experts to manage large numbers of machines
○ Place servers where space, electricity, and labor is cheap
Higher level interfaces to web app cloud
●services
Managing a web app backend at the level of virtual machines requires
system building skills

● If you don't need the full generality of virtual machines you can use
some already scalable platform.
○ Don't need to manage OSes: Container systems like
Docker/Kubernetes
■ Specify programs and dependencies that run as a process
○ Don't need to manage storage - Cloud database storage
■ Let the cloud run the database
○ Don't need to manage instances/load balancing:
Serverless
■ Let the cloud run the scale-out compute
infrastructure
Cloud Database
● Rather than running database instances - Use cloud
Storage
run databases
○ Cloud provider has experts at running large scale systems

● Example: Google Spanner, Amazon DynamoDB

○ You: define schama, provide data, access using queries
○ Cloud provider: runs storage services

● Features:
○ High Available
○ High Performance
○ Global replication and region containment
○ Consistency
○ Security
○ Usage based pricing
Serverless approach: Amazon
Lambda
● You provide pieces of code, URLs associated with each
piece of code

● Amazon Lambda does the rest:

○ Allocate machines to run your code
○ Arrange for name mappings so that HTTP requests find their way to
your code
○ Scale machine allocations up and down automatically as load changes
○ Lambda environment also includes a scalable storage system

● More constrained environment

○ Must use their infrastructure and supported environments: Python,
JavaScript, Java, Go, ...
Serverless architecture - Cloud
●provider
Hand over web-servers to cloud infrastructure
● Developer just specifies code to run on each URL & HTTP verb
○ Like Node/Express handlers

● Examples:
○ Amazon Lambda Functions
○ Microsoft Azure Functions
○ Google Cloud Functions

● Cloud provides services only (no servers)

○ Handles all scale-out, reliability, infrastructure security, monitoring, etc.
○ Pay by the request - Enable to pack function execution into available server
resources

● Web App backend: Schema specification for cloud storage,

handler functions
Content Distribution Network
(CDN)
● Consider a read-only part of our web app (e.g. image, React
JavaScript, etc.)
○ Browser needs to fetch but doesn't care where it comes from

● Content distribution network

○ Has many servers positions all over the world
○ You give them some content (e.g. image) and they give you an URL
○ You put that URL in your app (e.g. <img src="...)
○ When user's browsers access that URL they are sent to the closest server (DNS
trick)
● Benefits:
○ Faster serving of app contents
○ Reduce load on web app backend

● Only works on content that doesn't need to change often

Cloud Computing and Web
Apps
● The pay-for-resources-used model works well for many web app
companies
○ At some point if you use many resources it makes sense to build own data
centers

● Many useful infrastructure services available:

○ Auto scaling (spinning up and down instances on load changes)
○ Geographic distribution (can have parts of the backend in different parts of the
world)
○ Monitoring and reporting (what parts of web app is being used, etc.)
○ Fault handling (monitoring and mapping out failed servers)

● Cloud Application Programming Interfaces (APIs):

○ Analytics
○ Machine learning - Prediction, recommendation, etc.
○ Translation, image recognition, maps, etc.

All Notes of WEB
No ratings yet
All Notes of WEB
15 pages
SD Blueprint Merged
No ratings yet
SD Blueprint Merged
160 pages
Serverless Handbook
100% (1)
Serverless Handbook
360 pages
S11 - System Architecture
No ratings yet
S11 - System Architecture
79 pages
Electronic Commerce Fundamentals Applications.
100% (1)
Electronic Commerce Fundamentals Applications.
504 pages
Chapter 4 - Building Scalable Web Applications
No ratings yet
Chapter 4 - Building Scalable Web Applications
19 pages
Scaling To Millions Users
No ratings yet
Scaling To Millions Users
21 pages
CloudComputing Lect1
No ratings yet
CloudComputing Lect1
53 pages
MC4201 Unit 2
No ratings yet
MC4201 Unit 2
37 pages
Part 1 - Scalability
No ratings yet
Part 1 - Scalability
52 pages
1 Cloud S - Merged
No ratings yet
1 Cloud S - Merged
92 pages
Module 4 CC
No ratings yet
Module 4 CC
43 pages
Cloudcomputing m1&2
No ratings yet
Cloudcomputing m1&2
72 pages
Module 5
No ratings yet
Module 5
11 pages
SB http1
No ratings yet
SB http1
68 pages
Module Six Cloud Computing-1
No ratings yet
Module Six Cloud Computing-1
27 pages
Bca Ca357 Mod 3
No ratings yet
Bca Ca357 Mod 3
38 pages
Unit1 B
No ratings yet
Unit1 B
48 pages
Computing: TE: Artificial Intelligence and Data Science
No ratings yet
Computing: TE: Artificial Intelligence and Data Science
19 pages
Scalability and Security
No ratings yet
Scalability and Security
31 pages
W2C1 History Building Blocks Cloud Computing
No ratings yet
W2C1 History Building Blocks Cloud Computing
38 pages
Lecture 06
No ratings yet
Lecture 06
68 pages
09 01 Services Slides
No ratings yet
09 01 Services Slides
34 pages
Web Applications (1st Part)
No ratings yet
Web Applications (1st Part)
72 pages
CCS IA 2 Notes
No ratings yet
CCS IA 2 Notes
18 pages
Lecture 5
No ratings yet
Lecture 5
32 pages
Challenges in Cloud Security
No ratings yet
Challenges in Cloud Security
31 pages
1 Cloud S
No ratings yet
1 Cloud S
33 pages
UNIT V Cloud Platforms in Industry
No ratings yet
UNIT V Cloud Platforms in Industry
10 pages
12
No ratings yet
12
16 pages
NOTES - CIT 237 - Web Services and Cloud Computing
No ratings yet
NOTES - CIT 237 - Web Services and Cloud Computing
20 pages
SplashtopCenter v2.3.5.x Admin Guide v1.7
No ratings yet
SplashtopCenter v2.3.5.x Admin Guide v1.7
226 pages
Mad 1 Week 1
No ratings yet
Mad 1 Week 1
10 pages
Cloud Computing Infrastructure
No ratings yet
Cloud Computing Infrastructure
33 pages
17 Web Application Firewall
No ratings yet
17 Web Application Firewall
25 pages
ECS781P 2 CloudNetworking
No ratings yet
ECS781P 2 CloudNetworking
59 pages
Fundamentals System Design
No ratings yet
Fundamentals System Design
27 pages
CC Unit 5 Own Notes
No ratings yet
CC Unit 5 Own Notes
8 pages
CP1402 Week 2 I OSI and Troubleshooting STUDENT
No ratings yet
CP1402 Week 2 I OSI and Troubleshooting STUDENT
27 pages
IF-CO VISem Server Side SCripting Using JSP (CO) 141220181905 GAE2
No ratings yet
IF-CO VISem Server Side SCripting Using JSP (CO) 141220181905 GAE2
8 pages
Lecture No 2
No ratings yet
Lecture No 2
18 pages
CC Unit 5 Own Notes
No ratings yet
CC Unit 5 Own Notes
13 pages
Deployement and Best Practice - Unit-6
No ratings yet
Deployement and Best Practice - Unit-6
10 pages
Backend Burger
No ratings yet
Backend Burger
3 pages
Computer Science VIII Application Layer: Ing. Etson Guerrero
No ratings yet
Computer Science VIII Application Layer: Ing. Etson Guerrero
77 pages
All1 7ForMidTerm PDF
No ratings yet
All1 7ForMidTerm PDF
97 pages
Netflix Debunker 3.0
No ratings yet
Netflix Debunker 3.0
78 pages
Lecture 4
No ratings yet
Lecture 4
10 pages
Cloud Services and Platforms - Compute Services
No ratings yet
Cloud Services and Platforms - Compute Services
4 pages
The System Design
No ratings yet
The System Design
135 pages
Cloud Computing - Lecture 2
No ratings yet
Cloud Computing - Lecture 2
22 pages
Serverless & Faas
No ratings yet
Serverless & Faas
5 pages
Income Tax Synopsis
No ratings yet
Income Tax Synopsis
24 pages
ITT501 Chapter 3 PDF
No ratings yet
ITT501 Chapter 3 PDF
2 pages
UCR Library Serverless Application Architecture
No ratings yet
UCR Library Serverless Application Architecture
16 pages
Ict1532 - 6
No ratings yet
Ict1532 - 6
3 pages
Statistics Webservices Api Reference Guide
No ratings yet
Statistics Webservices Api Reference Guide
69 pages
Atharv 23 Cloud Computing Technology CaseStudy
No ratings yet
Atharv 23 Cloud Computing Technology CaseStudy
8 pages
User Authentication Module Design
No ratings yet
User Authentication Module Design
3 pages
Fiddler
No ratings yet
Fiddler
54 pages
3GPP CDR Specifications
No ratings yet
3GPP CDR Specifications
138 pages
Notes On Jboss Application Server and Ejb 3.0
No ratings yet
Notes On Jboss Application Server and Ejb 3.0
35 pages
Introduction To Compute
No ratings yet
Introduction To Compute
6 pages
AX Training
100% (1)
AX Training
104 pages
How To Design A System To Scale To Your First 100 Million Users - by Anh T. Dang - Level Up Coding
No ratings yet
How To Design A System To Scale To Your First 100 Million Users - by Anh T. Dang - Level Up Coding
34 pages
Cyber-Ark Privileged Identity Management 7 1 CEF Config Guide 2012
No ratings yet
Cyber-Ark Privileged Identity Management 7 1 CEF Config Guide 2012
8 pages
Cookie Cadger
No ratings yet
Cookie Cadger
53 pages
Ethical Hacking and Countermeasures: Course Outline
No ratings yet
Ethical Hacking and Countermeasures: Course Outline
51 pages
Vineet Gupta - GM - Software Engineering - Directi: Intelligent People. Uncommon Ideas
No ratings yet
Vineet Gupta - GM - Software Engineering - Directi: Intelligent People. Uncommon Ideas
73 pages
Online Examination System
33% (3)
Online Examination System
81 pages
Cloud Computing: Traditional Sever Concept
No ratings yet
Cloud Computing: Traditional Sever Concept
27 pages
System Design Cheat Sheet
No ratings yet
System Design Cheat Sheet
6 pages
Introduction To ISO
No ratings yet
Introduction To ISO
8 pages
Client - Server Architecture
No ratings yet
Client - Server Architecture
3 pages
Amazon Web Service CASE STUDY
No ratings yet
Amazon Web Service CASE STUDY
36 pages
Web2py: Ideas We Stole - Ideas We Had
100% (2)
Web2py: Ideas We Stole - Ideas We Had
47 pages
What Is Cloud Computing?: (And An Intro To Parallel/distributed Processing)
No ratings yet
What Is Cloud Computing?: (And An Intro To Parallel/distributed Processing)
10 pages
Lecture 12 - Wireless Application Protocol
No ratings yet
Lecture 12 - Wireless Application Protocol
28 pages
EGMP
No ratings yet
EGMP
29 pages
RK JSP Session Cookies
No ratings yet
RK JSP Session Cookies
39 pages
Cloud Computing Infrastructure: Take A Seat & Prepare To Fly
No ratings yet
Cloud Computing Infrastructure: Take A Seat & Prepare To Fly
33 pages
Cloud Computing Infrastructure: Take A Seat & Prepare To Fly
No ratings yet
Cloud Computing Infrastructure: Take A Seat & Prepare To Fly
33 pages
BIS 8 Trends
No ratings yet
BIS 8 Trends
36 pages
Final VT Report
No ratings yet
Final VT Report
36 pages
Part-4: What Happens When A User Performs A Voice Call From An LTE/4G Network? 4. SRVCC - Single Radio Voice Call Continuity
No ratings yet
Part-4: What Happens When A User Performs A Voice Call From An LTE/4G Network? 4. SRVCC - Single Radio Voice Call Continuity
6 pages
Modern Web Application Architecture Overview
No ratings yet
Modern Web Application Architecture Overview
9 pages
PHP Questions
No ratings yet
PHP Questions
68 pages
Introduction To Cloud Computing
No ratings yet
Introduction To Cloud Computing
36 pages
Cloud Computing Made Simple: Navigating the Cloud: A Practical Guide to Cloud Computing
From Everand
Cloud Computing Made Simple: Navigating the Cloud: A Practical Guide to Cloud Computing
Poonam Devi
No ratings yet
How To Do Virtualization: Your Step-By-Step Guide To Virtualization
From Everand
How To Do Virtualization: Your Step-By-Step Guide To Virtualization
HowExpert
No ratings yet