Network Design: Architecting With Google Cloud Platform: Design and Process

Uploaded by

Daniel Reyes

Network Design

Architecting with Google Cloud Platform:

Design and Process

Last modified 2018-08-08

© 2017 Google Inc. All rights reserved. Google
and the Google logo are trademarks of Google Inc.
All other company and product names may be
trademarks of the respective companies with
which they are associated.

Agenda
Network configuration for data transfer within the service

Network integration with other environments

Photo service: periodic slowdown

Design challenge #3: Growth

GCP lab Deployment Manager: Adding load balancing


Network configuration for data transfer within the service
Location

Load Balancing

Caching

Location of resources within the cloud network is significant

Time in ms (1 million ns = 1 ms)

● Send 2 kB over 1 Gbps network | 20,000 ns | 0.02 ms
● Round trip within same datacenter | 500,000 ns | 0.5 ms
● Send packet CA->Netherlands->CA | 150,000,000 ns | 150 ms

No more than 6-7 round trips between Europe and the US per second are possible,
but approximately 2000 per second can be achieved within a datacenter.
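The round-trip limits quoted above follow directly from the latency figures. As a minimal sketch (the function name is illustrative, and it assumes each round trip must finish before the next one starts):

```python
# Latency figures taken from the list above, in nanoseconds.
NS_PER_SECOND = 1_000_000_000


def round_trips_per_second(round_trip_ns: int) -> float:
    """Upper bound on strictly sequential round trips in one second."""
    return NS_PER_SECOND / round_trip_ns


same_datacenter = round_trips_per_second(500_000)      # 0.5 ms round trip
transatlantic = round_trips_per_second(150_000_000)    # 150 ms round trip

print(f"same datacenter: {same_datacenter:.0f} round trips/s")
print(f"CA -> Netherlands -> CA: {transatlantic:.2f} round trips/s")
```

Parallel connections can raise the total, which is one reason parallelism matters as much as raw network speed.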

The technology that allows you to control the network location of resources used by your service is load balancing.

Location is significant. In this case, though, you pay more (in latency) for something that is farther away.
Note: These figures describe VM-to-VM communication inside the Google network.

You can use performance testing tools such as iperf to measure timing.

Load balancing provides control over location and scale

Load balancing can get user traffic to application servers with capacity in the closest region, giving your design control over network location.

Load balancing can scale services by distributing traffic over multiple servers and triggering autoscaling.

Google provides several load balancing services that offer different location controls and traffic distribution methods. They are optimized for different use cases.

Network speed is just one factor in throughput. Network location is key, and parallelism is another factor. Load balancing combines both.

Selecting Load Balancing Services

GLOBAL

HTTP(S)
● Proxied
● Cross-region
● Content-based
● IPv4 or IPv6

SSL Proxy
● Proxied
● Certificates
● Intelligent routing
● IPv4 or IPv6

TCP Proxy
● Proxied
● Intelligent routing
● IPv4 or IPv6

REGIONAL

Network
● Pass-through
● Any TCP or UDP
● Session affinity on/off
● Health checks

Internal
● Can be proxied or pass-through
● Any TCP or UDP
● Internal IP
● Requests stay within the VPC
● Software-defined

Load balancing is either proxied or pass-through. A proxied load balancer terminates the incoming connection and initiates a separate connection to the target (usually SSL or TCP). A non-proxied, or pass-through, load balancer redirects and distributes the traffic without terminating the connection and initiating a separate one. Global load balancers can direct traffic to the closest region with capacity, whereas regional load balancers direct traffic to or within a single region.
https://cloud.google.com/compute/docs/load-balancing/

HTTP(S) load balancing processes HTTP requests on port 80 or port 8080, and HTTPS requests on port 443.

SSL Proxy supports the following ports: 25, 43, 110, 143, 195, 443, 465, 587, 700, 993, 995
TCP Proxy supports the following ports: 25, 43, 110, 143, 195, 443, 465, 587, 700, 993, 995

Intelligent routing means that capacity is considered in the routing decision.

Internal load balancing is software-defined, meaning that there is no hardware interface to serve as a choke point or to put load balancer availability at risk.

In general, for traffic originating externally, stick to the protocol-named service that is
designed for and optimized for that protocol, unless you have a compelling reason.
For multi-tier internal traffic, use the internal load balancing service.
Then use the more general network load balancing service for anything else.
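
The guidance above can be read as a small decision function. This is a sketch of that prose only, not a Google API: the function name and the `protocol` labels ("http", "https", "ssl", "tcp", "udp") are hypothetical, and it ignores any compelling reason you might have to deviate.

```python
def choose_load_balancer(external: bool, protocol: str) -> str:
    """Return the load balancing service suggested by the guidance above.

    `protocol` is a hypothetical label such as "http", "https", "ssl",
    "tcp" or "udp"; it is an assumption of this sketch.
    """
    protocol = protocol.lower()
    if not external:
        # Multi-tier internal traffic: use the internal service.
        return "Internal"
    if protocol in ("http", "https"):
        return "HTTP(S)"
    if protocol == "ssl":
        return "SSL Proxy"
    if protocol == "tcp":
        return "TCP Proxy"
    # Anything else (for example UDP) falls back to the general service.
    return "Network"


print(choose_load_balancer(external=True, protocol="https"))
print(choose_load_balancer(external=False, protocol="tcp"))
```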

Maglev research paper: https://research.google.com/pubs/pub44824.html

Choosing Load Balancing

[Flowchart. Decision points: Does the traffic originate externally? Is it web traffic (HTTP or HTTPS)? Is it SSL (TLS) traffic? Is it TCP traffic? Is pass-through required? Compelling reason? Outcomes: HTTP(S), SSL Proxy, TCP Proxy, Network, Internal.]

HTTP(S) load balancing: You can configure URL rules that route some URLs to one
set of instances and route other URLs to other instances. Requests are always routed
to the instance group that has capacity and is closest to the user.
https://cloud.google.com/compute/docs/load-balancing/http/
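
To illustrate the idea of URL rules, here is a minimal sketch of content-based routing. It is not the actual URL map configuration: the backend names are hypothetical and the longest-prefix rule is an assumption of this example. It also leaves out the second half of the routing decision, which is choosing the closest instance group with capacity.

```python
from typing import Dict


def route(path: str, url_rules: Dict[str, str], default_backend: str) -> str:
    """Pick the backend whose path prefix matches `path`; longest prefix wins."""
    matches = [prefix for prefix in url_rules if path.startswith(prefix)]
    if not matches:
        return default_backend
    return url_rules[max(matches, key=len)]


# Hypothetical rules: video and static content go to dedicated instance sets.
rules = {"/video/": "video-instances", "/static/": "static-instances"}

print(route("/video/cats.mp4", rules, "web-instances"))  # video-instances
print(route("/index.html", rules, "web-instances"))      # web-instances
```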

SSL proxy: A proxied global load balancing service that automatically directs SSL
traffic to the closest region that has capacity.
https://cloud.google.com/compute/docs/load-balancing/tcp-ssl/

TCP proxy: Terminates IPv4 and IPv6 client connections and initiates an IPv4 connection to the backend servers.
https://cloud.google.com/compute/docs/load-balancing/tcp-ssl/tcp-proxy

Network load balancing allows you to balance load of your systems based on
incoming IP protocol data, such as address, port, and protocol type.
https://cloud.google.com/compute/docs/load-balancing/network/
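
One common way to balance on address, port, and protocol is to hash those fields and use the hash to pick a backend. The sketch below shows that general technique only; it is an assumption for illustration and not the algorithm Google's network load balancer actually uses. All names and addresses are hypothetical.

```python
import hashlib
from typing import List


def pick_backend(src_ip: str, src_port: int, dst_ip: str, dst_port: int,
                 protocol: str, backends: List[str]) -> str:
    """Map a connection's address/port/protocol tuple to one backend."""
    key = f"{src_ip}:{src_port}->{dst_ip}:{dst_port}/{protocol}".encode()
    digest = hashlib.sha256(key).digest()
    # Use the first 8 bytes of the digest as an integer index.
    index = int.from_bytes(digest[:8], "big") % len(backends)
    return backends[index]


backends = ["vm-a", "vm-b", "vm-c"]
first = pick_backend("203.0.113.7", 50123, "198.51.100.1", 443, "tcp", backends)
again = pick_backend("203.0.113.7", 50123, "198.51.100.1", 443, "tcp", backends)
print(first, again)  # the same connection tuple always maps to the same backend
```

Because the mapping is deterministic, packets of one connection keep reaching the same backend without the load balancer terminating the connection.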

Internal load balancing enables you to run and scale your services behind a private
load balancing IP address which is accessible only to instances internal to your Virtual
Private Cloud (VPC). https://cloud.google.com/compute/docs/load-balancing/internal/

Network load balancing was used for internal load balancing before the internal load balancing service was available. Configuration is significantly more complicated with network load balancing because you have to restrict access to the VPC using firewall rules and routes. You must also plan for the capacity of the load balancer itself, because choke points are possible and the load balancer could reach capacity and impact availability. If there is some reason internal load balancing won't work in your situation, network load balancing is still an alternative. However, there are no common use cases that require it.

Network integration with other environments
Existing on-premise

Multi-cloud

Cloud External IP Address

GCP offers global static external IP addresses:

● You can use global IPs in DNS records.

● They are only available to global forwarding rules.
● The global forwarding rule is used for global load balancing.
● You cannot assign a global IP address to a regional or zonal resource.

To get a global static IP address for a GCP resource, configure global load balancing.

Cloud CDN

Cloud CDN (Content Delivery Network) uses Google's globally distributed edge points of presence to cache HTTP(S) load balanced content close to your users.

Caching content at the edges of Google's network provides faster delivery of content to your users while reducing costs.

● Lowers network latency

● Offloads origins
● Reduces server requirements

In certain circumstances caching can be a design issue. For example, if a value your
application relies on is cached and you want to roll out a new version of the
application that changes the value, the cached value could create issues that are
difficult to troubleshoot.

In general, Cloud CDN handles caching transparently.

If you decide to use a third-party or open source cache as part of your solution, investigate how it handles cache management.

Advice on third-party or open source cache products:

● Know the type of cache:
  ● Performance (latency) cache: exists to serve data with lower average latency than the backend.
  ● Capacity (throughput) cache: exists to serve higher throughput than the backend can handle.
● Starting up from cold is difficult; a gradual ramp of traffic is required.
● Be careful using caches if strict consistency matters.
● Cache invalidation is complicated. Cache invalidation is a process (purge, refresh, or ban) whereby entries in a cache are replaced or removed. It can be done explicitly, as part of a cache coherence protocol.
https://en.wikipedia.org/wiki/Cache_invalidation
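
To make the invalidation terms concrete, here is a minimal sketch of a cache with time-based refresh and an explicit purge. The class and its methods are hypothetical and do not correspond to any particular cache product; the injectable clock exists only so the behavior can be demonstrated without waiting.

```python
import time
from typing import Any, Callable, Dict, Tuple


class TTLCache:
    """Tiny in-memory cache: entries expire after `ttl_seconds` or on purge."""

    def __init__(self, ttl_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries: Dict[str, Tuple[float, Any]] = {}

    def get(self, key: str, load: Callable[[], Any]) -> Any:
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and now - entry[0] < self._ttl:
            return entry[1]              # fresh hit: backend is not touched
        value = load()                   # miss or expired: refresh from backend
        self._entries[key] = (now, value)
        return value

    def purge(self, key: str) -> None:
        """Explicit invalidation, for example right after a new release."""
        self._entries.pop(key, None)


cache = TTLCache(ttl_seconds=60)
print(cache.get("config", lambda: "v1"))   # loaded from the backend
print(cache.get("config", lambda: "v2"))   # still "v1": served from cache
cache.purge("config")
print(cache.get("config", lambda: "v2"))   # "v2" after the purge
```

The second call shows the design issue described earlier: until the entry expires or is purged, the application keeps seeing the old value.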

Design pattern: Multi-cloud solution with dedicated interconnect

[Diagram: Development and Production environments, each shown on the Google side (Shared Virtual Private Cloud, Cloud Router) and in another cloud (Router). Connection labels: Dedicated Interconnect, Cloud Partner Interconnect, Direct Connect.]

https://cloud.google.com/interconnect/docs

VPN configurations

Reliability configuration
● Two VPN gateways connect to the same peer IP.
● Traffic is load balanced between the two VPN gateways.
● If one path is lost the other takes over.

Aggregate capacity configuration
● Forward the same IP range to two peer gateways.
● Traffic is load balanced over the tunnels, combining the capacity.
● Max: 3 Gbps per tunnel over direct interconnect, 1.5 Gbps over internet.

Cloud Router
● Adds BGP dynamic discovery of routes.

[Diagrams: each configuration shows VPN gateways on the Google side connected to gateways in the peer network.]

Combine the two for reliability plus aggregate bandwidth.
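
As a rough planning aid, the per-tunnel maximums above can be turned into a tunnel count. This is a sketch under simplifying assumptions: traffic spreads evenly over the tunnels, each tunnel reaches its quoted maximum, and the function name and the target bandwidths are hypothetical.

```python
import math


def tunnels_needed(target_gbps: float, per_tunnel_gbps: float,
                   spare_tunnels: int = 0) -> int:
    """Tunnels required to carry `target_gbps`, plus optional spares."""
    return math.ceil(target_gbps / per_tunnel_gbps) + spare_tunnels


# 4 Gbps over the internet at 1.5 Gbps per tunnel.
print(tunnels_needed(4.0, 1.5))                    # 3
# Same target with one spare tunnel for reliability.
print(tunnels_needed(4.0, 1.5, spare_tunnels=1))   # 4
```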

https://cloud.google.com/compute/docs/vpn/advanced

VPN Performance

Verify that the capacity of the peer devices matches the VPN gateways

There are many settings, including MTU, which is normally dynamically set

You can influence performance by changing encryption during setup

● AES-GCM offers the highest throughput

If you are measuring throughput over VPN, use multiple TCP streams

● iperf -P <number of parallel streams>

https://cloud.google.com/compute/docs/vpn/advanced#recommended_measures_to_increase_vpn_throughput

Periodic slowdown

Under certain conditions the service is very slow, at other times it is fast.

What is causing this irregularity?

What's causing the service to slow down?

What can be done to fix it?

Okay, so let's go back to our photo service. In this case, we have a periodic
slowdown, which means that under certain conditions the service is very slow, but at
other times it's fast. So, what could be causing this irregularity? What's causing the
service itself to slow down? And, what can we do to fix it?

The system is slow. It is taking minutes to generate thumbnails

The thumbnail service is growing in terms of the number of thumbnails being generated.

However, during peak periods there appears to be a slowdown and it can take up to several minutes after submitting a photo for the thumbnail image to be returned.

The Web Dev team thinks the problem is in the thumbnail application code.
The App Dev team thinks the problem is in the web server application code.

Other teams were impacted:

The Support team has been dealing with user calls. They have asked for help.
The Operations team does not have procedures to fix the problem.

So, the thumbnail service is growing and this can be seen through the number of
thumbnails being generated, which is great. We're starting to get more popularity, but
monitoring our log processing shows that there appears to be a slowdown during
peak periods, and it can take up to several minutes after submitting a photo for the
thumbnail image to be returned.
Now, this doesn't really happen, but let's take a fictitious scenario where a company has groups of teams that don't really get along, or will blame each other. So in this case, the Web Dev team thinks that the problem is the thumbnail application code. Well, guess what? The people who wrote the code - the App Dev team - think the problem's in the web server application code, because it might not be handling sessions and so on.
But there are other teams that were impacted too. The Support team is dealing with user calls, so they're asking for help. The Operations team, the ones who manage the deployment and the production servers, doesn't have a procedure to fix the problem. They're not sure if it's the Web team's fault or the App team's fault. So, who's going to fix this?
PROCESS
Find and fix the real problems

BLAMELESS CULTURE

The root cause is always:

● Systems
● Processes
● Behaviors

System stability depends on finding and fixing the core problems.
Fix problems, not people.
Learn together to avoid repeating mistakes

● Fix what can be fixed
● Prevent what can't be fixed
● Handle what can't be prevented

Google learned that the reliability of the service depends on how people work together to fix problems. Every new system or upgraded system goes through a period of stabilization. During that period you will need to respond to problems, find the root causes, and address the problems. If you stop looking after you have assigned blame to a person, and don't continue digging until you get to the systems, processes, or behaviors that must be changed, you will leave the system broken, and it will not stabilize.

Consider this example:

The service was out.

1. Why was the service out? Because the filesystem was full.
2. Why was the filesystem full? Because the person responsible for archiving old files failed to do so.
3. Why didn't they archive the old files? Was the archive tool broken? No, the tool wasn't broken.
4. Why didn't they archive the old files? The procedure documentation didn't tell them to archive the files.
So the root cause was a problem with the process documentation.

If the analysis had stopped at 2, the person might have been punished without solving the core system problem, which was a missing procedure.

The service doesn't stabilize if you don't find and fix the real problems.
Learning together
Outages happen. And what may be clear to one person may not be clear to another.
Some outcomes would not have been anticipated by anyone.

Blameless
Blame makes people afraid to bring real issues to light and is detrimental to a learning
culture.
People are NEVER the root cause. There is something in the system, in the
processes, or in the behaviors that IS the root cause and needs to be identified and
fixed or mitigated.

Identify the actions that led to the incident.

Stick to the facts.
Keep communications simple. Use the passive voice.
Consider modifying the auditing process.

Reference: SRE Book: Chapter 15 - Postmortem Culture: Learning from Failure

PROCESS
Policy for writing postmortem reports

Always write a report when:

● An SLO is breached
● An incident required an emergency (on call) response from another team
● An impacted team requests a follow-up communication

You should have a policy regarding these reports:

● A draft report should be published within X hours of the incident.
● The report should be completed within Y business days.

It is important that teams learn from mistakes. Each postmortem written and read
reduces the chances of repeating mistakes. Postmortem reports become a method
for training people.
Refresher

[Diagram: Upload Server, Thumbnail Server, and Data Storage Service. Flow stages: Ingest, Image Storage, Thumbnail Conversion (Processing), Thumbnail Storage, Serving Thumbnails, User Experience.]

After systematic and logical troubleshooting, and answering the "five whys", the team determines that the issue is definitely tied to the capacity of the system to generate thumbnails.

The front-end web service is not causing delays. Only the back-end thumbnail generating service is, because it is failing to keep up with demand.

The thumbnail server is running out of CPU.

CPU utilization is non-linear. During busy times, the utilization goes to 100%, which
impacts the end to end response time for the user.
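
One way to see why response time degrades so sharply near 100% utilization is a simple queueing model. The sketch below uses the textbook M/M/1 formula (mean response time = service time / (1 - utilization)). That model is an assumption introduced here for illustration; it is not a measurement of the thumbnail service, and the one-second service time is hypothetical.

```python
def mean_response_time(service_time_s: float, utilization: float) -> float:
    """Mean response time of an M/M/1 queue at the given utilization."""
    if not 0 <= utilization < 1:
        raise ValueError("utilization must be in [0, 1)")
    return service_time_s / (1.0 - utilization)


# Hypothetical 1-second thumbnail conversion at increasing CPU utilization.
for rho in (0.5, 0.8, 0.9, 0.99):
    print(f"{rho:.0%} busy -> {mean_response_time(1.0, rho):.1f} s")
```

Under this model, going from 50% to 99% utilization multiplies the wait by fifty, which matches the pattern of a service that is fine most of the time and very slow at peak.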

Keep in mind that CPU utilization is not to be used as a service level indicator. It is not
a direct measurement of customer pain.
Scale the backend processing of thumbnails

Business Issue: Need to handle more thumbnail processing - must become scalable.

● Add a network load balancer to distribute the traffic to multiple thumbnail servers.

[Diagram: Upload Server, Cloud Load Balancing (Network), Thumbnail Servers, and Data Storage Server. Flow stages: Ingest, Image Storage, Thumbnail Conversion (Processing), Thumbnail Storage, Serving Thumbnails, User Experience.]

So, here was our decision. We decided that if we need to handle more thumbnail processing, it's got to become more scalable. However, we didn't choose to simply throw more CPU and network at it, because that would still leave a single point of failure. Instead, we decided to add a load balancer and scale out the thumbnail servers. The great thing is that it's like a microservice in itself now. Because storage has been isolated to Google Cloud Storage, the same code can be distributed and it doesn't keep track of a queue or anything else. The upload server basically pulls whatever is on the data storage server, and load balances the requests as they come in. Technically, this is probably an internal load balancer, but we'll get into that a little bit later. In this case here, to help us with our greater than 80 percent CPU utilization, we want to distribute traffic requests from our business logic to the application servers in a cluster.
Objectives and Indicators

Objectives | Indicators
Availability, 23/24 hours/day = 95.83% availability | Server up/down time
99% of user operations completed in < 1 minute | End to end latency
Failure to produce a thumbnail < 0.01% (100 errors per million) | Completion errors (log entry)

@ 1m images/day, error budget = 3,000 errors per month
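
The arithmetic behind these objectives can be checked in a few lines. This is a sketch with hypothetical function names; it assumes a 30-day month, which is what the 3,000 figure implies.

```python
def availability_percent(hours_up: int, hours_total: int = 24) -> float:
    """Availability as a percentage of the day."""
    return 100 * hours_up / hours_total


def error_budget(images_per_day: int, errors_per_million: int,
                 days: int = 30) -> int:
    """Allowed failures over `days`, using integer arithmetic."""
    errors_per_day = images_per_day * errors_per_million // 1_000_000
    return errors_per_day * days


print(f"{availability_percent(23):.2f}%")   # 95.83%
print(error_budget(1_000_000, 100))         # 3000
```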

Even though we’ve added a cluster of servers, we haven’t changed anything that the
user can measure. The performance is still a measure of the end to end latency, and
the accuracy of the service is still based on the error logs.

We didn’t need to adjust the SLOs because they are not based on the CPU load of
the backend. Rather, the SLOs are based on the user experience.

The autoscaling will help alleviate the CPU bottleneck, because if the pool gets saturated it will autoscale.
That problem has been resolved. A new problem it reveals, however, is how long it will take for the autoscaling to catch up to the user demand. If the user demand is gradual, autoscaling will have no problem keeping up with demand. But if the demand is extremely bursty, other techniques and settings might be necessary. For example, you might need to keep capacity at N+1 servers to give the pool time to start up another server, change the autoscaling trigger value, or use more sophisticated or custom metrics.
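
The N+1 idea can be sketched as a sizing calculation: provision enough servers to stay under a target utilization, then add one as headroom while a new instance starts. The function name, the request rates, and the per-server capacity below are hypothetical; only the 80 percent utilization figure comes from the scenario above.

```python
import math


def servers_needed(demand_per_s: float, per_server_per_s: float,
                   target_utilization: float, spare: int = 1) -> int:
    """Servers required to stay under the target utilization, plus spares."""
    usable_per_server = per_server_per_s * target_utilization
    return math.ceil(demand_per_s / usable_per_server) + spare


# 100 thumbnails/s, 20 thumbnails/s per server, 80% target, N+1.
print(servers_needed(100, 20, 0.8))            # 8
# Same demand without the spare server.
print(servers_needed(100, 20, 0.8, spare=0))   # 7
```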

YOUR TURN

Design challenge #3
Growth

App logs are growing. Logging server can't keep up

[Diagram: Web Logs, Data Storage Logs, and App Logs are sent to a Logging Server (Ingest, Transform, Append/Store) and written to Cloud Bigtable. Also shown: Storage Service, Daily Cron Batch Job.]

If you expect to quickly outgrow local CPU, what is a way to scale the processing
capability of the logic?

Take a few minutes to design your solution

Problem: Autoscaling of the application servers has produced logs that are outgrowing the processing capacity of the aggregation logging server.

Design a solution.

There are multiple designs possible depending on your assumptions. Your solution might be better than the one shown. The point of this exercise is to "think about the design" to develop your architecting skills.

You can sketch your design in a tool like http://docs.google.com/drawings

One solution

Remember, there are multiple valid solutions to this challenge.

Compare your design with the example solution.

Did your design account for all the elements addressed in the example solution?
App logs are growing. Logging server can't keep up

[Diagram: Web Logs, Data Storage Logs, and App Logs pass through Cloud Load Balancing to Logging Servers in an autoscaling instance group (Ingest, Transform, Append/Store) and are written to Cloud Bigtable. Also shown: Storage Service, Daily Cron Batch Job.]

This proposed solution uses an internal load balancer.

An alternative could be Cloud Dataflow.

GCP lab

Deployment Manager: Adding load balancing

Lab 3: How to move from an instance to an instance template, add an instance group, autoscaling,
and a load balancer. (Echo application).

Lab Deployment

[Diagram: Cloud Load Balancing in front of an autoscaling instance group of Appserver instances on Compute Engine.]

From Everand
Google Associate Cloud Engineer Exam Companion: Q&A with Explanations
SUJAN
No ratings yet


