Set up Google Kubernetes Engine Pods using automatic Envoy injection
Overview
In a service mesh, your application code doesn't need to know about your networking configuration. Instead, your applications communicate over a data plane, which is configured by a control plane that handles service networking. In this guide, Cloud Service Mesh is your control plane and the Envoy sidecar proxies are your data plane.
The Google managed Envoy sidecar injector adds Envoy sidecar proxies to your Google Kubernetes Engine Pods. When the Envoy sidecar injector adds a proxy, it also sets that proxy up to handle application traffic and connect to Cloud Service Mesh for configuration.
The guide walks you through a simple setup of Cloud Service Mesh with Google Kubernetes Engine. These steps provide the foundation that you can extend to advanced use cases, such as a service mesh that extends across multiple Google Kubernetes Engine clusters and, potentially, Compute Engine VMs. You can also use these instructions if you are configuring Cloud Service Mesh with Shared VPC.
The setup process involves:
- Creating a GKE cluster for your workloads.
- Installing the Envoy sidecar injector and enabling injection.
- Deploying a sample client and verifying injection.
- Deploying a Kubernetes service for testing.
- Configuring Cloud Service Mesh with Cloud Load Balancing components to route traffic to the test service.
- Verifying the configuration by sending a request from the sample client to the test service.
Prerequisites
Before you follow the instructions in this guide, complete the prerequisite tasks described in Prepare to set up on service routing APIs with Envoy and proxyless workloads.
For information about the Envoy version that is supported, see the Cloud Service Mesh release notes.
Additional prerequisites with Shared VPC
If you are setting up Cloud Service Mesh in a Shared VPC environment, make sure of the following.
- You have the correct permissions and roles for Shared VPC.
- You have set up the correct projects and billing.
- You have enabled billing in the projects.
- You have enabled the Cloud Service Mesh and GKE APIs in each project, including the host project.
- You have set up the correct service accounts for each project.
- You have created a VPC network and subnets.
- You have enabled Shared VPC.
For more information, see Shared VPC.
Configure IAM roles
This example of IAM role configuration assumes that the host project for Shared VPC has two subnets and there are two service projects in the Shared VPC.
In Cloud Shell, create a working folder (
WORKDIR)
where you create the files associated with this section:mkdir -p ~/td-shared-vpc cd ~/td-shared-vpc export WORKDIR=$(pwd)
Configure IAM permissions in the host project so that service projects can use the resources in the shared VPC.
In this step, you configure the IAM permissions so that
subnet-1
is accessible by service project 1 andsubnet-2
is accessible by service project 2. You assign the Compute Network User IAM role (roles/compute.networkUser
) to both the Compute Engine compute default service account and the Google Cloud API service account in each service project for each subnet.For service project 1, configure IAM permissions for
subnet-1
:export SUBNET_1_ETAG=$(gcloud beta compute networks subnets get-iam-policy subnet-1 --project ${HOST_PROJECT} --region ${REGION_1} --format=json | jq -r '.etag') cat > subnet-1-policy.yaml <<EOF bindings: - members: - serviceAccount:${SVC_PROJECT_1_API_SA} - serviceAccount:${SVC_PROJECT_1_GKE_SA} role: roles/compute.networkUser etag: ${SUBNET_1_ETAG} EOF gcloud beta compute networks subnets set-iam-policy subnet-1 \ subnet-1-policy.yaml \ --project ${HOST_PROJECT} \ --region ${REGION_1}
For service project 2, configure IAM permissions for
subnet-2
:export SUBNET_2_ETAG=$(gcloud beta compute networks subnets get-iam-policy subnet-2 --project ${HOST_PROJECT} --region ${REGION_2} --format=json | jq -r '.etag') cat > subnet-2-policy.yaml <<EOF bindings: - members: - serviceAccount:${SVC_PROJECT_2_API_SA} - serviceAccount:${SVC_PROJECT_2_GKE_SA} role: roles/compute.networkUser etag: ${SUBNET_2_ETAG} EOF gcloud beta compute networks subnets set-iam-policy subnet-2 \ subnet-2-policy.yaml \ --project ${HOST_PROJECT} \ --region ${REGION_2}
For each service project, you must grant the Kubernetes Engine Host Service Agent User IAM role (
roles/container.hostServiceAgentUser
) to the GKE service account in the host project:gcloud projects add-iam-policy-binding ${HOST_PROJECT} \ --member serviceAccount:${SVC_PROJECT_1_GKE_SA} \ --role roles/container.hostServiceAgentUser gcloud projects add-iam-policy-binding ${HOST_PROJECT} \ --member serviceAccount:${SVC_PROJECT_2_GKE_SA} \ --role roles/container.hostServiceAgentUser
This role lets the GKE service account of the service project use the GKE service account of the host project to configure shared network resources.
For each service project, grant the Compute Engine default service account the Compute Network Viewer IAM role (
roles/compute.networkViewer
) in the host project.gcloud projects add-iam-policy-binding ${SVC_PROJECT_1} \ --member serviceAccount:${SVC_PROJECT_1_COMPUTE_SA} \ --role roles/compute.networkViewer gcloud projects add-iam-policy-binding ${SVC_PROJECT_2} \ --member serviceAccount:${SVC_PROJECT_2_COMPUTE_SA} \ --role roles/compute.networkViewer
When the Envoy sidecar proxy connects to the xDS service (Traffic Director API), the proxy uses the service account of the Compute Engine virtual machine (VM) host or of the GKE node instance. The service account must have the
compute.globalForwardingRules.get
project-level IAM permission. The Compute Network Viewer role is sufficient for this step.
Configure project information
If you haven't created Google Cloud Project or installed Google Cloud CLI yet, follow these instructions. If you haven't installed kubectl yet, follow these instructions.
# The project that contains your GKE cluster. export CLUSTER_PROJECT_ID=YOUR_CLUSTER_PROJECT_NUMBER_HERE # The name of your GKE cluster. export CLUSTER=YOUR_CLUSTER_NAME # The channel of your GKE cluster. Eg: rapid, regular, stable. This channel # should match the channel of your GKE cluster. export CHANNEL=YOUR_CLUSTER_CHANNEL # The location of your GKE cluster, Eg: us-central1 for regional GKE cluster, # us-central1-a for zonal GKE cluster export LOCATION=ZONE # The network name of the traffic director load balancing API. export MESH_NAME=default # The project that holds the mesh resources. export MESH_PROJECT_NUMBER=YOUR_PROJECT_NUMBER_HERE export TARGET=projects/${MESH_PROJECT_NUMBER}/global/networks/${MESH_NAME} gcloud config set project ${CLUSTER_PROJECT_ID}
If you are using the new service routing APIs, use the following instructions to set MESH_NAME
,
MESH_PROJECT_NUMBER
and TARGET
:
# The mesh name of the traffic director load balancing API. export MESH_NAME=YOUR_MESH_NAME # The project that holds the mesh resources. export MESH_PROJECT_NUMBER=YOUR_PROJECT_NUMBER_HERE export TARGET=projects/${MESH_PROJECT_NUMBER}/locations/global/meshes/${MESH_NAME}
In most scenarios, CLUSTER_PROJECT_ID
and MESH_PROJECT_NUMBER
refer to the
same project. However, if you set up the
different project, such as when using a Shared VPC, the
CLUSTER_PROJECT_ID
refers to the project ID that contains your GKE
cluster, and the MESH_PROJECT_NUMBER
refers to the project number
that contains the resources.
Ensure that you have configured the appropriate permissions to allow injected
envoy to retrieve configurations from the
Enable Mesh Config API
Enable the following API to get started with Google managed sidecar injector.
gcloud services enable --project=${CLUSTER_PROJECT_ID} meshconfig.googleapis.com
Creating a GKE cluster for your workloads
GKE clusters must meet the following requirements to support Cloud Service Mesh:
- Network endpoint groups support must be enabled. For more information and examples, refer to Standalone network endpoint groups.
- The service account for your GKE nodes/pods must have permission to access the Traffic Director API. For more information about the required permissions, see Enable the service account to access the Traffic Director API.
Creating the GKE cluster
Create a GKE cluster in your preferred zone, for example,
us-central1-a
.
gcloud container clusters createYOUR_CLUSTER_NAME \ --zoneZONE \ --scopes=https://www.googleapis.com/auth/cloud-platform \ --enable-ip-alias
Pointing kubectl to the newly created cluster
Change the current context for kubectl
to the newly created cluster by issuing
the following command:
gcloud container clusters get-credentials traffic-director-cluster \ --zoneZONE
Apply the configurations for Mutating Webhook
The following sections provide instructions apply the MutatingWebhookConfiguration to the cluster. When a pod is created, the in-cluster admission controller is invoked. The admission controller, talks to the managed sidecar injector to add the Envoy container to the pod.
Apply the following mutating webhook configurations to your cluster.
cat <<EOF | kubectl apply -f -
apiVersion: admissionregistration.k8s.io/v1
kind: MutatingWebhookConfiguration
metadata:
labels:
app: sidecar-injector
name: td-mutating-webhook
webhooks:
- admissionReviewVersions:
- v1beta1
- v1
clientConfig:
url: https://meshconfig.googleapis.com/v1internal/projects/${CLUSTER_PROJECT_ID}/locations/${LOCATION}/clusters/${CLUSTER}/channels/${CHANNEL}/targets/${TARGET}:tdInject
failurePolicy: Fail
matchPolicy: Exact
name: namespace.sidecar-injector.csm.io
namespaceSelector:
matchExpressions:
- key: td-injection
operator: Exists
reinvocationPolicy: Never
rules:
- apiGroups:
- ""
apiVersions:
- v1
operations:
- CREATE
resources:
- pods
scope: '*'
sideEffects: None
timeoutSeconds: 30
EOF
Enabling sidecar injection
The following command enables injection for the default
namespace. The
sidecar injector injects sidecar containers to pods created under this
namespace:
kubectl label namespace default td-injection=enabled
You can verify that the default
namespace is properly enabled by running
the following command:
kubectl get namespace -L td-injection
This should return:
NAME STATUS AGE TD-INJECTION default Active 7d16h enabled
If you are configuring service security for Cloud Service Mesh with Envoy, return to the section Setting up a test service in the that setup guide.
Deploying a sample client and verifying injection
This section shows how to deploy a sample pod running Busybox, which provides a simple interface for reaching a test service. In a real deployment, you would deploy your own client application instead.
cat <<EOF | kubectl apply -f -
apiVersion: apps/v1
kind: Deployment
metadata:
labels:
run: client
name: busybox
spec:
replicas: 1
selector:
matchLabels:
run: client
template:
metadata:
labels:
run: client
spec:
containers:
- name: busybox
image: busybox
command:
- sh
- -c
- while true; do sleep 1; done
EOF
The Busybox pod consists of two containers. The first container is the client based on the Busybox image and the second container is the Envoy proxy injected by the sidecar injector. You can get more information about the pod by running the following command:
kubectl describe pods -l run=client
This should return:
… Init Containers: # Istio-init sets up traffic interception for the pod. Istio-init: … Containers: # busybox is the client container that runs application code. busybox: … # Envoy is the container that runs the injected Envoy proxy. envoy: …
Cloud Service Mesh Proxy
The managed sidecar injector will use Cloud Service Mesh Proxy image as the proxy. The Cloud Service Mesh Proxy is a sidecar container responsible for starting an envoy proxy for mesh-enabled instances. The proxy image uses the OSS envoy image along with a proxy agent responsible for starting the envoy, providing bootstrap configuration, and healthchecking of the envoy. The Cloud Service Mesh Proxy image versions align with the OSS Envoy version. You can track the available proxy images here: https://gcr.io/gke-release/asm/csm-mesh-proxy
The Cloud Service Mesh Mesh Proxy which gets injected varies based on the channel the user has chosen for the GKE cluster. The Envoy version is regularly updated based on the current OSS Envoy releases and is tested with the specific GKE release to ensure compatibility.
Cloud Service Mesh Proxy version
The following table shows the current GKE cluster channel to Cloud Service Mesh Proxy version mapping:
Channel | Cloud Service Mesh Proxy Version |
---|---|
Rapid | 1.29.9-gke.3 |
Regular | 1.28.7-gke.3 |
Stable | 1.27.7-gke.3 |
Cloud Service Mesh Proxy upgrade
Upgrading to the latest version is highly recommended. Although the service mesh is fine when the control plane and proxies are at different versions, we recommend that you update the proxies so that they are configured with the new Cloud Service Mesh version.
The managed sidecar injector takes care of Envoy's version which always injects the latest Envoy version qualified by Google. If the Cloud Service Mesh Proxy version is newer than the proxy version, restart the proxies for your services.
kubectl rollout restart deployment -nYOUR_NAMESPACE_HERE
Deploying a Kubernetes service for testing
The following sections provide instructions for setting up a test service that you use later in this guide to provide end-to-end verification of your setup.
Configuring GKE services with NEGs
GKE services must be exposed through network endpoint groups
(NEGs) so that you can configure them as backends of a Cloud Service Mesh backend
service. Add the NEG annotation to your Kubernetes service specification and
choose a name (by replacing NEG-NAME
in the sample below) so that you can find
it easily later. You need the name when you attach the NEG to your
Cloud Service Mesh backend service. For more information on annotating NEGs, see
Naming NEGs.
... metadata: annotations: cloud.google.com/neg: '{"exposed_ports": {"80":{"name": "service-test-neg"}}}' spec: ports: - port: 80 name: service-test protocol: TCP targetPort: 8000
This annotation creates a standalone NEG containing endpoints corresponding with the IP addresses and ports of the service's pods. For more information and examples, refer to Standalone network endpoint groups.
The following sample service includes the NEG annotation. The service serves
the hostname over HTTP on port 80
. Use the following command to get the
service and deploy it to your GKE cluster.
wget -q -O - \ https://storage.googleapis.com/traffic-director/demo/trafficdirector_service_sample.yaml \ | kubectl apply -f -
Verify that the new service is created and the application pod is running:
kubectl get svc
The output should be similar to the following:
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE service-test ClusterIP 10.71.9.71 none 80/TCP 41m [..skip..]
Verify that the application pod associated with this service is running:
kubectl get pods
NAME READY STATUS RESTARTS AGE app1-6db459dcb9-zvfg2 2/2 Running 0 6m busybox-5dcf86f4c7-jvvdd 2/2 Running 0 10m [..skip..]
Saving the NEG's name
Find the NEG created from the example above and record its name for Cloud Service Mesh configuration in the next section.
gcloud compute network-endpoint-groups list
This returns the following:
NAME LOCATION ENDPOINT_TYPE SIZE service-test-negZONE GCE_VM_IP_PORT 1
Save the NEG's name in the NEG_NAME variable:
NEG_NAME=$(gcloud compute network-endpoint-groups list \ | grep service-test | awk '{print $1}')
Configuring Cloud Service Mesh with Cloud Load Balancing components
This section configures Cloud Service Mesh using Compute Engine load balancing resources. This enables the sample client's sidecar proxy to receive configuration from Cloud Service Mesh. Outbound requests from the sample client are handled by the sidecar proxy and routed to the test service.
You must configure the following components:
- A health check. For more information on health checks, read Health Check Concepts and Creating Health Checks.
- A backend service. For more information on backend services, read Backend Services.
- A routing rule map. This includes creating a forwarding rule, a target HTTP proxy and a URL map. For more information, read Using forwarding rules for Cloud Service Mesh, Using target proxies for Cloud Service Mesh, and Using URL maps.
Creating the health check and firewall rule
Use the following instructions to create a health check and the firewall rule that is required for the health check probes. For more information, see Firewall rules for health checks.
- Go to the Health checks page in the Google Cloud console.
Go to the Health checks page - Click Create Health Check.
- For the name, enter
td-gke-health-check
. - For the protocol, select HTTP.
Click Create.
Go to the Firewall policies page in the Google Cloud console.
Go to the Firewall policies pageClick Create firewall rules.
On the Create a firewall rule page, supply the following information:
- Name: Provide a name for the rule. For this example, use
fw-allow-health-checks
. - Network: Choose a VPC network.
- Priority: Enter a number for the priority. Lower numbers have higher priorities. Be sure that the firewall rule has a higher priority than other rules that might deny ingress traffic.
- Direction of traffic: Choose Ingress.
- Action on match: Choose Allow.
- Targets: Choose All instances in the network.
- Source filter: Choose the correct IP range type.
- Source IP ranges:
35.191.0.0/16,130.211.0.0/22
- Destination filter: Select the IP type.
- Protocols and ports: Click Specified ports and protocols,
then check
tcp
. TCP is the underlying protocol for all health check protocols. - Click Create.
- Name: Provide a name for the rule. For this example, use
Create the health check.
gcloud compute health-checks create http td-gke-health-check \ --use-serving-port
Create the firewall rule to allow the health checker IP address ranges.
gcloud compute firewall-rules create fw-allow-health-checks \ --action ALLOW \ --direction INGRESS \ --source-ranges 35.191.0.0/16,130.211.0.0/22 \ --rules tcp
Creating the backend service
Create a global backend service
with a load balancing scheme of INTERNAL_SELF_MANAGED
. In the
Google Cloud console, the load balancing scheme is set implicitly. Add the
health check to the backend service.
Go to the Cloud Service Mesh page in the Google Cloud console.
On the Services tab, click Create Service.
Click Continue.
For the service name, enter
td-gke-service
.Select Network, which you configured in the Cloud Service Mesh ConfigMap.
Under Backend type, select Network endpoint groups.
Select the network endpoint group you created.
Set the Maximum RPS to
5
.Set the Balancing mode to Rate.
Click Done.
Under Health check, select
td-gke-health-check
, which is the health check you created.Click Continue.
Create the backend service and associate the health check with the backend service.
gcloud compute backend-services create td-gke-service \ --global \ --health-checks td-gke-health-check \ --load-balancing-scheme INTERNAL_SELF_MANAGED
Add the previously created NEG as a backend to the backend service. If you are configuring Cloud Service Mesh with a target TCP proxy, you must use
UTILIZATION
balancing mode. If you are using an HTTP or HTTPS target proxy, you can useRATE
mode.gcloud compute backend-services add-backend td-gke-service \ --global \ --network-endpoint-group ${NEG_NAME} \ --network-endpoint-group-zone
ZONE \ --balancing-mode [RATE | UTILIZATION] \ --max-rate-per-endpoint 5
Creating the routing rule map
The routing rule map defines how Cloud Service Mesh routes traffic in your mesh. As part of the routing rule map, you configure a virtual IP (VIP) address and a set of associated traffic management rules, such as host-based routing. When an application sends a request to the VIP, the attached Envoy sidecar proxy does the following:
- Intercepts the request.
- Evaluates it according to the traffic management rules in the URL map.
- Selects a backend service based on the hostname in the request.
- Chooses a backend or endpoint associated with the selected backend service.
- Sends traffic to that backend or endpoint.
In the console, the target proxy is combined with the forwarding rule. When you create the forwarding rule, Google Cloud automatically creates a target HTTP proxy and attaches it to the URL map.
The route rule consist of the forwarding rule and the host and path rules (also known as the URL map).
Go to the Cloud Service Mesh page in the Google Cloud console.
Click Routing rule maps
Click Create Routing Rule.
Enter
td-gke-url-map
as the Name of the URL map.Click Add forwarding rule.
For the forwarding rule name, enter
td-gke-forwarding-rule
.Select your network.
Select your Internal IP.
Click Save.
Optionally, add custom host and path rules or leave the path rules as the defaults.
Set the host to
service-test
.Click Save.
Create a URL map that uses
td-gke-service
as the default backend service.gcloud compute url-maps create td-gke-url-map \ --default-service td-gke-service
Create a URL map path matcher and a host rule to route traffic for your service based on hostname and a path. This example uses
service-test
as the service name and a default path matcher that matches all path requests for this host (/*
).gcloud compute url-maps add-path-matcher td-gke-url-map \ --default-service td-gke-service \ --path-matcher-name td-gke-path-matcher gcloud compute url-maps add-host-rule td-gke-url-map \ --hosts service-test \ --path-matcher-name td-gke-path-matcher
Create the target HTTP proxy.
gcloud compute target-http-proxies create td-gke-proxy \ --url-map td-gke-url-map
Create the forwarding rule.
gcloud compute forwarding-rules create td-gke-forwarding-rule \ --global \ --load-balancing-scheme=INTERNAL_SELF_MANAGED \ --address=0.0.0.0 \ --target-http-proxy=td-gke-proxy \ --ports 80 --network default
At this point, Cloud Service Mesh configures your sidecar proxies to route
requests that specify the service-test
hostname to backends of
td-gke-service
. In this case, those backends are endpoints in the network
endpoint group associated with the Kubernetes test service that you deployed
earlier.
Verifying the configuration
This section shows how to verify that traffic sent from the sample Busybox
client is routed to your service-test
Kubernetes service. To send a test
request, you can access a shell on one of the containers and execute the
following verification command. A service-test
Pod should return the hostname
of the serving pod.
# Get the name of the pod running Busybox. BUSYBOX_POD=$(kubectl get po -l run=client -o=jsonpath='{.items[0].metadata.name}') # Command to execute that tests connectivity to the service service-test at # the VIP 10.0.0.1. Because 0.0.0.0 is configured in the forwarding rule, this # can be any VIP. TEST_CMD="wget -q -O - 10.0.0.1; echo" # Execute the test command on the pod. kubectl exec -it $BUSYBOX_POD -c busybox -- /bin/sh -c "$TEST_CMD"
Here's how the configuration is verified:
- The sample client sent a request that specified the
service-test
hostname. - The sample client has an Envoy sidecar proxy that was injected by the Envoy sidecar injector.
- The sidecar proxy intercepted the request.
- Using the URL map, the Envoy matched the
service-test
hostname to thetd-gke-service
Cloud Service Mesh service. - The Envoy chose an endpoint from the network endpoint group associated with
td-gke-service
. - The Envoy sent the request to a pod associated with the
service-test
Kubernetes service.
How to Migrate to Managed Sidecar Injector
This tutorial guides you through migrating an application from the legacy Cloud Service Mesh sidecar injector on GKE (with an in-cluster sidecar injector) to one using a managed sidecar injector.
Disabling in-cluster sidecar injection
The following commands disable the legacy in-cluster sidecar injector for the default namespace
kubectl label namespace default istio-injection-
Cleanup in-cluster sidecar injector
Download and extract the legacy Envoy sidecar injector.
wget https://storage.googleapis.com/traffic-director/td-sidecar-injector-xdsv3.tgz tar -xzvf td-sidecar-injector-xdsv3.tgz cd td-sidecar-injector-xdsv3
Delete in-cluster sidecar injector resources
kubectl delete -f specs/
What's next
- Learn about advanced traffic management
- Learn about Cloud Service Mesh service security.
- Learn how to set up observability with Envoy.
- Learn how to troubleshoot Cloud Service Mesh deployments.
- Learn about options for Google Kubernetes Engine Pod setup with automatic Envoy injection.