A Beginner-Friendly Introduction to Kubernetes
With a hands-on MLFlow deployment example
Introduction
In a nutshell, Kubernetes (K8s) is simply a container orchestration framework. What this
essentially means is that K8s is a system designed to automate the lifecycle of
containerized applications, from predictability and scalability to availability.
If you’re using Kubernetes to set up your data science infrastructure, do check out
Saturn Cloud, a scalable, flexible data science platform which offers compute including
GPUs.
Complex applications are often made up of hundreds or even thousands of
microservices. Scaling these microservices up while ensuring availability is an
extremely painful process if we manage all these different components using
custom-written programs or scripts, hence the demand for a proper way of
managing these components.
Cue Kubernetes.
Benefits of Kubernetes
Kubernetes promises to solve the above problem with the following features:
1. High Availability — this simply means that your application will always be up
and running, whether you have a new update to roll out or have some
unexpected pods crashing.
2. Scalability — your application can easily be scaled up or down to keep
performance high as load changes.
3. Disaster Recovery — this ensures that your application will always have the
latest data and states of your application if something unfortunate happens to
your physical or cloud-based infrastructure.
Example K8s setup with a single master and two slave nodes (Illustrated by Author)
Master Node(s)
As its name suggests, the Master node is the boss of the cluster, deciding the cluster
state and what each worker node does. In order to set up a Master node, 4 processes
are required to run on it:
1. API Server
Main entrypoint for users to interact with the cluster (i.e., cluster gateway); it is
where requests are sent when we use kubectl
2. Scheduler
Decides which node the next pod will be spun up on, but does NOT spin up the
pod itself (kubelet does this)
3. Controller Manager
Detects cluster state changes (e.g., pods dying) and tries to restore the cluster
back to its original state
For example, if a pod unexpectedly dies, the Controller Manager makes a request
to the Scheduler to decide which node to spin up a new pod on to replace the dead
one. Kubelet then spins up the new pod.
4. etcd
Cluster BRAIN!
Application data is NOT stored here, only cluster state data. Remember, the
master node does not do the work; it is the brain of the cluster. Specifically, etcd
stores the cluster state information so that the other processes above know what
is going on in the cluster
Slave/Worker Node(s)
Each worker node has to be installed with 3 node processes in order to allow
Kubernetes to interact with it and to independently spin up pods within each node.
The 3 processes required are:
1. Kubelet
In charge of taking configuration files and spinning up the pod using the
container runtime (see below!) installed on the node
2. Container Runtime
The software that actually runs the containers inside each pod (e.g., Docker or
containerd)
3. Kube-proxy
A network proxy that implements part of the Kubernetes Service concept (details
below)
Sits between nodes and forwards the requests intelligently (either intra-node or
inter-node forwarding)
Components of Kubernetes
Now that we know how K8s works, let's look at some of the most common components of
Kubernetes that we will use to deploy our applications.
1. Pod
Smallest unit of K8s and usually houses an instance of your application
2. Service
Because pods are meant to be ephemeral, Service provides a way to “give” pods
a permanent IP address
With Service, if the pod dies, its IP address will not change upon re-creation
Acts almost as a load balancer that routes traffic to pods while maintaining a
static IP
Like load balancers, a Service can also be internal or external: an external
Service is public facing (public IP), while an internal Service is meant for
internal applications (private IP)
3. Ingress
With Services, we may now have a web application exposed on a certain port,
say 8080 on an IP address, say 10.104.35. In practice, it is impractical to access a
public-facing application on http://10.104.35:8080 .
In essence, Ingress exposes HTTP and HTTPS routes from outside the cluster to
services within the cluster [1].
Ingress can also handle SSL termination (a.k.a. SSL offloading), i.e., HTTPS is
terminated at the Ingress so that traffic to the Service and its Pods is in plaintext
That being said, creating an Ingress resource alone has no effect. An Ingress
controller is also required to satisfy an Ingress.
4. Ingress Controller
Load balances incoming traffic to services in the cluster
Also manages egress traffic for services that require communication with
external services
Ingress contains the rules for routing traffic, deciding which Service the incoming
request should route to within the cluster.
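To make this concrete, here is a minimal sketch of an Ingress manifest that routes a hypothetical host name to the MLFlow Service we create later in this article; it assumes an Ingress controller (e.g., NGINX) is already installed in the cluster:
# ingress.yaml (illustrative sketch only)
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: mlflow-ingress
spec:
  rules:
  - host: mlflow.example.com          # hypothetical domain
    http:
      paths:
      - path: /
        pathType: Prefix
        backend:
          service:
            name: mlflow-tracking-server   # the Service defined later in this article
            port:
              number: 5000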
5. ConfigMap
As its name suggests, it is essentially a configuration file that you want exposed
for users to modify
6. Secret
Also a configuration file, but for sensitive information like passwords
Base64-encoded
7. Volumes
Used for persistent data storage
Volumes can be stored locally on the same node running your pods or remotely
(e.g., cloud storage, NFS)
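As a minimal sketch (not how the MLFlow example below stores its state; that lives in Cloud SQL and GCS instead), persistent storage is typically requested through a PersistentVolumeClaim and then mounted into a pod as a volume. All names here are illustrative:
# pvc.yaml (illustrative sketch only)
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: demo-pvc
spec:
  accessModes:
  - ReadWriteOnce
  resources:
    requests:
      storage: 1Gi          # ask the cluster for 1 GiB of persistent storage
---
# A pod then mounts the claim as a volume:
apiVersion: v1
kind: Pod
metadata:
  name: volume-demo
spec:
  containers:
  - name: app
    image: nginx            # any image works for the illustration
    volumeMounts:
    - name: data
      mountPath: /data      # path inside the container
  volumes:
  - name: data
    persistentVolumeClaim:
      claimName: demo-pvc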
8. Deployment
Used to define a blueprint for pods
Deployments usually have replicas such that when any component of the
application dies, there is always a backup
Let’s Practice!
Because this article focuses on understanding the components of K8s themselves
rather than how to set up a K8s cluster, we will simply use minikube to set up our own
local cluster. After that, we will deploy a simple but realistic application: an
MLFlow server.
If you want to follow along with the source code, I have included them in a GitHub
repo here.
MLflow with remote Tracking Server, backend and artifact stores (Image credits: MLFlow documentation)
For those who are unaware, MLFlow is mainly an experiment tracking tool that
allows Data Scientists to track their experiments by logging data and
model artifacts, with the option of deploying their models using a standardized
package defined by MLFlow. For the purposes of this article, we will deploy the
MLFlow tracking web server with a PostgreSQL backend (hosted on Cloud SQL) and
blob store (on Google Cloud Storage).
Before that, we’ll have to install a few things (skip ahead if you already have these
installed).
Installation
1. Docker
2. K8s command line tool, kubectl . Our best friend — we use this to interact with
our K8s cluster, be it minikube, cloud or a hybrid cluster
3. minikube, which we will use to spin up our local cluster
4. [Optional] Power tools for kubectl , kubens and kubectx . Follow this to install.
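With minikube installed, spinning up the local cluster is typically a single command (the Docker driver here is an assumption on my part; any supported driver works):
# Start a local single-node cluster
minikube start --driver=docker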
Once the cluster is up, you can verify that its various components are created with minikube
status . If you have several K8s cluster contexts, make sure you switch to minikube.
# Check context
kubectx
# If not on minikube, switch context
kubectx minikube
With our local cluster setup, let’s start by setting up external components and then
move on to deploying Kubernetes objects.
1. Pull the MLFlow Docker image
We first need a Docker image of the MLFlow web server that we will be deploying.
Unfortunately, MLFlow does not have an official image that we can use on
DockerHub, so I've created one here for everyone to use. Let's pull the image I've
created from DockerHub.
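Assuming the same image tag referenced in the Deployment manifest later on, the pull looks like this:
docker pull davidcjw/example-mlflow:1.0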
2. Create a PostgreSQL instance on Cloud SQL
This will be used to store metadata for the runs logged onto the MLFlow tracking server.
As mentioned earlier, it is easier to create stateful applications outside of your
Kubernetes cluster.
First of all, create an account and project on GCP if you don't already have one. Then, create the instance with the gcloud CLI:
gcloud sql instances create <your_instance_name> \
  --assign-ip \
  --authorized-networks=<your_ip_address>/32 \
  --database-version=POSTGRES_14 \
  --region=<your_region> \
  --cpu=2 \
  --memory=3840MiB \
  --root-password=<your_password>
To find <your_ip_address> , simply Google “what is my ip”. For <your_region> , you can
specify a region that is close to you. For me, I've specified asia-southeast1 .
NOTE! These configs are intended for this example deployment and are not
suitable for production environments. For production, you would want, at a
minimum, multi-zonal availability connected over a private IP.
3. Create a Google Cloud Storage Bucket
This will be used to store data and model artefacts logged by the user. Create a
bucket on GCP and take note of the URI for later. For myself, I’ve created one at
gs://example-mlflow-artefacts using the following command:
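With the Google Cloud SDK installed, a gsutil invocation along these lines creates the bucket (the region is just the one I mentioned above; any region works):
gsutil mb -l asia-southeast1 gs://example-mlflow-artefacts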
Now, the exciting part: deploying onto our Kubernetes cluster the various
components that are needed. Before that, it's absolutely essential to know a few
things about K8s objects.
Kubernetes resources are created using .yaml files with specific formats (refer to the
Kubernetes documentation [2] for any resource type you’re creating). They are used to
define what containerized applications are running on which port and more
importantly, the policies around how those applications behave.
apiVersion : the version of the Kubernetes API used to create the object (e.g. v1 , apps/v1 )
kind : defines the component type (e.g. Secret, ConfigMap, Pod, etc)
metadata : data that uniquely identifies an object, including name , UID and
namespace (more about this in the future!)
spec : the desired state of the object, i.e. the actual configuration of the component
4a. Let’s start with the ConfigMap as these configurations will be needed when we
deploy our MLFlow application using Deployment (NOTE: Order of resource creation
matters, especially when there are configurations or secrets attached to deployments).
# configmap.yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: mlflow-configmap
data:
  DEFAULT_ARTIFACT_ROOT: <your_gs_uri>
  DB_NAME: postgres
  DB_USERNAME: postgres
  DB_HOST: <your_cloud_sql_public_ip>
💡 Pro Tip! Always have a tab of the official K8s documentation open so you can
reference the example .yaml file they have for each K8s component.
4b. Next, let’s create one for Secrets. Note that secrets have to be base64-encoded. It
can simply be done using:
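For example (echo -n avoids encoding a trailing newline):
echo -n '<your_password>' | base64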
The only thing that we have to encode is the password for our PostgreSQL instance,
defined earlier when we created it on Cloud SQL. Let's base64-encode that
and copy the stdout into the .yaml file below.
# secrets.yaml
apiVersion: v1
kind: Secret
metadata:
  name: mlflow-postgresql-credentials
type: Opaque
data:
  postgresql-password: <your_base64_encoded_password>
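Since order of creation matters, the ConfigMap and Secret should be applied before the Deployment that references them. Assuming the manifests sit in a k8s/ folder, like the deployment file applied later in this article:
kubectl apply -f k8s/configmap.yaml
kubectl apply -f k8s/secrets.yaml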
5a. Let’s start with Deployment. To understand deployments, let’s take a step back
and recall that the main difference between Deployment and Pod is that the former
helps to create replicas of the pod that will be deployed. As such, the yaml file for
Deployment consists of the configurations for the Pod, as well as the number of
replicas we want to create.
If we take a look at the yaml file below, we notice metadata and spec appearing
twice in the configuration, the first time at the top of the config file and the second
time below the “template” key. This is because everything defined BELOW the
“template” key is used for the Pod configuration.
# deployment.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: mlflow-tracking-server
  labels:
    app: mlflow-tracking-server
spec:
  replicas: 1
  selector:
    matchLabels:
      app: mlflow-tracking-server-pods
  template:
    metadata:
      labels:
        app: mlflow-tracking-server-pods
    spec:
      containers:
      - name: mlflow-tracking-server-pod
        image: davidcjw/example-mlflow:1.0
        ports:
        - containerPort: 5000
        resources:
          limits:
            memory: 1Gi
            cpu: "2"
          requests:
            memory: 1Gi
            cpu: "1"
        imagePullPolicy: Always
        env:
        - name: DB_PASSWORD
          valueFrom:
            secretKeyRef:
              name: mlflow-postgresql-credentials
              key: postgresql-password
        - name: DB_USERNAME
          valueFrom:
            configMapKeyRef:
              name: mlflow-configmap
              key: DB_USERNAME
        - name: DB_HOST
          valueFrom:
            configMapKeyRef:
              name: mlflow-configmap
              key: DB_HOST
        - name: DB_NAME
          valueFrom:
            configMapKeyRef:
              name: mlflow-configmap
              key: DB_NAME
        - name: DEFAULT_ARTIFACT_ROOT
          valueFrom:
            configMapKeyRef:
              name: mlflow-configmap
              key: DEFAULT_ARTIFACT_ROOT
Two important questions to answer: 1) How do the pod replicas group together to be
identified as one by the Deployment? 2) How does the Deployment know which group
of pod replicas belong to it?
1. template > metadata > labels : Unlike other components like ConfigMap and
Secret, this metadata key labels is mandatory because each pod replica created
under this deployment will have a unique ID (e.g., mlflow-tracking-xyz, mlflow-
tracking-abc). To be able to collectively identify them as a group, labels are used
so that each of these pod replicas receives the same set of labels.
2. selector > matchLabels : Used to determine which group of pods are under this
deployment. Note that the labels here have to exactly match the labels in (1).
containers > image : the image that will be used by each pod
containers > env : here is where we specify the environment variables that will
be initialized in each pod, referenced from the ConfigMap and Secret we have
created earlier.
5b. Service — As mentioned above, Service is used almost like a load balancer to
distribute traffic to each of the pod replicas. As such, here are some important
things to note about Service.
selector : This key-value pair should match the template > metadata > labels
specified earlier in Deployment, so that Service knows which set of pods to route
the request to.
type : This defaults to ClusterIP , which is the internal IP address of the cluster
(a list of other service types can be found here). For our use case, we will
use NodePort to expose our web application on a port of our node's IP address.
Do note that the value for nodePort can only be between 30000–32767.
targetPort : This refers to the port that your pod is exposing the application on,
which is specified in Deployment.
apiVersion: v1
kind: Service
metadata:
  labels:
    app: mlflow-tracking-server
  name: mlflow-tracking-server
spec:
  type: NodePort
  selector:
    app: mlflow-tracking-server-pods
  ports:
  - port: 5000
    protocol: TCP
    targetPort: 5000
    nodePort: 30001
You can in fact put several .yaml configurations in one file — specifically the
Deployment and Service configurations, since we will be applying those changes
together. To do so, simply use a --- to demarcate these two configs in one file:
# deployment.yaml
apiVersion: apps/v1
kind: Deployment
...
---
apiVersion: v1
kind: Service
...
Finally, we apply these changes using kubectl apply -f k8s/deployment.yaml .
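Before accessing the server, a quick sanity check (in the default namespace) confirms the resources came up as expected:
kubectl get deployments,pods,services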
Congrats! You can now access your MLFlow server at <node_IP>:<nodePort> . Here’s
how to find out what your node_IP is:
# one option: list the node together with its internal IP
kubectl get nodes -o wide
# or equivalently:
minikube ip
If, like me, you're using the Docker driver on macOS (Darwin) or Windows/WSL, the node IP
will not be directly reachable using the above method. Complete steps 4 and 5 listed
in this link to access your application.
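Alternatively, minikube itself can tunnel the NodePort Service and print a reachable URL (keep the terminal open while you use it):
minikube service mlflow-tracking-server --url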
Cleaning Up
Finally, we’re done with our test application and cleaning up is as simple as minikube
delete --all .
Final Words
Thanks for reading and hope this helps you in your understanding of Kubernetes.
Please let me know if you spot any mistakes or if you would like to know more in
another article!
Support me! — If you like my content and are not subscribed to Medium, do consider
supporting me and subscribing via my referral link here (NOTE: a portion of your
membership fees will be apportioned to me as referral fees).
References
[1] What is Ingress?