Kernel Functions
Tejumade Afonja
Jan 2, 2017 · 6 min read
Lately, I have been doing some reading up on machine learning, and kernels happen to be an interesting part of classification problems. Before I go further: this topic was inspired by a Medium post written by Alan, Do-it-Yourself NLP for Bot Developers. Thanks A.
So what exactly is “machine learning” (ML)? Well, it turns out that ML is actually a lot of things, but the overarching theme is best summed up by this oft-quoted statement made by Arthur Samuel way back in 1959:
“Machine Learning is the field of study that gives computers the ability to learn without
being explicitly programmed.”
Among the different types of ML tasks is what we call supervised learning (SL). This is a situation where you put in data you already have answers to. For example, to predict whether a dog is a particular breed, we load in millions of dog records with properties like type, height, skin color, body hair length, etc. In ML lingo, these properties are referred to as ‘features’. A single entry in this list of features is a data instance, while the collection of everything is the Training Data, which forms the basis of your prediction: if you know the skin color, body hair length, height, and so on of a particular dog, then you can predict the breed it will probably belong to.
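As a rough sketch of what such training data might look like in code (the feature values and breed labels below are invented purely for illustration):

```python
# A toy supervised-learning dataset: each row of X is one data instance,
# each column is a feature, and y holds the answers (breed labels).
# All values here are made up for illustration.
X = [
    # height_cm, hair_length_cm, weight_kg
    [60.0, 5.0, 30.0],
    [25.0, 8.0,  7.0],
    [58.0, 4.5, 28.0],
    [23.0, 7.5,  6.5],
]
y = ["labrador", "shih_tzu", "labrador", "shih_tzu"]
```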
Before we can jump into kernels, we need to understand what a support vector machine is. Support Vector Machines, or SVMs, are supervised learning models with associated learning algorithms that analyze data for classification (classification means knowing what belongs to what, e.g. ‘apple’ belongs to class ‘fruit’ while ‘dog’ belongs to class ‘animal’; see Fig. 1).
Fig. 1
In a support vector machine, it looks somewhat like Fig. 2 below :) where a straight line separates the blue balls from the red ones.
Fig. 2
For the earlier example of predicting the breed of a particular dog, the pipeline goes like this:
Data (all breeds of dog) → Features (skin color, hair, etc.) → Learning algorithm
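A minimal sketch of that pipeline with scikit-learn (assuming scikit-learn is available; the synthetic blob data stands in for the dog features):

```python
# Minimal Data -> Features -> Learning algorithm pipeline with a linear SVM.
# make_blobs generates two well-separated clusters, playing the role of
# the blue and red balls in Fig. 2.
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

X, y = make_blobs(n_samples=100, centers=2, random_state=0)  # data + labels
clf = SVC(kernel="linear")  # a linear SVM separates the classes with a straight line
clf.fit(X, y)               # the learning step

print(clf.predict(X[:5]))   # classify a few points
```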
So why Kernels?
Fig. 3
Can you solve the problem above linearly, the way we did in Fig. 2?
NO!
The red and blue balls cannot be separated by a straight line, as they are randomly distributed, and this, in reality, is how the data in most real-life problems is distributed.
In machine learning, a “kernel” usually refers to the kernel trick, a method of using a linear classifier to solve a non-linear problem. It entails transforming linearly inseparable data (Fig. 3) into linearly separable data (Fig. 2). The kernel function is applied to each data instance to map the original non-linear observations into a higher-dimensional space in which they become separable.
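A quick way to see this in code (a sketch using scikit-learn’s make_circles, which produces concentric rings like Fig. 3 that no straight line can separate):

```python
# Linearly inseparable data (concentric circles, like Fig. 3):
# a linear SVM struggles, while an RBF-kernel SVM separates it easily,
# because the kernel implicitly maps the points to a higher-dimensional
# space where they become linearly separable.
from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

linear_acc = SVC(kernel="linear").fit(X, y).score(X, y)
rbf_acc = SVC(kernel="rbf").fit(X, y).score(X, y)

print(f"linear kernel accuracy: {linear_acc:.2f}")  # roughly chance level
print(f"RBF kernel accuracy:    {rbf_acc:.2f}")     # near 1.0
```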
Using the dog breed prediction example again, kernels offer a better alternative. Instead of defining a slew of features, you define a single kernel function that computes the similarity between breeds of dog. You provide this kernel, together with the data and labels, to the learning algorithm, and out comes a classifier.
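scikit-learn lets you hand the learning algorithm a kernel directly as a Python callable. A sketch (the similarity function here is a standard polynomial kernel, not anything dog-specific):

```python
# Supplying your own kernel function to the learning algorithm.
# SVC accepts a callable that, given two data matrices, returns the
# matrix of pairwise similarities K(x, y).
import numpy as np
from sklearn.datasets import make_circles
from sklearn.svm import SVC

def my_kernel(A, B):
    # A simple polynomial kernel: K(x, y) = (<x, y> + 1)^2
    return (A @ B.T + 1) ** 2

X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)
clf = SVC(kernel=my_kernel).fit(X, y)  # kernel + data + labels in, classifier out
print(f"accuracy: {clf.score(X, y):.2f}")
```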
Mathematical definition: K(x, y) = <f(x), f(y)>. Here K is the kernel function, x and y are n-dimensional inputs, f is a map from n-dimensional to m-dimensional space, and <x, y> denotes the dot product. Usually, m is much larger than n.
Intuition: normally, calculating <f(x), f(y)> requires us to compute f(x) and f(y) first and then take their dot product. These two steps can be quite expensive, as they involve manipulations in the m-dimensional space, where m can be a large number. But after all the trouble of going to the high-dimensional space, the result of the dot product is really a scalar: we come back to one-dimensional space again! Now, the question is: do we really need to go through all this trouble to get one number? Do we really have to go to the m-dimensional space? The answer is no, if you find a clever kernel.
Simple example: let x = (x1, x2, x3) and y = (y1, y2, y3). Then for the function f(x) = (x1x1, x1x2, x1x3, x2x1, x2x2, x2x3, x3x1, x3x2, x3x3), the kernel is K(x, y) = (<x, y>)².
Let’s plug in some numbers to make this more intuitive: suppose x = (1, 2, 3) and y = (4, 5, 6). Then:
f(x) = (1, 2, 3, 2, 4, 6, 3, 6, 9)
f(y) = (16, 20, 24, 20, 25, 30, 24, 30, 36)
<f(x), f(y)> = 16 + 40 + 72 + 40 + 100 + 180 + 72 + 180 + 324 = 1024
Using the kernel instead: K(x, y) = (4 + 10 + 18)² = 32² = 1024. Same answer, but we never had to leave the 3-dimensional space.
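A few lines of Python (a sketch using NumPy) confirm that both routes give the same number:

```python
# Verify the worked example: explicit mapping vs. the kernel shortcut.
import numpy as np

x = np.array([1, 2, 3])
y = np.array([4, 5, 6])

f = lambda v: np.outer(v, v).ravel()  # f maps 3 dims -> 9 dims: (v1v1, v1v2, ..., v3v3)

explicit = f(x) @ f(y)   # dot product in 9-dimensional space
shortcut = (x @ y) ** 2  # kernel shortcut: K(x, y) = (<x, y>)^2

print(explicit, shortcut)  # both print 1024
```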
That’s about it for kernels. Good job! You just took the first step to becoming a Machine Learning Expert :)
Extra note: to learn more, you can check out how I predicted the stock market at Numerai, and what kernels are in machine learning and SVMs.
Pelumi Aboluwarin did a fantastic job reading the draft and suggesting this topic. Thank you!
If you enjoyed reading this as much as I enjoyed writing it, you know what to do ;) show it some love, and if you have suggestions on topics you would like me to write about, drop them in the comments section below. Thanks for reading :)
Extra Readings
1. https://en.wikipedia.org/wiki/Statistical_classification
2. https://en.wikipedia.org/wiki/Supervised_learning