Part 4: Modern Face Recognition with Deep Learning
Adam Geitgey
Jul 24, 2016
Update: This article is part of a series. Check out the full series: Part 1, Part 2, Part 3, Part 4, Part 5, Part 6, Part 7 and Part 8! You can also read this article in 普通话, Русский, 한국어, Português, Tiếng Việt, فارسی or Italiano.
Giant update: I’ve written a new book based on these articles! It not only expands and updates
all my articles, but it has tons of brand new content and lots of hands-on coding projects. Check
it out now!
Have you noticed that Facebook has developed an uncanny ability to recognize your friends in
your photographs? In the old days, Facebook used to make you tag your friends in photos by
clicking on them and typing in their name. Now as soon as you upload a photo, Facebook tags
everyone for you like magic:
Facebook automatically tags people in your photos that you have tagged before. I’m not sure if
this is helpful or creepy!
This technology is called face recognition. Facebook’s algorithms are able to recognize your
friends’ faces after they have been tagged only a few times. It’s pretty amazing technology —
Facebook can recognize faces with 98% accuracy which is pretty much as good as humans can
do!
Let’s learn how modern face recognition works! But just recognizing your friends would be too
easy. We can push this tech to the limit to solve a more challenging problem — telling Will
Ferrell (famous actor) apart from Chad Smith (famous rock musician)!
One of these people is Will Ferrell. The other is Chad Smith. I swear they are different people!
As a human, your brain is wired to do all of this automatically and instantly. In fact, humans are
too good at recognizing faces and end up seeing faces in everyday objects:
Computers are not capable of this kind of high-level generalization (at least not yet…), so we
have to teach them how to do each step in this process separately.
We need to build a pipeline where we solve each step of face recognition separately and pass the
result of the current step to the next step. In other words, we will chain together several machine
learning algorithms:
Step 1: Finding all the Faces
If you’ve used any camera in the last 10 years, you’ve probably seen face detection in action:
Face detection is a great feature for cameras. When the camera can automatically pick out faces,
it can make sure that all the faces are in focus before it takes the picture. But we’ll use it for a
different purpose — finding the areas of the image we want to pass on to the next step in our
pipeline.
Face detection went mainstream in the early 2000s when Paul Viola and Michael Jones invented
a way to detect faces that was fast enough to run on cheap cameras. However, much more
reliable solutions exist now. We’re going to use a method invented in 2005 called Histogram of
Oriented Gradients — or just HOG for short.
To find faces in an image, we’ll start by making our image black and white because we don’t
need color data to find faces:
Then we’ll look at every single pixel in our image one at a time. For every single pixel, we want
to look at the pixels directly surrounding it:
Our goal is to figure out how dark the current pixel is compared to the pixels directly
surrounding it. Then we want to draw an arrow showing in which direction the image is getting
darker:
Looking at just this one pixel and the pixels touching it, the image is getting darker towards the
upper right.
If you repeat that process for every single pixel in the image, you end up with every pixel being
replaced by an arrow. These arrows are called gradients and they show the flow from light to
dark across the entire image:
This might seem like a random thing to do, but there’s a really good reason for replacing the
pixels with gradients. If we analyze pixels directly, really dark images and really light images of
the same person will have totally different pixel values. But by only considering the direction
that brightness changes, both really dark images and really bright images will end up with the
same exact representation. That makes the problem a lot easier to solve!
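If you want to see what this looks like in code, here is a minimal sketch of the per-pixel gradient idea using NumPy and Pillow. The file name, and the choice of NumPy's gradient helper, are my own illustrative assumptions rather than the article's code:

```python
# A minimal sketch of the per-pixel gradient idea, using NumPy and Pillow.
# The image path and helper choices here are illustrative, not the article's code.
import numpy as np
from PIL import Image

# Load the photo and convert it to grayscale, since we only care about brightness.
image = np.array(Image.open("test_face.jpg").convert("L"), dtype=np.float32)

# For every pixel, compare it to its vertical and horizontal neighbors.
gy, gx = np.gradient(image)

# The angle of each "arrow" (which way brightness is changing)
# and its strength (how sharp the change is), one value per pixel.
direction = np.arctan2(gy, gx)   # radians
magnitude = np.hypot(gx, gy)     # gradient strength
```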
But saving the gradient for every single pixel gives us way too much detail. We end up missing
the forest for the trees. It would be better if we could just see the basic flow of lightness/darkness
at a higher level so we could see the basic pattern of the image.
To do this, we’ll break up the image into small squares of 16x16 pixels each. In each square,
we’ll count up how many gradients point in each major direction (how many point up, point up-
right, point right, etc…). Then we’ll replace that square in the image with the arrow directions
that were the strongest.
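To make the binning step concrete, here is a rough sketch of counting gradient directions in 16x16 squares. It assumes per-pixel direction and magnitude arrays like the ones from the previous snippet (random stand-ins are used here so it runs on its own), and the choice of 8 direction bins is just illustrative:

```python
# A rough sketch of the 16x16 cell binning step. The direction and magnitude
# arrays are random stand-ins for the per-pixel gradients computed earlier.
import numpy as np

rng = np.random.default_rng(0)
direction = rng.uniform(-np.pi, np.pi, size=(128, 128))   # stand-in gradient angles
magnitude = rng.uniform(0.0, 1.0, size=(128, 128))        # stand-in gradient strengths

cell = 16      # each square is 16x16 pixels
n_bins = 8     # up, up-right, right, etc.

h, w = direction.shape
cells = np.zeros((h // cell, w // cell, n_bins))

for i in range(0, h - cell + 1, cell):
    for j in range(0, w - cell + 1, cell):
        angles = direction[i:i + cell, j:j + cell]
        weights = magnitude[i:i + cell, j:j + cell]
        # Count (magnitude-weighted) how many gradients fall in each direction bin.
        hist, _ = np.histogram(angles, bins=n_bins, range=(-np.pi, np.pi), weights=weights)
        cells[i // cell, j // cell] = hist

# The strongest direction in each cell is the "arrow" we keep for that square.
strongest = cells.argmax(axis=-1)
```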
The end result is we turn the original image into a very simple representation that captures the
basic structure of a face in a simple way:
The original image is turned into a HOG representation that captures the major features of the
image regardless of image brightness.
To find faces in this HOG image, all we have to do is find the part of our image that looks the
most similar to a known HOG pattern that was extracted from a bunch of other training faces:
Using this technique, we can now easily find faces in any image:
If you want to try this step out yourself using Python and dlib, here’s code showing how to
generate and view HOG representations of images.
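As a rough stand-in for that linked code, the sketch below uses dlib's built-in HOG-based face detector together with scikit-image's hog() helper to visualize the HOG representation. The file name is a placeholder and the parameter choices are mine, not necessarily the ones the article's code uses:

```python
# A hedged stand-in for the code the article links to: dlib's built-in HOG face
# detector plus scikit-image's hog() helper to view the HOG representation.
import dlib
import matplotlib.pyplot as plt
from skimage import io
from skimage.color import rgb2gray
from skimage.feature import hog

image = io.imread("test_face.jpg")   # placeholder file name

# dlib's default face detector is itself based on HOG plus a linear classifier.
detector = dlib.get_frontal_face_detector()
for rect in detector(image, 1):      # second argument upsamples the image once
    print("Found a face at:", rect.left(), rect.top(), rect.right(), rect.bottom())

# Generate a viewable HOG representation of the whole image.
features, hog_image = hog(
    rgb2gray(image),
    orientations=8,
    pixels_per_cell=(16, 16),
    cells_per_block=(1, 1),
    visualize=True,
)
plt.imshow(hog_image, cmap="gray")
plt.show()
```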
Step 2: Posing and Projecting Faces
Whew, we isolated the faces in our image. But now we have to deal with the problem that faces turned in different directions look totally different to a computer. To account for this, we will try to warp each picture so that the eyes and lips are always in the same place in the image. This will make it a lot easier for us to compare faces in the next steps.
To do this, we are going to use an algorithm called face landmark estimation. There are lots of
ways to do this, but we are going to use the approach invented in 2014 by Vahid Kazemi and
Josephine Sullivan.
The basic idea is we will come up with 68 specific points (called landmarks) that exist on every
face — the top of the chin, the outside edge of each eye, the inner edge of each eyebrow, etc.
Then we will train a machine learning algorithm to be able to find these 68 specific points on any
face:
The 68 landmarks we will locate on every face. This image was created by Brandon Amos of
CMU who works on OpenFace.
Here’s the result of locating the 68 face landmarks on our test image:
PROTIP: You can also use this same technique to implement your own version of Snapchat’s
real-time 3d face filters!
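For reference, here is a minimal sketch of the landmark step with dlib. It assumes dlib's standard pre-trained 68-point predictor file (an implementation of the Kazemi and Sullivan approach, downloaded separately); the image file name is a placeholder:

```python
# A minimal sketch of face landmark estimation with dlib. The predictor file is
# dlib's standard pre-trained 68-point model; file names are placeholders.
import dlib
from skimage import io

detector = dlib.get_frontal_face_detector()
predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")

image = io.imread("test_face.jpg")

for rect in detector(image, 1):
    landmarks = predictor(image, rect)
    # 68 (x, y) points: chin line, eyebrows, nose, eyes and lips.
    points = [(landmarks.part(i).x, landmarks.part(i).y) for i in range(68)]
    print(points[:5])
```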
Now that we know where the eyes and mouth are, we’ll simply rotate, scale and shear the image
so that the eyes and mouth are centered as best as possible. We won’t do any fancy 3d warps
because that would introduce distortions into the image. We are only going to use basic image
transformations like rotation and scale that preserve parallel lines (called affine transformations):
Now no matter how the face is turned, we are able to center the eyes and mouth in roughly
the same position in the image. This will make our next step a lot more accurate.
If you want to try this step out yourself using Python and dlib, here’s the code for finding face
landmarks and here’s the code for transforming the image using those landmarks.
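As one possible way to do that warp, here is a sketch using OpenCV's affine transform, fed with the 68 landmark points from the previous sketch. It is not the code the article links to, and the target eye and mouth positions are arbitrary choices for a 160x160 output:

```python
# One way to do the "rotate, scale and shear" step with OpenCV, given the 68
# landmark points found earlier. Target positions are arbitrary illustrative values.
import cv2
import numpy as np

def align_face(image, points, size=160):
    """Warp `image` so the eyes and mouth land in fixed positions.

    `points` is the list of 68 (x, y) landmark tuples from dlib.
    """
    # Average the landmark points that make up each eye and the mouth.
    left_eye = np.mean(points[36:42], axis=0)
    right_eye = np.mean(points[42:48], axis=0)
    mouth = np.mean(points[48:68], axis=0)

    src = np.float32([left_eye, right_eye, mouth])
    dst = np.float32([
        [size * 0.3, size * 0.35],   # where the left eye should end up
        [size * 0.7, size * 0.35],   # where the right eye should end up
        [size * 0.5, size * 0.75],   # where the mouth should end up
    ])

    # An affine transform preserves parallel lines: rotation, scale and shear only.
    matrix = cv2.getAffineTransform(src, dst)
    return cv2.warpAffine(image, matrix, (size, size))
```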
Step 3: Encoding Faces
The simplest approach to face recognition is to directly compare the unknown face we found in
Step 2 with all the pictures we have of people that have already been tagged. When we find a
previously tagged face that looks very similar to our unknown face, it must be the same person.
Seems like a pretty good idea, right?
There’s actually a huge problem with that approach. A site like Facebook with billions of users
and a trillion photos can’t possibly loop through every previous-tagged face to compare it to
every newly uploaded picture. That would take way too long. They need to be able to recognize
faces in milliseconds, not hours.
What we need is a way to extract a few basic measurements from each face. Then we could
measure our unknown face the same way and find the known face with the closest
measurements. For example, we might measure the size of each ear, the spacing between the
eyes, the length of the nose, etc. If you’ve ever watched a bad crime show like CSI, you know
what I am talking about:
Just like TV! So real! #science
It turns out that the measurements that seem obvious to us humans (like eye color) don’t really
make sense to a computer looking at individual pixels in an image. Researchers have discovered
that the most accurate approach is to let the computer figure out the measurements to collect
itself. Deep learning does a better job than humans at figuring out which parts of a face are
important to measure.
The solution is to train a Deep Convolutional Neural Network (just like we did in Part 3). But
instead of training the network to recognize objects in pictures like we did last time, we are going to
train it to generate 128 measurements for each face.
The training process works by looking at 3 face images at a time:
1. Load a training face image of a known person
2. Load another picture of the same known person
3. Load a picture of a totally different person
Then the algorithm looks at the measurements it is currently generating for each of those three
images. It then tweaks the neural network slightly so that it makes sure the measurements it
generates for #1 and #2 are slightly closer while making sure the measurements for #2 and #3 are
slightly further apart:
After repeating this step millions of times for millions of images of thousands of different
people, the neural network learns to reliably generate 128 measurements for each person. Any
ten different pictures of the same person should give roughly the same measurements.
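As a toy illustration of what that tweaking is optimizing, here is the triplet idea written out in plain NumPy. Real systems minimize this kind of triplet loss with a deep convolutional network; the margin value and the random embeddings below are made up purely for illustration:

```python
# A toy illustration of the training objective, not the real network. The idea:
# push the measurements of two pictures of the same person (anchor, positive)
# closer together than the measurements of a different person (negative),
# by at least some margin.
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.2):
    """anchor/positive are embeddings of the same person, negative of someone else."""
    pos_dist = np.sum((anchor - positive) ** 2)   # want this small
    neg_dist = np.sum((anchor - negative) ** 2)   # want this large
    return max(0.0, pos_dist - neg_dist + margin)

# Three made-up 128-number embeddings:
rng = np.random.default_rng(42)
a, p, n = rng.normal(size=(3, 128))
print(triplet_loss(a, p, n))
```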
Machine learning people call the 128 measurements of each face an embedding. The idea of
reducing complicated raw data like a picture into a list of computer-generated numbers comes up
a lot in machine learning (especially in language translation). The exact approach for faces we
are using was invented in 2015 by researchers at Google but many similar approaches exist.
But once the network has been trained, it can generate measurements for any face, even ones it
has never seen before! So this step only needs to be done once. Lucky for us, the fine folks at
OpenFace already did this and they published several trained networks which we can directly
use. Thanks Brandon Amos and team!
So all we need to do ourselves is run our face images through their pre-trained network to get the
128 measurements for each face. Here are the measurements for our test image:
So what parts of the face are these 128 numbers measuring exactly? It turns out that we have no
idea. It doesn’t really matter to us. All we care about is that the network generates nearly the same
numbers when looking at two different pictures of the same person.
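If you just want the 128 measurements without setting up OpenFace, the author's face_recognition library (mentioned at the end of this article) wraps this step in a single call. A minimal sketch, with a placeholder file name:

```python
# Getting the 128 measurements with the face_recognition library.
# The image file name is a placeholder.
import face_recognition

image = face_recognition.load_image_file("test_face.jpg")

# Returns one list of 128 numbers per face found in the image.
encodings = face_recognition.face_encodings(image)
print(len(encodings[0]))   # 128
```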
If you want to try this step yourself, OpenFace provides a Lua script that will generate
embeddings for all images in a folder and write them to a csv file. You run it like this.
Step 4: Finding the person’s name from the encoding
This last step is actually the easiest step in the whole process. All we have to do is find the
person in our database of known people who has the closest measurements to our test image.
You can do that by using any basic machine learning classification algorithm. No fancy deep
learning tricks are needed. We’ll use a simple linear SVM classifier, but lots of classification
algorithms could work.
All we need to do is train a classifier that can take in the measurements from a new test image
and tell us which known person is the closest match. Running this classifier takes milliseconds.
The result of the classifier is the name of the person!
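Here is a sketch of how that classifier might look with scikit-learn. The embeddings and names below are random stand-ins for real training data, and the linear-SVM choice just mirrors the text above:

```python
# A sketch of the classification step with scikit-learn. The training data here
# is made up purely for illustration; real embeddings come from Step 3.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
embeddings = rng.normal(size=(60, 128))                      # stand-in 128-number measurements
names = ["Will Ferrell", "Chad Smith", "Jimmy Fallon"] * 20  # one name per embedding

classifier = SVC(kernel="linear", probability=True)
classifier.fit(embeddings, names)

# For a new face, the classifier returns the closest matching person in milliseconds.
unknown_embedding = rng.normal(size=(1, 128))
print(classifier.predict(unknown_embedding)[0])
```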
So let’s try out our system. First, I trained a classifier with the embeddings of about 20 pictures
each of Will Ferrell, Chad Smith and Jimmy Fallon:
Then I ran the classifier on every frame of the famous YouTube video of Will Ferrell and Chad
Smith pretending to be each other on the Jimmy Fallon show:
It works! And look how well it works for faces in different poses — even sideways faces!
Let’s review the steps:
1. Encode a picture using the HOG algorithm to create a simplified version of the image.
Using this simplified image, find the part of the image that most looks like a generic
HOG encoding of a face.
2. Figure out the pose of the face by finding the main landmarks in the face. Once we find
those landmarks, use them to warp the image so that the eyes and mouth are centered.
3. Pass the centered face image through a neural network that knows how to measure
features of the face. Save those 128 measurements.
4. Looking at all the faces we’ve measured in the past, see which person has the closest
measurements to our face’s measurements. That’s our match!
Now that you know how this all works, here are instructions from start to finish on how to run this
entire face recognition pipeline on your own computer:
UPDATE 4/9/2017: You can still follow the steps below to use OpenFace. However, I’ve
released a new Python-based face recognition library called face_recognition that is much easier
to install and use. So I’d recommend trying out face_recognition first instead of continuing
below!
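For a taste of what that looks like, here is a minimal end-to-end sketch using the face_recognition library. The image file names are placeholders, and compare_faces does a simple distance check against known encodings rather than training an SVM classifier as described above:

```python
# A minimal end-to-end sketch with the face_recognition library.
# File names are placeholders; compare_faces uses a simple distance check.
import face_recognition

# Encode one known picture of a person.
known_image = face_recognition.load_image_file("will_ferrell.jpg")
known_encoding = face_recognition.face_encodings(known_image)[0]

# Encode the faces in a new, unknown picture and compare them.
unknown_image = face_recognition.load_image_file("unknown.jpg")

for unknown_encoding in face_recognition.face_encodings(unknown_image):
    match = face_recognition.compare_faces([known_encoding], unknown_encoding)[0]
    print("This looks like Will Ferrell!" if match else "Not Will Ferrell.")
```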
I even put together a pre-configured virtual machine with face_recognition, OpenCV,
TensorFlow and lots of other deep learning tools pre-installed. You can download and run it on
your computer very easily. Give the virtual machine a shot if you don’t want to install all these
libraries yourself!