III. Advanced Neural Network Techniques
There are some interesting and powerful variations of the theme that have led to great advances in deep learning. The
simple classifier that we studied in detail in the previous section is severely limited – as you
noticed, it wasn't even possible to classify all the smiley faces correctly. Adding more layers
in the network and using backpropagation to learn the weights does in principle solve the
problem, but another one emerges: the number of weights becomes extremely large and
consequently, the amount of training data required to achieve satisfactory accuracy can
easily grow too large to be realistic.
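To get a feel for the scale of the problem, consider a back-of-the-envelope count (the image size and layer width below are made up purely for illustration):

    # Weights in a single fully connected layer on a modest image
    # (256x256 pixels and 1000 neurons are arbitrary illustrative sizes).
    pixels = 256 * 256
    hidden_neurons = 1000
    weights = pixels * hidden_neurons
    print(f"{weights:,}")  # 65,536,000 weights, and that is just one layer

Tens of millions of weights in a single layer give the network enormous freedom to fit quirks of the training data, which is why the amount of training data needed grows so quickly.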
Fortunately, a very elegant solution to the problem of too many weights exists: a special kind
of neural network, or rather, a special kind of layer that can be included in a deep neural
network. This special kind of layer is a so-called convolutional layer. Networks including
convolutional layers are called convolutional neural networks (CNNs). Their key property is
that they can detect image features such as bright or dark (or specific color) spots, edges in
various orientations, patterns, and so on. These form the basis for detecting more abstract
features such as a cat's ears, a dog's snout, a person's eye, or the octagonal shape of a stop
sign. It would normally be hard to train a neural network to detect such features based on the
pixels of the input image, because the features can appear in different positions,
orientations, and sizes in the image: moving the object or changing the camera angle will
change the pixel values dramatically even if the object itself looks just the same to us.
Learning to detect a stop sign in all these different conditions would require vast
amounts of training data, because the network would only detect the sign in conditions where
it has appeared in the training data. So, for example, a stop sign in the top right corner of the
image would be detected only if the training data included an image with the stop sign in the
top right corner. CNNs, by contrast, can recognize the object anywhere in the image, no matter where it
appeared in the training images.
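A minimal NumPy sketch may help make this concrete (the tiny image and the hand-made edge filter are toy examples, not weights a real CNN would learn): one small set of shared weights is slid across every position of the image, so the same feature is detected wherever it happens to occur.

    import numpy as np

    def convolve2d(image, kernel):
        # Slide the same small weight kernel over every position of the image.
        h, w = image.shape
        kh, kw = kernel.shape
        out = np.zeros((h - kh + 1, w - kw + 1))
        for i in range(out.shape[0]):
            for j in range(out.shape[1]):
                out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
        return out

    image = np.zeros((8, 8))                    # toy image with a bright vertical stripe
    image[:, 5] = 1.0
    kernel = np.array([[-1.0, 0.0, 1.0]] * 3)   # hand-made vertical edge detector

    print(np.argmax(np.abs(convolve2d(image, kernel)), axis=1))    # fires next to column 5
    shifted = np.roll(image, -3, axis=1)        # move the stripe to column 2
    print(np.argmax(np.abs(convolve2d(shifted, kernel)), axis=1))  # same filter fires at the new spot

Because the same few weights are reused across the whole image, the network needs to learn the edge detector only once rather than once per position.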
The convolutional neurons are typically placed in the bottom layers of the network, which
process the raw input pixels. Basic neurons (like the perceptron neuron discussed above)
are placed in the higher layers, which process the output of the bottom layers. The bottom
layers can usually be trained using unsupervised learning, without a particular prediction
task in mind. Their weights will be tuned to detect features that appear frequently in the input
data. Thus, with photos of animals, typical features will be ears and snouts, whereas in
images of buildings, the features are architectural components such as walls, roofs,
windows, and so on. If a mix of various objects and scenes is used as the input data, then
the features learned by the bottom layers will be more or less generic. This means that
pre-trained convolutional layers can be reused in many different image processing tasks.
This is extremely important since it is easy to get virtually unlimited amounts of unlabeled
training data – images without labels – which can be used to train the bottom layers. The top
layers, in contrast, are trained with supervised machine learning techniques such as
backpropagation.

Since the top layers of the network have been trained in a supervised manner to perform a particular
classification or prediction task, they are really useful only for that task. A network
trained to detect stop signs is useless for detecting handwritten digits or cats.
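As a rough sketch of how this reuse looks in practice, here using torchvision's ImageNet-pre-trained ResNet-18 purely as an example, with a placeholder number of output classes:

    import torch
    from torch import nn
    from torchvision import models

    # Bottom (convolutional) layers pre-trained on ImageNet; any pre-trained CNN would do.
    model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

    # Freeze the pre-trained feature detectors...
    for param in model.parameters():
        param.requires_grad = False

    # ...and replace the top layer with a fresh one for our own task
    # (10 output classes is a placeholder, e.g. for handwritten digits).
    model.fc = nn.Linear(model.fc.in_features, 10)

    # Backpropagation now tunes only the new top layer.
    optimizer = torch.optim.SGD(model.fc.parameters(), lr=0.01)

The frozen layers keep their generic feature detectors, while the new top layer learns the task-specific mapping from features to classes.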
A fascinating result is obtained by taking the pre-trained bottom layers and studying what the
features they have learned look like. This can be achieved by generating images that
activate a certain set of neurons in the bottom layers. Looking at the generated images, we
can see what the neural network “thinks” a particular feature looks like, or what an image
with a select set of features in it would look like. Some even like to talk about the networks
“dreaming” or “hallucinating” images.
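One common way to generate such images is gradient ascent on the input itself: start from random noise and repeatedly nudge the pixels so that a chosen neuron responds more strongly. A sketch, in which the pre-trained network, the layer, and the channel are all arbitrary example choices:

    import torch
    from torchvision import models

    model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT).eval()
    for p in model.parameters():
        p.requires_grad_(False)

    # Treat the pixels themselves as the trainable parameters.
    image = torch.randn(1, 3, 224, 224, requires_grad=True)
    optimizer = torch.optim.Adam([image], lr=0.05)

    for _ in range(200):
        optimizer.zero_grad()
        # Run only the bottom layers; channel 3 of the first block is an arbitrary pick.
        x = model.maxpool(model.relu(model.bn1(model.conv1(image))))
        features = model.layer1(x)
        loss = -features[0, 3].mean()   # minimizing -activation = gradient ascent on it
        loss.backward()
        optimizer.step()

    # `image` now approximates what the network "thinks" this feature looks like.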
In 2014, Ian Goodfellow, who currently works at Google Brain, proposed a
clever combination of two neural networks. The idea is to let the two networks compete
against each other. One of the networks is trained to generate images like the ones in the
training data. The other network's task is to separate images generated by the first network
from real images from the training data – it is called the adversarial network, and the whole
system is called a generative adversarial network (GAN).
The system trains the two models side by side. In the beginning of the training, the
adversarial model has an easy time telling apart the real images from the training data and
the clumsy attempts by the generative model. However, as the generative network slowly
gets better and better, the adversarial model has to improve as well, and the cycle continues
until eventually the generated images are almost indistinguishable from real ones. The GAN
does not try to merely reproduce the images in the training data: that would be a way too simple
strategy to beat the adversarial network. Rather, the system is trained so that it has to be
able to generate genuinely new, realistic-looking images as well.
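In code, the side-by-side training amounts to two alternating optimization steps per batch. Below is a heavily simplified PyTorch sketch: the two tiny fully connected networks, the data_loader, and all sizes are placeholders, and a real GAN needs considerably more care to train stably.

    import torch
    from torch import nn

    # Placeholder models; real GANs use much larger, typically convolutional, networks.
    generator = nn.Sequential(nn.Linear(100, 256), nn.ReLU(), nn.Linear(256, 784), nn.Tanh())
    adversary = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 1), nn.Sigmoid())

    loss_fn = nn.BCELoss()
    g_opt = torch.optim.Adam(generator.parameters(), lr=2e-4)
    a_opt = torch.optim.Adam(adversary.parameters(), lr=2e-4)

    for real_images in data_loader:        # assumed: batches of flattened real images
        batch = real_images.size(0)
        fake_images = generator(torch.randn(batch, 100))

        # 1) Train the adversarial network to tell real images from generated ones.
        a_opt.zero_grad()
        real_loss = loss_fn(adversary(real_images), torch.ones(batch, 1))
        fake_loss = loss_fn(adversary(fake_images.detach()), torch.zeros(batch, 1))
        (real_loss + fake_loss).backward()
        a_opt.step()

        # 2) Train the generator to make the adversary call its images real.
        g_opt.zero_grad()
        g_loss = loss_fn(adversary(fake_images), torch.ones(batch, 1))
        g_loss.backward()
        g_opt.step()

The detach() call is the crux of the alternation: while the adversary is being trained, gradients must not flow back into the generator.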