neural-networks-part1
Part 1: Network Basics
Image Classification
[Figure: example images classified as “cat” and “5”]
Semantic Segmentation
“a label for each pixel”
Neural Networks
Machine learning technique
Often used for classification, semantic segmentation, and related tasks
First ideas discussed in the 1950s/60s
Theory work on NNs in the 1990s
Increase in attention from 2000 on
Deep learning took off around 2010
CNNs for image tasks from 2012 on
Part 1
Neural Networks Basics
Neural Network
Neurons
Receive inputs / activations from sensors or other neurons
Combine / transform the information
Create an output / activation
Neurons as Functions
We can see a neuron as a function
Input given by the activations x_1, ..., x_N
Transformation of the input data can be described by a function f
Output: a = f(x_1, ..., x_N)
Neural Network
A NN is a network/graph of neurons
Nodes are neurons
Edges represent input-output connections of the data flow
Neural Networks are Functions
Neural networks are functions
Consist of connected artificial neurons
Input layer takes (sensor) data
Output layer provides the function result (information or command)
Hidden layers do some computations
[Figure: input layer → hidden layers → output layer]
Different Types of NNs
Perceptron
MLP – Multilayer perceptron
Autoencoder
CNN – Convolutional NN
RNN – Recurrent NN
LSTM – Long short-term memory NN
GAN – Generative adversarial network
Graph NN
Transformer
...
[Image courtesy: van Veen]
Multi-layer Perceptron (MLP)
Multi-layer Perceptron Seen as a Function
[Figure: input layer → hidden layers → output layer]
Image Classification Example
“cat”
An image consists of individual pixels.
What is the Network’s Input?
An image consists of individual pixels.
Each pixel stores an intensity value.
We have N+1 such intensity values.
This vector of pixel intensities is the input layer of our network!
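The flattening step above can be sketched in Python; a tiny hypothetical 3x3 image stands in for a real one:

```python
# Hypothetical 3x3 grayscale "image": each entry is a pixel intensity in [0, 1].
image = [
    [0.0, 0.5, 1.0],
    [0.2, 0.9, 0.4],
    [0.0, 0.1, 0.8],
]

# Flatten the 2D pixel grid row by row into a single vector:
# this vector is the input layer of the network.
input_vector = [pixel for row in image for pixel in row]

print(len(input_vector))  # 9 intensity values for a 3x3 image
```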
What is the Network’s Output?
Is it a cat or a dog or a human or a ...?
The output is an indicator vector with one entry per class.
But we are never certain...
Output of the Network
“cat” – the class with the largest output value
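Picking the label with the largest output value can be sketched as follows; the class names and activation values are made up:

```python
# Hypothetical output activations, one entry per class (an "indicator vector").
classes = ["cat", "dog", "human"]
output = [0.7, 0.2, 0.1]

# The prediction is the class with the largest output value.
predicted = classes[output.index(max(output))]
print(predicted)  # cat
```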
Multi-layer Perceptron
Let’s Look at a Single Neuron
Perceptron (Single Neuron)
(input) activations x_i, weights w_i, bias b, activation function g
Output activation for the next layer:
a = g(Σ_i w_i x_i + b)
Function Behind a Neuron
A neuron gets activated (a) through
A weighted sum of the input activations
A bias activation
An activation function
a = g(Σ_i w_i x_i + b)
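A minimal sketch of this neuron function, with made-up inputs and weights and ReLU as the activation:

```python
def relu(s):
    return max(0.0, s)

def neuron(x, w, b, g):
    """One artificial neuron: weighted sum of the input activations
    plus a bias, passed through the activation function g."""
    s = sum(wi * xi for wi, xi in zip(w, x)) + b
    return g(s)

x = [1.0, 2.0, 3.0]    # input activations (made up)
w = [0.5, -1.0, 1.0]   # weights (made up)
b = -1.0               # bias

a = neuron(x, w, b, relu)
print(a)  # 0.5*1 - 1*2 + 1*3 - 1 = 0.5
```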
Similarity to Convolutions?
A neuron is similar to a convolution
Remember the linear shift-invariant kernels used as local operators
The weighted sum Σ_i w_i x_i looks like the convolutions used for defining local operators
ReLU Activation Function
The most commonly used one is the so-called “rectified linear unit” or ReLU:
ReLU(s) = max(0, s)
Often advantageous for deep networks
Neuron Activation
A neuron is only activated if Σ_i w_i x_i + b > 0,
i.e., if the weighted activations are larger than the negative bias: Σ_i w_i x_i > -b
Common Activation Functions
There are different activation functions
sigmoid()
ReLU()
tanh()
atan()
softplus()
identity()
step-function()
…
ReLU is often used
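A few of these activation functions sketched in plain Python; sigmoid, ReLU, softplus, and the step function are written out, while tanh and atan come from the math module:

```python
import math

def sigmoid(s):
    return 1.0 / (1.0 + math.exp(-s))

def relu(s):
    return max(0.0, s)

def softplus(s):
    # Smooth approximation of ReLU.
    return math.log(1.0 + math.exp(s))

def step(s):
    return 1.0 if s > 0 else 0.0

for g in (sigmoid, relu, math.tanh, math.atan, softplus, step):
    print(g.__name__, g(1.0))
```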
Illustration
[Image courtesy: S. Sharma]
Function Behind a Neuron
A neuron gets activated if the weighted sum of its input activations is large enough (larger than the negative bias)
Each Layer Can Be Expressed Through Matrix Multiplications
a(1) = g(W(1) a(0) + b(1))
with a(0) the activations of layer 0 (the input), W(1) the weight matrix, and b(1) the bias vector of layer 1
Do It Layer by Layer...
input = layer 0: a(0) = x
layer 1: a(1) = g(W(1) a(0) + b(1))
layer 2: a(2) = g(W(2) a(1) + b(2))
...
layer k = output: a(k) = g(W(k) a(k-1) + b(k))
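The layer-by-layer evaluation can be sketched as repeated matrix-vector products; the network sizes and weights below are made up:

```python
def relu(s):
    return max(0.0, s)

def matvec(W, a):
    """Matrix-vector product W a."""
    return [sum(wij * aj for wij, aj in zip(row, a)) for row in W]

def layer(W, b, a, g):
    """One layer: a_next = g(W a + b), with g applied elementwise."""
    return [g(s + bi) for s, bi in zip(matvec(W, a), b)]

# Tiny hypothetical network: 3 inputs -> 2 hidden -> 1 output.
a0 = [1.0, 0.0, 2.0]                                  # layer 0 = input
W1, b1 = [[1.0, 0.0, 1.0], [0.0, 1.0, -1.0]], [0.0, 0.5]
W2, b2 = [[1.0, 1.0]], [-1.0]

a1 = layer(W1, b1, a0, relu)                          # layer 1
a2 = layer(W2, b2, a1, relu)                          # layer 2 = output
print(a1, a2)  # [3.0, 0.0] [2.0]
```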
Handwritten Digit Recognition
A 28x28 pixel image showing the digit “5”
[Image courtesy: Nielsen]
Handwritten Digit Recognition
28x28 pixel input images → input vector (784 dim) → output vector (10 dim)
[Partial image courtesy: Nielsen]
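A shape-only sketch of such a digit-recognition network; the hidden-layer size of 30 is an assumption, and the weights are random, so the outputs are meaningless — only the dimensions (784 in, 10 out) matter:

```python
import random
random.seed(0)

def layer(W, b, a):
    """One ReLU layer: relu(W a + b)."""
    return [max(0.0, sum(wij * aj for wij, aj in zip(row, a)) + bi)
            for row, bi in zip(W, b)]

def rand_matrix(rows, cols):
    return [[random.uniform(-0.1, 0.1) for _ in range(cols)]
            for _ in range(rows)]

# A 28x28 image flattened into a 784-dim input vector (random stand-in).
a0 = [random.random() for _ in range(28 * 28)]

# Hypothetical layer sizes: 784 -> 30 -> 10 (one output per digit 0..9).
W1, b1 = rand_matrix(30, 784), [0.0] * 30
W2, b2 = rand_matrix(10, 30), [0.0] * 10

a2 = layer(W2, b2, layer(W1, b1, a0))
print(len(a2))  # 10 output activations, one per digit class
```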
What Happens in the Layers?
What Happens in the 1st Layer?
[Figure: pixel values (white to black) multiplied by weights in the range -1 to +1; only the weighted pixels affect the weighted sum, the rest doesn’t matter]
What Happens in the 1st Layer?
The weights tell us what matters in the image for activating the neuron!
Link to Local Operators Defined Through Convolutions
Direct link to defining image operators through convolutions
Here:
Global (not local) operators
The weight matrix does not (yet) “slide over the image”
Weights & Bias = Patterns
Weights define the patterns to look for in the image
Bias tells us how well the image must match the pattern
The activation function “switches the neuron on” if the input matches the pattern
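The pattern-matching view can be illustrated with a made-up 1D example: the weighted sum is a dot product with a template, and the bias sets the matching threshold:

```python
# The weight vector acts as a pattern template: the weighted sum is a
# dot product, which is large when the input matches the pattern.
# Hypothetical 1D "images" with 4 pixels each.
pattern_weights = [1.0, 1.0, -1.0, -1.0]  # looks for: bright left, dark right
bias = -1.0                               # how well the input must match

def match(x):
    s = sum(w * xi for w, xi in zip(pattern_weights, x)) + bias
    return max(0.0, s)                    # ReLU "switches the neuron on"

matching = [1.0, 1.0, 0.0, 0.0]           # bright left, dark right
opposite = [0.0, 0.0, 1.0, 1.0]           # dark left, bright right

print(match(matching))  # 1.0 -> neuron activated
print(match(opposite))  # 0.0 -> no activation
```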
What Happens in the 2nd Layer?
The weights in layer 2 tell us which 1st-layer patterns should be combined
The deeper we go, the more patterns get arranged and combined
How to Make the Network Compute What We Want?
So far, the network is a recipe for sequentially performing computations
Structure and parameters are the design choices
How to set them? Learning!
Summary – Part 1
What are neurons and neural networks
Lots of different networks exist
Focus: multi-layer perceptrons (MLP)
Activations, weights, bias
Networks have many parameters
“It’s just a bunch of matrices and vectors”
MLP for simple image classification