Neural Network Presentation

This document provides an overview of artificial neural networks including their history, applications, properties, and how they work. Some key points: - Neural networks were inspired by biological neural systems and can learn from large datasets to classify inputs. - Examples of applications include handwriting recognition, speech recognition, and face recognition. - The basic building block is the perceptron, which uses weighted inputs and an activation function to make classifications. - Multilayer networks can represent complex functions using techniques like backpropagation to calculate errors and update weights. - With enough hidden units, neural networks can approximate any function, though interpretation is difficult for humans.

Uploaded by

hrlive123

Artificial Neural Networks

What can they do? How do they work? What might we use them for in our project? Why are they so cool?

History

Late 1800s: Neural networks appear as an analogy to biological systems

1960s and 70s: Simple neural networks appear

Fall out of favor because the perceptron is not effective by itself, and there were no good algorithms for multilayer nets

1986: Backpropagation algorithm appears

Neural networks have a resurgence in popularity

Applications

Handwriting recognition

Recognizing spoken words

Face recognition

You will get a chance to play with this later!

ALVINN

TD-Gammon

ALVINN

Autonomous Land Vehicle in a Neural Network

Robotic car

Created in the 1980s by Dean Pomerleau

1995:

Drove 1000 miles in traffic at speeds of up to 120 MPH

Steered the car coast to coast (throttle and brakes controlled by a human)

30 x 32 image as input, 4 hidden units, and 30 outputs

TD-GAMMON

Plays backgammon

Created by Gerry Tesauro in the early 90s

Uses temporal-difference learning, a close relative of Q-learning (similar to what we might use)

Neural network was used to learn the evaluation function

Trained on over 1 million games played against itself

Plays competitively at a world-class level

Basic Idea

Modeled on biological systems

This association has become much looser

Learn to classify objects

Can do more than this

Learn from given training data of the form (x1...xn, output)

Properties

Inputs are flexible

Any real values

Highly correlated or independent

Target function may be discrete-valued, real-valued, or vectors of discrete or real values

Outputs are real numbers between 0 and 1

Resistant to errors in the training data

Long training time

Fast evaluation

The function produced can be difficult for humans to interpret

Perceptrons

Basic unit in a neural network

Linear separator

Parts:

N inputs, $x_1 \ldots x_n$

Weights for each input, $w_1 \ldots w_n$

A bias input $x_0$ (constant) and associated weight $w_0$

Weighted sum of inputs, $y = w_0 x_0 + w_1 x_1 + \ldots + w_n x_n$

A threshold function, i.e. 1 if $y > 0$, -1 if $y \le 0$

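Diagram

The inputs $x_0$ (bias), $x_1$, $x_2$, ..., $x_n$ are multiplied by the weights $w_0$, $w_1$, $w_2$, ..., $w_n$ and summed, $y = \sum_i w_i x_i$. The threshold then outputs 1 if $y > 0$ and -1 otherwise.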
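The parts listed above can be sketched in a few lines of Python. This is only an illustration: the function name `perceptron_output` and the choice of 1.0 for the constant bias input are assumptions, not something the slides specify.

```python
def perceptron_output(weights, inputs, bias_input=1.0):
    """Return 1 if the weighted sum is positive, otherwise -1.

    weights[0] is w0, the weight on the constant bias input x0
    (bias_input, assumed to be 1.0 here); weights[1:] pair up with inputs.
    """
    xs = [bias_input] + list(inputs)
    # Weighted sum y = w0*x0 + w1*x1 + ... + wn*xn
    y = sum(w * x for w, x in zip(weights, xs))
    # Threshold function
    return 1 if y > 0 else -1
```

For example, `perceptron_output([-0.5, 1, 1], [1, 0])` computes y = -0.5 + 1 + 0 = 0.5, which is positive, so the unit outputs 1.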

Linear Separator
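Figure: two plots in the (x1, x2) plane. A perceptron can separate positive (+) from negative (-) points when a single line divides them, but not the XOR arrangement, where no such line exists.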

Boolean Functions
With inputs of 0 or 1 and a unit that fires when $y > 0$:

x1 AND x2: x0 = -1, w0 = 1.5, w1 = 1, w2 = 1

x1 OR x2: x0 = -1, w0 = 0.5, w1 = 1, w2 = 1

NOT x1: x0 = -1, w0 = -0.5, w1 = -1

Thus all boolean functions can be represented by layers of perceptrons!
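As a quick check of these weights, the sketch below wires up the three units and then layers them to build XOR, which a single perceptron cannot represent. Several things here are assumptions made for illustration: inputs are 0 or 1, a unit's output is mapped to 1 (fires) or 0 (does not fire) so that units can be chained, and the NOT unit uses w1 = -1, since a positive w1 would make it fire for both inputs.

```python
def fires(weights, inputs, x0=-1):
    """True when the weighted sum w0*x0 + w1*x1 + ... is positive."""
    y = weights[0] * x0 + sum(w * x for w, x in zip(weights[1:], inputs))
    return y > 0

# Weights are listed as [w0, w1, ...]; the bias input x0 is fixed at -1.
AND_W = [1.5, 1, 1]
OR_W = [0.5, 1, 1]
NOT_W = [-0.5, -1]   # assumed sign for w1, see the note above

def AND(a, b):
    return int(fires(AND_W, [a, b]))

def OR(a, b):
    return int(fires(OR_W, [a, b]))

def NOT(a):
    return int(fires(NOT_W, [a]))

def XOR(a, b):
    # Two layers of units: (a OR b) AND NOT (a AND b)
    return AND(OR(a, b), NOT(AND(a, b)))
```

Calling `XOR` on all four input pairs gives 0, 1, 1, 0, which is the sense in which layers of perceptrons can represent functions a single unit cannot.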

Perceptron Training Rule

$w_i \leftarrow w_i + \Delta w_i$

$\Delta w_i = \eta (t - o) x_i$

$w_i$: The weight of input $i$

$\eta$: The 'learning rate', between 0 and 1

$t$: The target output

$o$: The actual output

$x_i$: The $i$th input
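A minimal sketch of this rule in Python follows. The function names, the bias input fixed at 1.0, the zero initial weights, and the stop-when-no-mistakes loop are all assumptions made for the example; targets are +1 or -1 to match the threshold unit.

```python
def train_perceptron(examples, eta=0.1, max_epochs=100):
    """Apply w_i <- w_i + eta * (t - o) * x_i until every example is correct."""
    n = len(examples[0][0])
    w = [0.0] * (n + 1)              # w[0] is the bias weight (x0 = 1.0)
    for _ in range(max_epochs):
        mistakes = 0
        for x, t in examples:
            xs = [1.0] + list(x)
            y = sum(wi * xi for wi, xi in zip(w, xs))
            o = 1 if y > 0 else -1   # actual output of the threshold unit
            if o != t:
                mistakes += 1
                w = [wi + eta * (t - o) * xi for wi, xi in zip(w, xs)]
        if mistakes == 0:            # every training example classified correctly
            break
    return w

def predict(w, x):
    xs = [1.0] + list(x)
    return 1 if sum(wi * xi for wi, xi in zip(w, xs)) > 0 else -1
```

On a linearly separable problem such as OR, with examples like `([0, 1], 1)` and `([0, 0], -1)`, the loop stops after a handful of passes. When `t == o` the update is zero, so only misclassified examples move the weights.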

Gradient Descent

The perceptron training rule may not converge if the points are not linearly separable

Gradient descent addresses this by changing the weights according to the total error over all training points, rather than the error on each individual point

If the data is not linearly separable, it still converges toward the best fit (minimum squared error), given a small enough learning rate

Gradient Descent
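Error function: $E(\vec{w}) = \frac{1}{2} \sum_{d \in D} (t_d - o_d)^2$

$\Delta w_i = -\eta \frac{\partial E}{\partial w_i}$

$\Delta w_i = \eta \sum_{d \in D} (t_d - o_d) x_{id}$

$w_i \leftarrow w_i + \Delta w_i$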
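The step from the error function to the weight update can be filled in as follows. It assumes an unthresholded linear unit whose output on example $d$ is $o_d = \vec{w} \cdot \vec{x}_d$, which the slides do not state explicitly at this point.

```latex
\begin{aligned}
\frac{\partial E}{\partial w_i}
  &= \frac{\partial}{\partial w_i}\,\frac{1}{2}\sum_{d \in D}(t_d - o_d)^2 \\
  &= \sum_{d \in D}(t_d - o_d)\,\frac{\partial}{\partial w_i}\bigl(t_d - \vec{w}\cdot\vec{x}_d\bigr) \\
  &= -\sum_{d \in D}(t_d - o_d)\,x_{id}
\end{aligned}
```

Substituting into $\Delta w_i = -\eta \, \partial E / \partial w_i$ gives $\Delta w_i = \eta \sum_{d \in D} (t_d - o_d) x_{id}$.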

Gradient Descent Algorithm

GRADIENT-DESCENT(training_examples, $\eta$)
Each training example is a pair of the form $(\vec{x}, t)$, where $\vec{x}$ is the vector of input values and $t$ is the target output value. $\eta$ is the learning rate ($0 < \eta < 1$).
Initialize each $w_i$ to some small random value
Until the termination condition is met, Do
----Initialize each $\Delta w_i$ to zero
----For each $(\vec{x}, t)$ in training_examples, Do
--------Input the instance $\vec{x}$ to the unit and compute the output $o$
--------For each linear unit weight $w_i$, Do
------------$\Delta w_i \leftarrow \Delta w_i + \eta (t - o) x_i$
----For each linear unit weight $w_i$, Do
--------$w_i \leftarrow w_i + \Delta w_i$
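The algorithm above might look like this in Python for a single linear unit. It is a sketch under stated assumptions: the termination condition is a fixed number of passes (`epochs`), the bias input is 1.0, and the initial weight range and seed are arbitrary choices.

```python
import random

def gradient_descent(training_examples, eta=0.05, epochs=1000, seed=0):
    """Batch gradient descent for one linear unit; returns [w0, w1, ..., wn]."""
    rng = random.Random(seed)
    n = len(training_examples[0][0])
    # Initialize each weight to a small random value
    w = [rng.uniform(-0.05, 0.05) for _ in range(n + 1)]
    for _ in range(epochs):                # assumed termination condition
        delta = [0.0] * (n + 1)            # accumulated updates, reset each pass
        for x, t in training_examples:
            xs = [1.0] + list(x)           # prepend the bias input x0
            o = sum(wi * xi for wi, xi in zip(w, xs))   # linear unit output
            for i, xi in enumerate(xs):
                delta[i] += eta * (t - o) * xi
        # Weights change only after all examples have been seen
        w = [wi + di for wi, di in zip(w, delta)]
    return w
```

Fitting the points (0, 1), (1, 3), (2, 5), (3, 7), passed as `[([0.0], 1.0), ([1.0], 3.0), ...]`, drives the weights toward w0 = 1 and w1 = 2. Too large a learning rate makes the updates diverge, so `eta` has to be chosen with the data's scale in mind.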

Gradient Descent Issues

Converging to a local minimum can be very slow

The while loop may have to run many times

May converge to a local minimum

Stochastic Gradient Descent

Update the weights after each training example rather than all at once

Takes less memory

Can sometimes avoid local minima

$\eta$ must decrease with time in order for it to converge
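The stochastic variant changes only where the update happens. In the sketch below the decay schedule `eta0 / (1 + decay * epoch)`, the per-pass shuffling, and the fixed number of passes are assumptions chosen for illustration; the slides only say that the learning rate must decrease with time.

```python
import random

def stochastic_gradient_descent(training_examples, eta0=0.05, decay=0.001,
                                epochs=2000, seed=0):
    """Stochastic gradient descent for one linear unit; returns [w0, ..., wn]."""
    rng = random.Random(seed)
    n = len(training_examples[0][0])
    w = [rng.uniform(-0.05, 0.05) for _ in range(n + 1)]
    examples = list(training_examples)
    for epoch in range(epochs):
        eta = eta0 / (1.0 + decay * epoch)   # assumed schedule: eta shrinks over time
        rng.shuffle(examples)                # visit the examples in a random order
        for x, t in examples:
            xs = [1.0] + list(x)
            o = sum(wi * xi for wi, xi in zip(w, xs))
            # Update immediately, so no per-weight accumulator is needed
            w = [wi + eta * (t - o) * xi for wi, xi in zip(w, xs)]
    return w
```

Compared with the batch version there is no accumulated update to store, which is where the memory saving comes from.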

Multi-layer Neural Networks

A single perceptron can only learn linearly separable functions

We would like to make networks of perceptrons, but how do we determine the error of the output for an internal node?

Solution: the Backpropagation Algorithm

Differentiable Threshold Unit

We need a differentiable threshold unit in order to continue

Our old threshold function (1 if y > 0, -1 otherwise) is not differentiable

One solution is the sigmoid unit

Graph of Sigmoid Function

Sigmoid Function
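Output: $o = \sigma(\vec{w} \cdot \vec{x})$

$\sigma(y) = \frac{1}{1 + e^{-y}}$

$\frac{\partial \sigma(y)}{\partial y} = \sigma(y) (1 - \sigma(y))$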
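Both formulas are short enough to check numerically. The sketch below uses only the standard library; the function names are illustrative.

```python
import math

def sigmoid(y):
    """sigma(y) = 1 / (1 + e^(-y)), always strictly between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-y))

def sigmoid_derivative(y):
    """d sigma / dy = sigma(y) * (1 - sigma(y))."""
    s = sigmoid(y)
    return s * (1.0 - s)
```

At y = 0 the sigmoid is exactly 0.5 and its slope is 0.25, the steepest point of the curve. The derivative needs only the unit's own output, which is why the term $o_j (1 - o_j)$ appears in the backpropagation rule.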

Variable Definitions

$x_{ij}$ = the input to unit $j$ from unit $i$

$w_{ij}$ = the weight associated with the input to unit $j$ from unit $i$

$o_j$ = the output computed by unit $j$

$t_j$ = the target output for unit $j$

outputs = the set of units in the final layer of the network

Downstream(j) = the set of units whose immediate inputs include the output of unit $j$

Backpropagation Rule
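$E_d(\vec{w}) = \frac{1}{2} \sum_{k \in outputs} (t_k - o_k)^2$

$\Delta w_{ij} = -\eta \frac{\partial E_d}{\partial w_{ij}}$

For output units: $\Delta w_{ij} = \eta (t_j - o_j) o_j (1 - o_j) x_{ij}$

For internal units: $\Delta w_{ij} = \eta \delta_j x_{ij}$, where $\delta_j = o_j (1 - o_j) \sum_{k \in Downstream(j)} \delta_k w_{jk}$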

Backpropagation Algorithm

For simplicity, the following algorithm is for a two-layer neural network, with one output layer and one hidden layer

Thus, Downstream(j) = outputs for any internal node j

Note: Any boolean function can be represented by a two-layer neural network!

BACKPROPAGATION(training_examples, $\eta$, $n_{in}$, $n_{out}$, $n_{hidden}$)
Create a feed-forward network with $n_{in}$ inputs, $n_{hidden}$ units in the hidden layer, and $n_{out}$ output units
Initialize all the network weights to small random numbers (e.g. between -.05 and .05)
Until the termination condition is met, Do
---For each $(\vec{x}, \vec{t})$ in training_examples, Do
------Propagate the input forward through the network:
------Input the instance $\vec{x}$ to the network and compute the output $o_u$ for every unit $u$ in the network
------Propagate the errors backward through the network:
------For each network output unit $k$, calculate its error term $\delta_k$: $\delta_k = o_k (1 - o_k) (t_k - o_k)$
------For each hidden unit $h$, calculate its error term $\delta_h$: $\delta_h = o_h (1 - o_h) \sum_{k \in outputs} w_{hk} \delta_k$
------Update each network weight $w_{ij}$: $w_{ij} \leftarrow w_{ij} + \eta \delta_j x_{ij}$
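One pass of the algorithm for a single training example can be sketched in plain Python as follows. The data layout (each unit is a list of weights with the bias weight at index 0), the function names, and the seed are assumptions made for this example, not the code that accompanies the slides.

```python
import math
import random

def sigmoid(y):
    return 1.0 / (1.0 + math.exp(-y))

def init_network(n_in, n_hidden, n_out, seed=0):
    """Return (hidden_weights, output_weights) with small random values."""
    rng = random.Random(seed)
    def layer(n_units, n_inputs):
        # One weight per input, plus a bias weight stored at index 0
        return [[rng.uniform(-0.05, 0.05) for _ in range(n_inputs + 1)]
                for _ in range(n_units)]
    return layer(n_hidden, n_in), layer(n_out, n_hidden)

def forward(net, x):
    """Propagate x forward; return layer inputs (with bias) and the outputs."""
    hidden_w, output_w = net
    xs = [1.0] + list(x)
    h = [sigmoid(sum(w * v for w, v in zip(unit, xs))) for unit in hidden_w]
    hs = [1.0] + h
    o = [sigmoid(sum(w * v for w, v in zip(unit, hs))) for unit in output_w]
    return xs, hs, o

def backprop_step(net, x, t, eta):
    """Update the weights in place for one example (x, t)."""
    hidden_w, output_w = net
    xs, hs, o = forward(net, x)
    # Error terms for output units: o_k (1 - o_k) (t_k - o_k)
    delta_out = [ok * (1 - ok) * (tk - ok) for ok, tk in zip(o, t)]
    # Error terms for hidden units, computed before any weight changes
    delta_hid = [hs[h + 1] * (1 - hs[h + 1]) *
                 sum(output_w[k][h + 1] * delta_out[k]
                     for k in range(len(output_w)))
                 for h in range(len(hidden_w))]
    # w_ij <- w_ij + eta * delta_j * x_ij
    for k, unit in enumerate(output_w):
        for j in range(len(unit)):
            unit[j] += eta * delta_out[k] * hs[j]
    for h, unit in enumerate(hidden_w):
        for i in range(len(unit)):
            unit[i] += eta * delta_hid[h] * xs[i]

def example_error(net, x, t):
    """E_d = 1/2 * sum over outputs of (t_k - o_k)^2."""
    _, _, o = forward(net, x)
    return 0.5 * sum((tk - ok) ** 2 for tk, ok in zip(t, o))
```

Training then amounts to calling `backprop_step` for each example, over and over, until the chosen termination condition is met. Nothing in this sketch guarantees that it reaches a global minimum.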

Momentum

Add a fraction $\alpha$ ($0 \le \alpha < 1$) of the previous update for a weight to the current update

May allow the learner to avoid local minima

May speed up convergence to the global minimum
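For a single weight, the momentum idea reduces to remembering the last update. In this sketch the name `momentum_update` and the convention that `gradient_step` already equals the learning rate times the error term times the input are assumptions for illustration.

```python
def momentum_update(weight, gradient_step, previous_update, alpha=0.3):
    """Return (new_weight, update) where update includes a momentum term.

    gradient_step is the ordinary update (eta * delta_j * x_ij);
    alpha * previous_update carries over part of the last change.
    """
    update = gradient_step + alpha * previous_update
    return weight + update, update
```

The caller keeps the returned `update` and passes it back in as `previous_update` on the next step. Repeated steps in the same direction therefore grow larger, while steps that alternate in sign partly cancel.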

When to Stop Learning

Learn until error on the training set is below some threshold

Bad idea! Can result in overfitting

If you match the training examples too well, your performance on the real problems may suffer

Learn trying to get the best result on some validation data

Validation data is data from your training set that is not trained on, but instead used to check the function

Stop when the performance on it seems to be getting worse, while saving the best network seen so far

There may be local minima, so watch out!
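The validation-based stopping rule can be sketched as a small loop. Everything named here is hypothetical: `train_one_epoch` and `validation_error` are callables supplied by the caller, and `patience` (how many non-improving passes to tolerate before stopping) is an added parameter meant to ride out temporary dips.

```python
import copy

def train_with_early_stopping(network, train_one_epoch, validation_error,
                              max_epochs=1000, patience=10):
    """Train until validation error stops improving; return the best network."""
    best_error = validation_error(network)
    best_network = copy.deepcopy(network)    # save the best network seen so far
    epochs_without_improvement = 0
    for _ in range(max_epochs):
        train_one_epoch(network)             # modifies network in place
        error = validation_error(network)
        if error < best_error:
            best_error = error
            best_network = copy.deepcopy(network)
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break                        # validation error keeps getting worse
    return best_network, best_error
```

Because the function returns the saved copy rather than the final state, training a few passes too long costs time but not accuracy.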

Representational Capabilities

Boolean functions

Every boolean function can be represented exactly by some network with two layers of units

Size may be exponential in the number of inputs

Continuous functions

Can be approximated to arbitrary accuracy with two layers of units

Arbitrary functions

Any function can be approximated to arbitrary accuracy with three layers of units

Example: Face Recognition

From Machine Learning by Tom M. Mitchell

Input: 30 by 32 pictures of people with the following properties:

Wearing eyeglasses or not

Facial expression: happy, sad, angry, neutral

Direction in which they are looking: left, right, up, straight ahead

Output: Determine which category it fits into for one of these properties (we will talk about direction)

Input Encoding

Each pixel is an input

30*32 = 960 inputs

The value of the pixel (0 to 255) is linearly mapped onto the range of reals between 0 and 1
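The input encoding amounts to flattening the image and dividing by 255. The sketch below assumes the image arrives as a list of rows of integer intensities; the function name is illustrative.

```python
def encode_pixels(image):
    """Flatten a 2-D grid of 0-255 intensities into a list of inputs in [0, 1]."""
    return [pixel / 255.0 for row in image for pixel in row]
```

A 30 by 32 image therefore becomes a list of 960 numbers, one per network input.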

Output Encoding

Could use a single output node with the classifications assigned to 4 values (e.g. 0.2, 0.4, 0.6, and 0.8)

Instead, use 4 output nodes (one for each value)

1-of-N output encoding

Provides more degrees of freedom to the network

The sigmoid function can never reach 0 or 1!

Use values of 0.1 and 0.9 instead of 0 and 1

Example: (0.9, 0.1, 0.1, 0.1) = left, (0.1, 0.9, 0.1, 0.1) = right, etc.
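A short sketch of this encoding and its inverse is given below. The order of the last two directions (up, then straight ahead) is an assumption, since the slide only spells out left and right; the function names are illustrative.

```python
# Output node order; "up" and "straight" positions are assumed
DIRECTIONS = ["left", "right", "up", "straight"]

def encode_target(direction, low=0.1, high=0.9):
    """1-of-N target vector using 0.1 and 0.9 instead of 0 and 1."""
    return [high if d == direction else low for d in DIRECTIONS]

def decode_output(outputs):
    """Pick the direction whose output node has the highest value."""
    return DIRECTIONS[outputs.index(max(outputs))]
```

Decoding by the largest output means the network does not have to hit 0.9 exactly. It only needs the correct node to be the highest of the four.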

Network structure
Diagram: the inputs x1, x2, ..., x960 feed 3 hidden units, which feed the outputs.

Other Parameters

Learning rate: $\eta$ = 0.3

Momentum: $\alpha$ = 0.3

Used full gradient descent (as opposed to stochastic)

Weights in the output units were initialized to small random values, but input weights were initialized to 0

Yields better visualizations

Result: 90% accuracy on test set!

Try it yourself!

Get the code from http://www.cs.cmu.edu/~tom/mlbook.html

Go to the Software and Data page, then follow the "Neural network learning to recognize faces" link

Follow the documentation

You can also copy the code and data from my ACM account (provided you have one too), although you will want a fresh copy of facetrain.c and imagenet.c from the website

/afs/acm.uiuc.edu/user/jcander1/Public/NeuralNetwork
