Lec 12
Artificial Neural Networks:
supervised and unsupervised
• A neural network is a massively parallel distributed computing system
that has a natural tendency for storing experiential knowledge and
making it available for use. It resembles the brain in two respects:
• Knowledge is acquired by the network through a learning process (called training)
• Interneuron connection strengths, known as synaptic weights, are used to store the knowledge
• Knowledge in artificial neural networks is implicit and distributed.
• Advantages
• Excellent for pattern recognition
• Excellent classifiers
• Handles noisy data well
• Good for generalization
Drawbacks
• The power of ANNs lies in their parallel architecture
– Unfortunately, most machines we have are serial (Von Neumann
architecture)
• Lack of defined rules to build a neural network for a specific
problem
– Too many variables, for instance, the learning algorithm, number of
neurons per layer, number of layers, data representation etc.
• Knowledge is implicit
• Data dependency
But these drawbacks do not mean that neural networks are useless artifacts. They are still arguably very powerful general-purpose problem solvers.
Learning methodology
o Supervised
Given a set of example input/output pairs, find a rule
that does a good job of predicting the output associated with
a new input.
o Unsupervised
Given a set of examples with no labeling, group them
into sets called clusters
Knowledge is not explicitly represented in ANNs. Knowledge
is primarily encoded in the weights of the neurons within the
network
Design phases of ANNs
• Feature Representation
– The number of features is determined by the number of inputs for the problem.
• Training
– Training is either supervised or unsupervised.
• Similarity Measurement
– A measure to tell the difference between the actual output of
the network while training and the desired labeled output
• Validation
– During training, the training data is divided into k subsets; k-1 subsets are used for training, and the remaining subset is used for cross-validation. This gives a more reliable estimate of performance and helps avoid over-fitting.
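A minimal Python sketch of this k-fold split, assuming index-based folds and an arbitrary choice of k = 5 (neither detail is specified on the slide):

import numpy as np

def k_fold_indices(n_samples, k=5, seed=0):
    # Shuffle the sample indices, split them into k folds, and let each fold
    # serve once as the validation set while the other k-1 folds are used for training.
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(n_samples), k)
    for i in range(k):
        train_idx = np.concatenate([folds[j] for j in range(k) if j != i])
        yield train_idx, folds[i]

# Example: 100 training samples split into 5 folds of 20 validation samples each
for train_idx, val_idx in k_fold_indices(100, k=5):
    print(len(train_idx), "training /", len(val_idx), "validation")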
Supervised
• Given a set of example input/output pairs, find a rule that does a good
job of predicting the output associated with a new input.
Back propagation algorithm
• 1. Randomize the weights {w_s} to small random values (both positive and negative)
• 2. Select a training instance t, i.e., a pair of input and output patterns with input vector {x_i(t)}, i = 1,...,N_inp, from the training set
• 3. Apply the input vector to the network input
• 4. Calculate the network output vector {z_k(t)}, k = 1,...,N_out
• 5. Calculate the errors e_k for each of the outputs, k = 1,...,N_out: the difference between the desired output and the network output
• 6. Calculate the necessary weight updates Δw_s in a way that minimizes this error
• 7. Adjust the weights of the network by Δw_s
• 8. Repeat these steps for each instance (input–output pair) in the training set until the error for the entire system falls below an acceptable threshold
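The following is a minimal Python sketch of the loop above for a single-hidden-layer network with sigmoid units and squared error; the layer sizes, learning rate, stopping threshold, and XOR-style toy data are illustrative assumptions, not part of the slide:

import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)   # input patterns
Y = np.array([[0], [1], [1], [0]], dtype=float)               # desired outputs
n_in, n_hid, n_out, lr = 2, 4, 1, 0.5

# Step 1: randomize the weights to small positive/negative values
W1 = rng.uniform(-0.5, 0.5, (n_in, n_hid))
W2 = rng.uniform(-0.5, 0.5, (n_hid, n_out))

for epoch in range(5000):
    total_error = 0.0
    for x, y in zip(X, Y):                 # Steps 2-3: present each training instance
        h = sigmoid(x @ W1)                # hidden-layer activations
        z = sigmoid(h @ W2)                # Step 4: network output vector
        err = y - z                        # Step 5: output errors
        total_error += float(err @ err)
        # Step 6: compute weight updates (gradient of the squared error)
        delta_out = err * z * (1 - z)
        delta_hid = (delta_out @ W2.T) * h * (1 - h)
        # Step 7: adjust the weights by the updates
        W2 += lr * np.outer(h, delta_out)
        W1 += lr * np.outer(x, delta_hid)
    if total_error < 1e-3:                 # Step 8: stop when the overall error is small
        break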
Unsupervised
• Given a set of examples with no labeling,
group them into sets called clusters
• A cluster represents some specific underlying
patterns in the data
• Useful for finding patterns in large data sets
• Form clusters of input data
• Map the clusters into outputs
• Given a new example, find its cluster, and
generate the associated output
Self-organizing neural networks:
clustering, quantization, function approximation, Kohonen maps
1. Each node's weights are initialized
2. A data input from training data (vector) is chosen at random and
presented to the cluster lattice
3. Every cluster centre is examined to calculate which weights are most
like the input vector. The winning node is commonly known as the Best
Matching Unit (BMU)
4. The radius of the neighborhood of the BMU is now calculated. Any
nodes found within this radius are deemed to be inside the BMU's
neighborhood
5. Each neighboring node's (the nodes found in step 4) weights are
adjusted to make them more like the input vector. The closer a node is to
the BMU, the more its weights get altered
6. Repeat steps for N iterations
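A compact Python sketch of the procedure above; the grid size, decay schedules for the radius and learning rate, and the random toy data are illustrative assumptions:

import numpy as np

rng = np.random.default_rng(0)
data = rng.random((200, 3))                      # training vectors (e.g., RGB colours)
grid_h, grid_w, n_iter = 10, 10, 1000
sigma0, lr0 = max(grid_h, grid_w) / 2.0, 0.1

# Step 1: initialize each node's weight vector
weights = rng.random((grid_h, grid_w, data.shape[1]))
rows, cols = np.meshgrid(np.arange(grid_h), np.arange(grid_w), indexing="ij")

for t in range(n_iter):
    x = data[rng.integers(len(data))]            # Step 2: pick a random input vector
    # Step 3: find the Best Matching Unit (node whose weights are most like x)
    dist = np.linalg.norm(weights - x, axis=2)
    bmu = np.unravel_index(np.argmin(dist), dist.shape)
    # Step 4: the neighborhood radius (and learning rate) shrink over time
    sigma = sigma0 * np.exp(-t / n_iter)
    lr = lr0 * np.exp(-t / n_iter)
    # Step 5: move nodes near the BMU toward x, more strongly the closer they are
    grid_dist2 = (rows - bmu[0]) ** 2 + (cols - bmu[1]) ** 2
    influence = np.exp(-grid_dist2 / (2 * sigma ** 2))[..., None]
    weights += lr * influence * (x - weights)
# Step 6: after N iterations, `weights` holds the organized cluster centres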
Different kinds of learning…
• Supervised learning:
  – Someone gives us examples and the right answer for those examples
  – We have to predict the right answer for unseen examples
• Unsupervised learning:
  – We see examples but get no feedback
  – We need to find patterns in the data
• Reinforcement learning:
  – We take actions and get rewards
  – Have to learn how to get high rewards
Example of supervised learning: classification
• We lend money to people
• We have to predict whether they will pay us back or not
• People have various (say, binary) features:
  – do we know their Address? do they have a Criminal record? high Income? Educated? Old? Unemployed?
• We see examples: (Y = paid back, N = not)
  +a, -c, +i, +e, +o, +u: Y
  -a, +c, -i, +e, -o, -u: N
  +a, -c, +i, -e, -o, -u: Y
  -a, -c, +i, +e, -o, -u: Y
  -a, +c, +i, -e, -o, -u: N
  -a, -c, +i, -e, -o, +u: Y
  +a, -c, -i, -e, +o, -u: N
  +a, +c, +i, -e, +o, -u: N
• Next person is +a, -c, +i, -e, +o, -u. Will we get paid back?
Classification…
• We want some hypothesis h that predicts whether we will be paid back
  +a, -c, +i, +e, +o, +u: Y
  -a, +c, -i, +e, -o, -u: N
  +a, -c, +i, -e, -o, -u: Y
  -a, -c, +i, +e, -o, -u: Y
  -a, +c, +i, -e, -o, -u: N
  -a, -c, +i, -e, -o, +u: Y
  +a, -c, -i, -e, +o, -u: N
  +a, +c, +i, -e, +o, -u: N
• Lots of possible hypotheses: will be paid back if…
  – Income is high (wrong on 2 occasions in training data)
  – Income is high and no Criminal record (always right in training data)
  – (Address is known AND ((NOT Old) OR Unemployed)) OR ((NOT Address is known) AND (NOT Criminal Record)) (always right in training data)
• Which one seems best? Anything better?
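A small Python sketch that checks the three candidate hypotheses against the eight training examples; the 0/1 encoding of the +/- features is an assumption made for illustration:

examples = [  # (address, criminal, income, educated, old, unemployed, paid_back)
    (1, 0, 1, 1, 1, 1, "Y"),
    (0, 1, 0, 1, 0, 0, "N"),
    (1, 0, 1, 0, 0, 0, "Y"),
    (0, 0, 1, 1, 0, 0, "Y"),
    (0, 1, 1, 0, 0, 0, "N"),
    (0, 0, 1, 0, 0, 1, "Y"),
    (1, 0, 0, 0, 1, 0, "N"),
    (1, 1, 1, 0, 1, 0, "N"),
]

hypotheses = {
    "income high": lambda a, c, i, e, o, u: i,
    "income high and no criminal record": lambda a, c, i, e, o, u: i and not c,
    "(address and (not old or unemployed)) or (not address and not criminal)":
        lambda a, c, i, e, o, u: (a and (not o or u)) or (not a and not c),
}

for name, h in hypotheses.items():
    errors = sum(("Y" if h(*ex[:6]) else "N") != ex[6] for ex in examples)
    print(f"{name}: {errors} error(s) on the training data")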
Occam's Razor
• Occam's razor: simpler hypotheses tend to generalize to future data better
• Intuition: given limited training data,
  – it is likely that there is some complicated hypothesis that is not actually good but that happens to perform well on the training data
  – it is less likely that there is a simple hypothesis that is not actually good but that happens to perform well on the training data
• There are fewer simple hypotheses
• Computational learning theory studies this in much more depth
Different approach: nearest neighbor(s)
• Next person is -a, +c, -i, +e, -o, +u. Will we get paid back?
• Nearest neighbor: simply look at the most similar example in the training data and see what happened there
  +a, -c, +i, +e, +o, +u: Y (distance 4)
  -a, +c, -i, +e, -o, -u: N (distance 1)
  +a, -c, +i, -e, -o, -u: Y (distance 5)
  -a, -c, +i, +e, -o, -u: Y (distance 3)
  -a, +c, +i, -e, -o, -u: N (distance 3)
  -a, -c, +i, -e, -o, +u: Y (distance 3)
  +a, -c, -i, -e, +o, -u: N (distance 5)
  +a, +c, +i, -e, +o, -u: N (distance 5)
• The nearest neighbor is the second example, so predict N
• k nearest neighbors: look at the k nearest neighbors and take a vote
  – E.g., the 5 nearest neighbors have 3 Ys and 2 Ns, so predict Y
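A quick Python sketch of the same idea, using Hamming distance over the six binary features (the 0/1 encoding is again an assumption):

examples = [
    ((1, 0, 1, 1, 1, 1), "Y"),
    ((0, 1, 0, 1, 0, 0), "N"),
    ((1, 0, 1, 0, 0, 0), "Y"),
    ((0, 0, 1, 1, 0, 0), "Y"),
    ((0, 1, 1, 0, 0, 0), "N"),
    ((0, 0, 1, 0, 0, 1), "Y"),
    ((1, 0, 0, 0, 1, 0), "N"),
    ((1, 1, 1, 0, 1, 0), "N"),
]
query = (0, 1, 0, 1, 0, 1)   # -a, +c, -i, +e, -o, +u

def hamming(x, y):
    # number of features on which the two examples disagree
    return sum(a != b for a, b in zip(x, y))

# 1-nearest neighbor: copy the label of the single closest example
ranked = sorted(examples, key=lambda ex: hamming(query, ex[0]))
print("1-NN prediction:", ranked[0][1])          # the closest example (distance 1) is an N

# k nearest neighbors: take a majority vote among the k closest
k = 5
votes = [label for _, label in ranked[:k]]
print(f"{k}-NN prediction:", max(set(votes), key=votes.count))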
Another approach: perceptrons
• Place a weight on every attribute, indicating how important that attribute is (and in which direction it affects things)
• E.g., wa = 1, wc = -5, wi = 4, we = 1, wo = 0, wu = -1
  +a, -c, +i, +e, +o, +u: Y (score 1+4+1+0-1 = 5)
  -a, +c, -i, +e, -o, -u: N (score -5+1 = -4)
  +a, -c, +i, -e, -o, -u: Y (score 1+4 = 5)
  -a, -c, +i, +e, -o, -u: Y (score 4+1 = 5)
  -a, +c, +i, -e, -o, -u: N (score -5+4 = -1)
  -a, -c, +i, -e, -o, +u: Y (score 4-1 = 3)
  +a, -c, -i, -e, +o, -u: N (score 1+0 = 1)
  +a, +c, +i, -e, +o, -u: N (score 1-5+4+0 = 0)
• Need to set some threshold above which we predict to be paid back (say, 2)
• May care about combinations of things (nonlinearity) – generalization: neural networks
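A short Python sketch of this perceptron-style scoring, using the weights and threshold from the slide (the 0/1 feature encoding is an assumption):

weights = {"a": 1, "c": -5, "i": 4, "e": 1, "o": 0, "u": -1}
threshold = 2

examples = [
    ({"a": 1, "c": 0, "i": 1, "e": 1, "o": 1, "u": 1}, "Y"),
    ({"a": 0, "c": 1, "i": 0, "e": 1, "o": 0, "u": 0}, "N"),
    ({"a": 1, "c": 0, "i": 1, "e": 0, "o": 0, "u": 0}, "Y"),
    ({"a": 0, "c": 0, "i": 1, "e": 1, "o": 0, "u": 0}, "Y"),
    ({"a": 0, "c": 1, "i": 1, "e": 0, "o": 0, "u": 0}, "N"),
    ({"a": 0, "c": 0, "i": 1, "e": 0, "o": 0, "u": 1}, "Y"),
    ({"a": 1, "c": 0, "i": 0, "e": 0, "o": 1, "u": 0}, "N"),
    ({"a": 1, "c": 1, "i": 1, "e": 0, "o": 1, "u": 0}, "N"),
]

for features, label in examples:
    # score = weighted sum over the features that are present
    score = sum(weights[f] for f, present in features.items() if present)
    prediction = "Y" if score > threshold else "N"
    print(f"score={score:3d}  predict={prediction}  actual={label}")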
Reinforcement learning
• There are three routes you can take to work: A, B, C
• The times you took A, it took: 10, 60, 30 minutes
• The times you took B, it took: 32, 31, 34 minutes
• The time you took C, it took: 50 minutes
• What should you do next?
• Exploration vs. exploitation tradeoff
  – Exploration: try to explore underexplored options
  – Exploitation: stick with options that look best now
• Reinforcement learning is usually studied in MDPs
  – Take an action, observe the reward and the new state
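A minimal epsilon-greedy sketch of the exploration/exploitation tradeoff over the three routes; the observed times come from the slide, while epsilon and the choice rule are illustrative assumptions:

import random

observed = {"A": [10, 60, 30], "B": [32, 31, 34], "C": [50]}
epsilon = 0.1   # fraction of the time we explore a random route

def average(times):
    return sum(times) / len(times)

def choose_route():
    if random.random() < epsilon:
        return random.choice(list(observed))                  # exploration: try any route
    return min(observed, key=lambda r: average(observed[r]))  # exploitation: best average so far

print("next route:", choose_route())
# Exploitation alone would always pick B (average ~32.3 min), but occasional
# exploration of A or C may reveal that one of them is actually better.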
Bayesian approach to learning
• Assume we have a prior distribution over the long-term behavior of A
  – With probability .6, A is a "fast route" which:
    • With prob. .25, takes 20 minutes
    • With prob. .5, takes 30 minutes
    • With prob. .25, takes 40 minutes
  – With probability .4, A is a "slow route" which:
    • With prob. .25, takes 30 minutes
    • With prob. .5, takes 40 minutes
    • With prob. .25, takes 50 minutes
• We travel on A once and see it takes 30 minutes
• P(A is fast | observation) = P(observation | A is fast) * P(A is fast) / P(observation) = .5*.6 / (.5*.6 + .25*.4) = .3 / (.3 + .1) = .75
• Convenient approach for decision theory, game theory
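A short Python check of the posterior computation above, using the prior and likelihoods from the slide:

prior = {"fast": 0.6, "slow": 0.4}              # P(A is fast), P(A is slow)
likelihood_30 = {"fast": 0.5, "slow": 0.25}     # P(trip takes 30 min | route type)

evidence = sum(prior[t] * likelihood_30[t] for t in prior)          # P(observation)
posterior_fast = prior["fast"] * likelihood_30["fast"] / evidence   # Bayes' rule
print(posterior_fast)   # 0.75, matching the slide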
Learning in game theory
• Like the 2/3-of-the-average game
• Very tricky because other agents learn at the same time
• From one agent's perspective, the environment is changing
  – Taking the average of past observations may not be a good idea