Neural Net HO

ok, still pretty basic, but finally added something on linear algebra, linear regression and kernels, and Gaussian process regression. hope it's somewhat coherent, or at least usable / correct, formula-wise. i would certainly appreciate being notified if any errors are discovered.


Neural Nets

\[ \sum_i f_i w_i > \text{threshold} \;\rightarrow\; \text{Output} = 1, \quad \text{else Output} = 0 \]

1 Simple Logical Connectives

threshold units (each input is in {1,0} and is multiplied by the weight after the *):

  and:  {1,0}*1, {1,0}*1            -> sum > 1.5 ?
  or:   {1,0}*1, {1,0}*1            -> sum > .5 ?
  not:  {1,0}*-1                    -> sum > -.5 ?

the same gates with an always-on bias input (1) and threshold 0:

  and:  {1,0}*1, {1,0}*1, 1*-1.5    -> sum > 0 ?
  or:   {1,0}*1, {1,0}*1, 1*-.5     -> sum > 0 ?
  not:  {1,0}*-1, 1*.5              -> sum > 0 ?
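a minimal sketch of such a threshold unit in Python (numpy assumed; the weights and thresholds are the first set above):

import numpy as np

def threshold_unit(f, w, threshold):
    # Output = 1 if the weighted sum of the inputs exceeds the threshold, else Output = 0
    return 1 if np.dot(f, w) > threshold else 0

# the gates above as (weights, threshold)
gates = {"and": ([1, 1], 1.5), "or": ([1, 1], 0.5), "not": ([-1], -0.5)}

for f1 in (0, 1):
    for f2 in (0, 1):
        print(f1, f2,
              "and:", threshold_unit([f1, f2], *gates["and"]),
              "or:", threshold_unit([f1, f2], *gates["or"]))
    print("not", f1, "->", threshold_unit([f1], *gates["not"]))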

2 Training

\[ \Delta w_i = C \cdot \mathrm{Error} \cdot f_i \]
\[ \mathrm{Error} = \mathrm{CorrectAnswer} - \mathrm{Output} \]

2.1 or, C=1, threshold=.5

ex.  f1  f2  CA  |  w1  w2  |  f1w1 + f2w2  |  Output  Error  |  ∆w1  ∆w2
 1    0   0   0  |   0   0  |       0       |    0       0    |   0    0
 2    1   0   1  |   0   0  |       0       |    0       1    |   1    0
 3    0   1   1  |   1   0  |       0       |    0       1    |   0    1
 4    1   1   1  |   1   1  |       2       |    1       0    |   0    0

2.2 or, C=.5, threshold=0
ex.  f1  f2  f3  CA  |  w1  w2  w3  |  ∑ fi wi  |  Output  Error  |  ∆w1  ∆w2  ∆w3
 1    0   0   1   0  |   0   0   0  |     0     |    0       0    |   0    0    0
 2    1   0   1   1  |   0   0   0  |     0     |    0       1    |  .5    0   .5
 3    0   1   1   1  |  .5   0  .5  |    .5     |    1       0    |   0    0    0
 4    1   1   1   1  |  .5   0  .5  |     1     |    1       0    |   0    0    0
 1    0   0   1   0  |  .5   0  .5  |    .5     |    1      -1    |   0    0  -.5
 2    1   0   1   1  |  .5   0   0  |    .5     |    1       0    |   0    0    0
 3    0   1   1   1  |  .5   0   0  |     0     |    0       1    |   0   .5   .5
 4    1   1   1   1  |  .5  .5  .5  |   1.5     |    1       0    |   0    0    0
 1    0   0   1   0  |  .5  .5  .5  |    .5     |    1      -1    |   0    0  -.5
 2    1   0   1   1  |  .5  .5   0  |    .5     |    1       0    |   0    0    0
 3    0   1   1   1  |  .5  .5   0  |    .5     |    1       0    |   0    0    0
 4    1   1   1   1  |  .5  .5   0  |     1     |    1       0    |   0    0    0
 1    0   0   1   0  |  .5  .5   0  |     0     |    0       0    |   0    0    0
 2    1   0   1   1  |  .5  .5   0  |    .5     |    1       0    |   0    0    0
 3    0   1   1   1  |  .5  .5   0  |    .5     |    1       0    |   0    0    0
 4    1   1   1   1  |  .5  .5   0  |     1     |    1       0    |   0    0    0
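a minimal sketch of this training loop in Python (numpy assumed); it runs the C=.5, threshold=0 example above, updating the weights after each example:

import numpy as np

C = 0.5                                  # learning rate
threshold = 0.0
examples = np.array([[0, 0, 1],          # f3 is the always-on bias input
                     [1, 0, 1],
                     [0, 1, 1],
                     [1, 1, 1]], dtype=float)
correct = np.array([0, 1, 1, 1])         # CA for 'or'

w = np.zeros(3)
for epoch in range(4):
    for f, ca in zip(examples, correct):
        output = 1 if np.dot(f, w) > threshold else 0
        error = ca - output              # Error = CorrectAnswer - Output
        w = w + C * error * f            # delta w = C * Error * f
        print(f, w, output, error)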

3 linear regression

In fact, neural nets are just basic linear algebra:


 
\[ \vec{x}^{\,T} \cdot \vec{w} \;=\; [\,x_a\ \ x_b\ \ x_c\ \ \ldots\,] \cdot \begin{bmatrix} w_a \\ w_b \\ w_c \\ \vdots \end{bmatrix} \;=\; x_a w_a + x_b w_b + x_c w_c + \ldots \]
and linear algebra makes linear regression incredibly easy!
assuming a least squares cost function,
with $t_n$ the true value, given $\vec{x}_n$,
and $\vec{x}_n^{\,T} \cdot \vec{w}$ our neural model's predicted value,

\[ \frac{1}{2}\sum_n \left(t_n - \vec{x}_n^{\,T} \cdot \vec{w}\right)^2 \tag{1} \]
the minimum (i.e. derivative = 0) is given where
\[ \vec{w} = (\mathbf{X}^T \cdot \mathbf{X})^{-1} \cdot \mathbf{X}^T \cdot \vec{t} \tag{2} \]

where $\mathbf{X}$ is the matrix whose rows are the input vectors $\vec{x}_n^{\,T}$, and $\vec{t}$ the vector of their true outcomes.
(gotta take that inverse so $\mathbf{X}^T \cdot \mathbf{X}$ needs to be nonsingular)
try it out in matlab (if wealthy or connected)
or Octave or R (for the rest of us, both quite remarkable tools for free)
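a minimal sketch of eq. 2 in Python/numpy (similar one-liners exist in Octave and R; the data here is made up):

import numpy as np

# toy data: the rows of X are the input vectors x_n, t their true outcomes
X = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0],
              [2.0, 1.0]])
t = np.array([1.0, 2.0, 3.0, 4.0])

# w = (X^T X)^{-1} X^T t   (eq. 2); solve() does the same job as the explicit inverse
w = np.linalg.solve(X.T @ X, X.T @ t)

print(w)          # the fitted weights
print(X @ w)      # the model's predictions x_n^T w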

4 kernels (gpr, in particular)

even more magical mathematics (i.e. not gone into here) also derives the following elegant alternative method of prediction:

\[ f(\vec{x}) = \vec{w}^{\,T} \cdot \vec{k} \tag{3} \]
\[ \vec{w} = (K + \lambda I_n)^{-1}\, \vec{t} \tag{4} \]

$I_n$ is the identity matrix, and $\lambda$ is whatever fudge room is needed (it can help with the inversion);
$K$ is an $n \times n$ matrix where $K_{ij} = k(\vec{x}_i, \vec{x}_j)$;
$\vec{k}_i = k(\vec{x}, \vec{x}_i)$, for all ($n$) observed $\vec{x}_i$.

and what’s k(xi , xj )? whatever function seems most useful!

“one of the most common Gaussian Process Regression kernels”:

\[ k(\vec{x}_i, \vec{x}_j) = \theta_1 \exp\!\left(-\frac{\theta_2}{2}\,\lVert \vec{x}_i - \vec{x}_j \rVert^2\right) + \theta_3\,(\vec{x}_i^{\,T} \cdot \vec{x}_j) + \theta_4 \tag{5} \]

the 3 terms are:

• the Gaussian kernel (note $\theta_2$ is basically the inverse of $\sigma^2$)
• a linear term (moderated by $\theta_3$)
• a constant offset ($\theta_4$)

play with the $\theta$ weights or optimize them against an error function.
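a minimal sketch of eqs. 3–5 in Python/numpy (the θ values, λ, and the observations are made up, just to show the mechanics):

import numpy as np

def k(xi, xj, th1=1.0, th2=1.0, th3=0.1, th4=0.1):
    # eq. 5: Gaussian term + linear term + constant offset
    return (th1 * np.exp(-0.5 * th2 * np.sum((xi - xj) ** 2))
            + th3 * np.dot(xi, xj) + th4)

# toy observations
X = np.array([[0.0], [1.0], [2.0], [3.0]])
t = np.array([0.0, 0.8, 0.9, 0.1])
lam = 1e-3                                       # the lambda fudge room

n = len(X)
K = np.array([[k(X[i], X[j]) for j in range(n)] for i in range(n)])
w = np.linalg.solve(K + lam * np.eye(n), t)      # eq. 4

def f(x):
    kv = np.array([k(x, X[i]) for i in range(n)])    # k_i = k(x, x_i)
    return np.dot(w, kv)                             # eq. 3: f(x) = w^T k

print(f(np.array([1.5])))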

5 classification

• For a binary classification, t is either 0 or 1.
• For multiple classifications, t becomes $\vec{t}$, and $\vec{t}$ becomes $\mathbf{T}$
(scalar becomes a vector, vector becomes a matrix): $\vec{t}_n = [0\ 0\ 0\ 1]$, or whatever.
• and that also means $\vec{w}$ becomes $\mathbf{W}$.
• ("logistic/sigmoidal") S-functions ($(1 + \exp(-x))^{-1}$) are good for classification because any
high enough input generates (nearly) a 1, and any low enough input generates (nearly) a 0.
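a minimal sketch of that squashing in Python (numpy assumed; the one-hot row of T is just an example):

import numpy as np

def sigmoid(x):
    # (1 + exp(-x))^{-1}: high enough input -> ~1, low enough input -> ~0
    return 1.0 / (1.0 + np.exp(-x))

print(sigmoid(5.0), sigmoid(-5.0))   # roughly 0.993 and 0.007

t_n = np.array([0, 0, 0, 1])         # one row of T: "class 4 of 4", or whatever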

6 multilayers and back propagation?

• don’t do it unless you have to.


• multilayers are said to be necessary for functions like exclusive ‘or’.
• the activation function must be differentiable (e.g. sigmoidal).
• it’s basically just gradient descent.
• it can get stuck in local minima.
• start with random weights, adjust until error is tolerable.
(do multiple times to check for better minima)
(as presented by R. Rojas: Neural Networks, Springer-Verlag, Berlin, 1996,
http://page.mi.fu-berlin.de/rojas/neural/chapter/K7.pdf)
for sigmoidal activation:

\[ \Delta w_{ij} = -\gamma\, o_i\, d_j \tag{6} \]
\[ d_j = o_j (1 - o_j)(o_j - t_j) \tag{7} \]
\[ d_j = o_j (1 - o_j) \sum_q w_{jq}\, d_q \tag{8} \]

\[ \frac{\partial E}{\partial w_{ij}} = o_i\, d_j\,, \qquad \frac{\partial}{\partial o_j}\,\tfrac{1}{2}(o_j - t_j)^2 = o_j - t_j \tag{9} \]
\[ s(x) = \frac{1}{1 + \exp(-Cx)}\,, \qquad \frac{\partial s(x)}{\partial x} = C\, s(x)\,(1 - s(x))\,, \qquad s(x) = o_j \tag{10} \]

$d_j$ is "the backpropagated error"
(or perhaps rather the product of the derivatives along the way (basically the chain rule), as shown by eqs. 9–10);
$E$ is the error.
at the hidden layers, $d_j$ instead sums over the weights and "errors" coming from the observed side (eq. 8).

$\gamma$ is "a learning constant"; $o_i$ is the output feeding into $w_{ij}$; $t_j$ is the desired output; and $C$ is whatever constant as well (eqs. 7–8 take $C = 1$).
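a minimal sketch of eqs. 6–8 in Python/numpy for one hidden layer, learning exclusive 'or' (the network size, learning constant, and C=1 are my own choices here, not from the handout; rerun with a different seed if it lands in a local minimum):

import numpy as np

def s(x):
    # eq. 10 with C = 1
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)
gamma = 0.5                                    # the learning constant

# exclusive 'or', with an always-on bias input appended
X = np.array([[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]], dtype=float)
T = np.array([0.0, 1.0, 1.0, 0.0])

W1 = rng.standard_normal((3, 2))               # inputs (+bias) -> 2 hidden units
W2 = rng.standard_normal(3)                    # hidden (+bias) -> 1 output

for epoch in range(10000):
    for x, t in zip(X, T):
        h = s(x @ W1)                          # hidden outputs
        hb = np.append(h, 1.0)                 # append the bias
        o = s(hb @ W2)                         # the output o_j

        d_out = o * (1 - o) * (o - t)          # eq. 7, at the output
        d_hid = h * (1 - h) * (W2[:2] * d_out) # eq. 8, at the hidden layer

        W2 -= gamma * hb * d_out               # eq. 6: delta w_ij = -gamma o_i d_j
        W1 -= gamma * np.outer(x, d_hid)

for x in X:
    print(x[:2], round(float(s(np.append(s(x @ W1), 1.0) @ W2)), 2))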
