
The Radial Basis Function Network

March 5, 2006
Radial Basis Functions (RBFs)

I Three layers of nodes in the network.
I It fits a curve to the data using a statistical measure for the
quality of fit, unlike BP, which relies on stochastic approximation.
I Hidden unit activation is determined by the distance between
the input vector and a prototype. Distance from a prototype
is a form of clustering.
Training

I There are two stages to training:
I 1.) Parameters of the basis functions (hidden units) are
determined using unsupervised learning. The first layer of
weights is trained using input data only.
I 2.) Calculate the second weight layer values. Create a linear
mapping from hidden layer activations to the target output
patterns.
I Both steps are relatively fast when compared to BP.
High Dimensionality Mapping

I The rationale for the two processing steps, the first non-linear
and the second linear, is attributed to Cover (1965), who showed
that a pattern classification problem cast into a high dimensional
space is more likely to be linearly separable (see the sketch at
the end of this slide).
I The number of hidden units in an RBF determines the dimensionality
of the space into which the input pattern is transformed. This is
why RBF networks normally have “large” numbers of hidden units.
Input patterns are transformed into a high dimensional space in
the hidden layer and are then linearly separated to classify the
results.
I Also related to the dimensionality of the hidden unit space is
the ability of the network to approximate a smooth
input-output mapping. A higher dimensionality implies a more
accurate and smoother mapping.
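
I A minimal sketch of this effect, assuming numpy and using XOR as a
toy problem (the centres and width below are chosen purely for
illustration): the four XOR points are not linearly separable in the
input space, but become separable after passing through two Gaussian
basis functions.

import numpy as np

# XOR: not linearly separable in the 2-D input space.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
t = np.array([0, 1, 1, 0], dtype=float)

# Two Gaussian basis functions centred on the two "class 1" points
# (centres and width chosen purely for illustration).
centres = np.array([[0, 1], [1, 0]], dtype=float)
sigma = 0.5

# Hidden-layer activations: one Gaussian per centre.
d2 = ((X[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2)
H = np.exp(-d2 / (2 * sigma ** 2))

# In the hidden space the class-1 points have H.sum(axis=1) near 1.0
# while the class-0 points sit near 0.27, so a single linear threshold
# on the hidden activations now separates the two classes.
print(H.sum(axis=1))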
Exact Interpolation

I Exact Interpolation is a technique for interpolating a set of
data points such that every input point is mapped exactly onto its
target. Every input point must appear as part of the system
used to model the data without any averaging or smoothing.
I This is not what is desired in Radial Basis Function networks.
In the figure, the exact interpolant represented by the solid line
passes through N data points using N basis functions (one per point).
Exact Interpolation Activation

I The input vectors are labeled x^n.
I The activation function is:

h(x) = Σ_n w_n θ(||x − x^n||)

where the output is a linear combination of basis functions θ with
weights w_n, each evaluated at the distance ||x − x^n|| between
input vector x and training pattern x^n.
I Where θ(x) is often a Gaussian:

θ(x) = exp(−x^2 / (2σ^2))

and σ is a smoothing parameter.
I Other functions for θ(x) can be used.
I For Exact Interpolation, only one value of σ is used.
I More than one value of σ can be used in an RBF. The amount of
smoothing may be constant or may vary over the curve. Variable
smoothing values are useful for data which have more noise or
variation within the samples and therefore require more smoothing.
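
I A minimal sketch of Exact Interpolation under these equations,
assuming numpy (the data and the value of σ are invented for
illustration): one basis function is placed on every data point and
the N x N linear system Φw = t is solved so that h(x^n) = t^n exactly.

import numpy as np

def exact_interpolation(X, t, sigma):
    # One Gaussian basis function per data point, a single shared sigma.
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)  # ||x^n - x^m||^2
    Phi = np.exp(-d2 / (2 * sigma ** 2))                     # N x N design matrix
    w = np.linalg.solve(Phi, t)                              # weights w_n
    return w, Phi

# Toy 1-D example: the fitted curve passes through every noisy sample.
X = np.linspace(0, 1, 10).reshape(-1, 1)
t = np.sin(2 * np.pi * X[:, 0]) + 0.1 * np.random.randn(10)
w, Phi = exact_interpolation(X, t, sigma=0.2)
print(np.allclose(Phi @ w, t))   # True: every training point is reproduced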
Problems with Exact Interpolation

I Interpolating every point is not desirable. Noisy data will
often produce oscillations in the fitted function, making it
unsmooth.
I Better generalization and a smoother curve can be produced if the
noise is averaged out.
I We would also like to reduce the number of basis functions to
less than the number of input patterns.
Radial Basis Function Networks

I Several changes to the Exact Interpolation algorithm create an
RBF:
I 1.) The number of basis functions M is much less than the
number of input patterns N.
I 2.) The centres of the basis functions are not constrained to
those given by the input vectors. Determining centres
becomes part of the training process.
I 3.) Instead of using a common width parameter σ, each basis
function has its own width σ_j whose value is determined during
training.
I 4.) A bias parameter is included in the linear sum. This
compensates for differences between the average of the basis
function activations and the average of the targets.
RBF Activation

I The equation for RBF activation is:

y_k(x) = Σ_{j=1..M} w_kj θ_j(x) + w_k0

where:

θ_j(x) = exp(−||x − u_j||^2 / (2σ_j^2))

and x is the d-dimensional input vector with elements x_i, and u_j is
the vector determining the centre of basis function θ_j.
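
I A minimal sketch of this forward pass, assuming numpy (the array
shapes are invented for illustration):

import numpy as np

def rbf_forward(X, centres, sigmas, W, b):
    """y_k(x) = sum_j w_kj * theta_j(x) + w_k0.

    X       : (N, d) input vectors
    centres : (M, d) basis function centres u_j
    sigmas  : (M,)   per-basis widths sigma_j
    W       : (M, K) second-layer weights w_kj
    b       : (K,)   biases w_k0
    """
    d2 = ((X[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2)  # (N, M)
    Theta = np.exp(-d2 / (2 * sigmas[None, :] ** 2))               # theta_j(x^n)
    return Theta @ W + b                                           # (N, K) outputs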
Network Training

I There are two stages to training:


1. determining the parameters of the basis functions through
unsupervised training using only the input data set
2. once the basis functions have been determined and their
parameters are set, the second layer weights w_kj are
determined using both input and output data (hidden units are
activated using an input pattern and the weights to the output
layer are then modified to produce the desired output for the
given input)
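
I A minimal sketch of the two stages, assuming numpy; stage 1 here uses
random selection of centres and an average-distance width heuristic
(both discussed later in these notes), which is only one of the
possible choices:

import numpy as np

def train_rbf(X, T, M, seed=0):
    rng = np.random.default_rng(seed)

    # Stage 1 (unsupervised): basis function parameters from input data only.
    centres = X[rng.choice(len(X), size=M, replace=False)]
    dists = np.sqrt(((centres[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2))
    sigma = dists[dists > 0].mean()                    # shared width heuristic

    # Stage 2 (supervised): linear mapping from hidden activations to targets.
    d2 = ((X[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2)
    Theta = np.exp(-d2 / (2 * sigma ** 2))
    Theta = np.hstack([Theta, np.ones((len(X), 1))])   # bias column w_k0
    W, *_ = np.linalg.lstsq(Theta, T, rcond=None)      # solved via SVD
    return centres, sigma, W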
Output Error Calculation

I Training step 2 is the easier of the two steps as it involves


solving a set of linear equations.
I The error at the output is normally calculated using a
sum-of-squares function:
E = (1/2) Σ_n Σ_k (y_k(x^n) − t_k^n)^2

where the sum over n runs over the input patterns, the sum over k runs
over the output nodes, y_k(x^n) is the achieved output for node k
given input x^n, and t_k^n is the desired output for node k given
input x^n. This calculates the difference between the desired and
achieved outputs for all training patterns and all output nodes.
Output Layer Notes

I The output weight layer is normally solved using singular


value decomposition.
I Other non-linear activation functions applied to the outputs,
and other choices of error function, are possible but generally
not used. These would make determining the second weight layer a
non-linear problem and a much more difficult optimization.
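
I A minimal sketch of this step, assuming numpy: the weights that
minimize the sum-of-squares error are W = Θ⁺T, where Θ⁺ is the
Moore-Penrose pseudo-inverse of the matrix of hidden activations,
computed here explicitly from a singular value decomposition.

import numpy as np

def solve_output_weights(Theta, T):
    # Theta: (N, M) hidden activations, T: (N, K) targets.
    U, s, Vt = np.linalg.svd(Theta, full_matrices=False)
    s_inv = np.where(s > 1e-10 * s.max(), 1.0 / s, 0.0)  # drop tiny singular values
    return Vt.T @ (s_inv[:, None] * (U.T @ T))           # W = V diag(1/s) U^T T

Dropping the near-zero singular values is what makes the SVD approach
robust when the hidden activations are nearly linearly dependent.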
Basis Function Optimization (First Layer Training)

I The operations performed can be described in terms of several


different techniques including:
I regularization theory
I noisy interpolation theory
I kernel regression
I function approximation
I estimation of posterior class probabilities
I All of these methods suggest that basis function parameters
should be chosen to represent the probability density of the
input data.
I The training procedure which results is an unsupervised
optimization of the basis function parameters. The basis
function centres uj can be regarded as prototypes for the
input vectors.
I Large amounts of labeled data can be difficult to obtain. The
RBF can use large amounts of unlabeled data to train the first
layer and a relatively small amount of labeled data to train the
second layer.
Problems with Full Problem Spaces

I A problem occurs if the basis functions are required to fill the
problem space. In this case, the number of basis functions
increases exponentially with the dimension of the problem.
This situation requires long training times and a large number
of training patterns.
I The RBF performs better if it isn’t required to represent the
entire problem space and only needs to work in a small subset
of it.
I The cost of this problem is particularly high when there are
input variables which have a high variance but which have
little effect in determining the output. Input variables with
this property are not uncommon.
I When basis functions are selected using only input data there
is no way to identify if the patterns are relevant.
Density Function Drawbacks

I The optimal choice for basis function parameters using density
estimation may not be optimal for creating the mapping to
the output values. This occurs when the density estimate does not
represent the real density, so the centres it produces are poorly
placed for the output mapping.
Determining Basis Function Centres (u_j)

Random Selection
I Select a random subset of input vectors from the training set.
I Doesn’t attempt to provide an optimal density estimation.
Can require an overly large number of basis functions to
achieve the desired performance.
I Often used to provide a set of starting values which can be
iteratively adapted to produce a better solution.
All Data
I Use all data points as basis function centres and selectively
remove centres which have the least disruption on
performance.
I Both of these methods provide only the centres u_j but not the
width parameter σ_j. This is often set equal to a multiple of
the average distance between the centres. This causes the
functions to overlap and therefore provides a relatively smooth
representation of the distribution of the training data. As all
widths are equal, this may not be the best solution.
I These methods do not supply optimal parameters but they are
very fast.
Determining Basis Function Centres (u_j)

Orthogonal Least Squares


I Involves starting with one basis function and adding more
which are selected to create the greatest reduction of error.
I Basis functions centres are taken from data points. Centres
are selected by constructing a set of orthogonal vectors in the
space spanned by the hidden unit activations for each training
pattern.
I This allows the choice of the next centre which produces
the greatest reduction in error. Orthogonal vectors are those
which are most “different” from those already chosen, and
should therefore reduce the error by the largest amount.
I If the algorithm is allowed to run to completion then it will
select all data points. Good generalization requires that it
stop before this occurs.
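
I A simplified sketch of the idea, assuming numpy: this version tries
every remaining data point as the next centre and keeps the one that
gives the largest drop in sum-of-squares error. It omits the explicit
orthogonalisation that makes the real OLS procedure efficient.

import numpy as np

def greedy_centre_selection(X, T, sigma, M):
    chosen, remaining = [], list(range(len(X)))
    for _ in range(M):
        best_err, best_i = np.inf, None
        for i in remaining:
            cand = X[chosen + [i]]                     # candidate centre set
            d2 = ((X[:, None, :] - cand[None, :, :]) ** 2).sum(axis=2)
            Theta = np.exp(-d2 / (2 * sigma ** 2))
            W, *_ = np.linalg.lstsq(Theta, T, rcond=None)
            err = ((Theta @ W - T) ** 2).sum()         # sum-of-squares error
            if err < best_err:
                best_err, best_i = err, i
        chosen.append(best_i)
        remaining.remove(best_i)
    return X[chosen]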
Determining Basis Function Centres (u_j)
Clustering Algorithms
I Instead of selecting a subset of data points for the centres we
can find a set of clusters which better represent the
distribution.
I Data points for centres may not be the best choice. Cluster
centres are not required to be data points.
I K-means clustering provides K centres where K must be
decided in advance. The algorithm divides the data into K
subsets so that the distance between cluster centres and
points is minimized.
I Selecting a value for K may not be obvious.
I An alternative to K-means clustering is to use a neural
network which performs a similar function. The Kohonen
Self-Organizing Feature Map generates a set of prototype
vectors which represent input vector-space values. These
prototypes can be used as basis function centres.
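
I A minimal K-means sketch for choosing centres, assuming numpy (K
must still be chosen in advance):

import numpy as np

def kmeans_centres(X, K, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    centres = X[rng.choice(len(X), size=K, replace=False)]
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centre.
        d2 = ((X[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        # Update step: each centre moves to the mean of its points.
        new = np.array([X[labels == k].mean(axis=0) if np.any(labels == k)
                        else centres[k] for k in range(K)])
        if np.allclose(new, centres):
            break
        centres = new
    return centres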
Determining Basis Function Centres (u_j)

Gaussian Mixture Models


I The problem is one of density estimation where the basis
functions are the components of a mixture density model
whose parameters are optimized by maximum likelihood.
I A purely statistical technique using no heuristics.
I Density is modeled using:

p(x) = Σ_{j=1..M} P(j) θ_j(x)

where P(j) is the prior probability that a data point was
generated by the j-th component and θ_j(x) are the basis
functions.
I The likelihood function is maximized with respect to P(j) and
the parameters of θj (x). This is done by computing the
derivatives of the likelihood function with respect to the
parameters and then optimizing them.
I Parameters can also be found using re-estimation methods
based on expectation maximization (EM) procedures.
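
I A minimal sketch of this approach using scikit-learn's GaussianMixture
(an assumed dependency; it runs the EM re-estimation mentioned above
rather than hand-derived likelihood gradients). The fitted means,
spherical variances, and mixing weights map onto the centres u_j,
widths σ_j, and priors P(j) of the basis functions.

import numpy as np
from sklearn.mixture import GaussianMixture

def gmm_basis_parameters(X, M, seed=0):
    gmm = GaussianMixture(n_components=M, covariance_type="spherical",
                          random_state=seed).fit(X)
    centres = gmm.means_                 # u_j
    sigmas = np.sqrt(gmm.covariances_)   # sigma_j (spherical std. deviations)
    priors = gmm.weights_                # P(j)
    return centres, sigmas, priors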
Comparing RBFs and Back-Propagation (Multi-Layer
Perceptron)

I Both approximate arbitrary non-linear functional mappings of


multidimensional spaces.
I Mappings are created through combinations of multiple
functions.
I Hidden units in BP apply a threshold function to a weighted sum
of the inputs. Hidden unit activation in an RBF uses a
distance to a prototype followed by a local transformation.
I BP uses a distributed representation where many hidden units
contribute to a solution for a given input. The training
process is highly non-linear and there are problems with local
minima. This can lead to slow convergence. RBFs use
localized basis functions and typically only a few hidden units
have significant activations.
I BP networks can have multiple layers of weights which can
vary in many ways. RBFs have a simple three layer structure
which does not change.
I BP generally uses one training method where RBFs can
determine the basis functions through many methods.
I All parameters in BP are determined simultaneously using a
single global supervised training strategy. RBFs are trained in
two stages, the first of which is unsupervised and the second of
which is a linear supervised method.
