Lab 05 Sol
Lab 05 Sol
BIS4435
Question 1
] Z HJ Z S
J Z HJH Z S
J Z J HH Z
S
J Z
J HZ
Z H ZS
J
Z J
H
H ZS
J Z
J S
H
Z
1 2 3
1
BIS4435 2
h) Is there any limit on how many data points (input patterns) this SOM
can analyse?
Answer: There is no such limit. You can feed as much data as you
wish into an SOM. The input space can be infinite.
i) How many clusters can this SOM detect in the input data?
Answer: Because the number of output nodes is five, this SOM can-
not distinguish between more than five different clusters. However, it
is possible that there will be fewer than five clusters detected in the
data.
j) If node D is the winner, which output nodes are its immediate neigh-
bours?
Question 2
A B
J
J
F C
J
E D
The output layer of this map consists of six nodes, A, B, C, D, E and F, which
are organised into a two–dimensional lattice with neighbours connected by
lines.
Each of the output nodes has two inputs x1 and x2 (not shown on the
diagram). Thus, each node has two weights corresponding to these inputs:
w1 and w2 . The values of the weights for all output in the SOM nodes are
given in the table below:
Node A B C D E F
w1 -1 0 3 -2 3 4
w2 2 4 -2 -3 2 -1
a) Calculate which of the six output nodes is the winner if the input
pattern is
x = (2, −4) ?
The answer should contain all the working.
b) After the winner for a given input x has been identified, the weights
of the nodes in SOM are adjusted using adaptation formula:
w0 = w + α h[x − w] ,
Answer: First, let us adapt the winner (node C). The neighbour-
hood h = 1.
! ! !
2 3 −1
x − wC = − =
−4 −2 −2
! !
−1 −0.5
α h[x − wC ] = 0.5 · 1 · =
−2 −1
! ! !
0 3 −0.5 2.5
wC = wC + α h[x − wC ] = + =
−2 −1 −3
The immediate neighbours of node C are nodes B and D. The neigh-
bourhood h = 0.5. The new weights of node B and D are:
! " ! !# !
0 0 2 0 0.5
wB = + 0.5 · 0.5 · − =
4 −4 4 2
! " ! !# !
0 −2 2 −2 −1
wD = + 0.5 · 0.5 · − =
−3 −4 −3 −3.25
All other nodes in the lattice have neighbourhood h = 0. Thus, their
weights do not change.
Question 3
What are the main similarities and differences between feed–forward neural
networks and self–organising maps?
BIS4435 5
• In FFNN the output values of nodes are important, and they are
defined by the activation functions. In SOM nodes do not have any
activation functions, and the output values are not important.
• In FFNN all the output nodes can fire, while in SOM only one.
Question 4
Answer: Maximum five types because there are only five output
nodes.
Answer: Fighter hets usually are small, very fast and carry very few
passengers. Thus, it can be represented by a vector (1, 5, 1), which is
exactly like the weights of node C (1 mark). Node C will be the
winner if the SOM is fed with a fighter jet vector, hence it must be
representing fighter jets (1 mark).
d) Suppose you were asked to change the design of the SOM in order to
take into account two additional parameters: Price and fuel consump-
tion. What would you need to change in this SOM?
Answer: Yes and no. Yes because ships can be described in terms of
size, speed, passenger load, price and fuel consumption. However, this
classification will not take into account some other features of ships
and airplanes that put them into different classes. Therefore, in order
to make this classification more precise, one would have to increase
the number of parameters (dimensions).
Question 5
of essay how a self–organising map (SOM) could be used for this analysis.
Why would the results, produced by an SOM, be particularly useful for the
reports presented to strategic managers?
Answer:
• Based on description above, the centralised data warehouse contains
information about thousands of customers, each avaluated on 50 pa-
rameters. Thus, the input dataset has several thousands points in
50–dimensional space.
• The SOM would need to have 50 inputs, and the output lattice would
have arbitrary number of nodes, but probably high enough to distin-
guish between many groups of customers.
• The SOM can show the distribution of customers and their clusters
on this output map.
• Because SOM shows results on one or two dimensional feature map,
they can easily be included into a report and help to visualise the
results of the analysis.
• The individual groups of customers, discovered by SOM, can then be
further investigated and new products and promotions can be designed
specifically for these groups.
Question 6
What are the main features of an SOM that can help to analyse the business
data in a data warehouse?
Answer:
• Reduction of dimensions: The multidimensional data from a data
warehouse can be fed into an SOM, which then can display the re-
sults on a one or a two–dimensional feature map.
• SOM can detect clusters in data (similar data points), which are dif-
ficult to spot in huge databases using other means.
BIS4435 8
Question 7
Answer: SOMs are widely used for data analysis (data mining), and for
this reason it is most obvious to use SOM during the intelligence phase.
SOMs can help to classify data by clusters, provide convenient representation
and visualisation of the clusters, reduce number of dimensions. SOM uses
an unsupervised learning algorithm that helps to find some structure in
data without prior knowledge about its. Can handle missing data.