EOL Quiz Answers Module 3
EOL Quiz Answers Module 3
Module Three
John Richards
The University of New South Wales
The Australian National University
2
Which band would you discard if you were trying to retain the best three of the four
features?
The first and second bands are highly correlated (0.85) as are the third and fourth
bands (0.86). However, band 3 also shows stronger correlations with bands 1 and 2
(0.31 and 0,39). No other band matches that, so band 3 would be the one to discard.
• Can you show that the classification time is quadratically dependent on the number
of bands for the maximum likelihood classifier? You may find the following
expression for the discriminant function helpful in this answer.
• In the correlation matrix below, what do the large white blocks near the top right
hand and bottom left hand corners indicate?
That the corresponding sets of bands are highly correlated.
• In the same correlation matrix, what are the large vertical and horizontal black
stripes through the matrix?
They are bands which correspond to significant water absorption, for which little
signal is received by the sensor.
3
• Suppose we wanted to compare directly the average within class covariance matrix
𝐶' and the among class covariance matrix 𝐶( , such as in the formula
$%
𝐶' 𝐶(
Is this a scalar and, if not, how can a scalar measure be derived from it?
Would we seek to minimise or maximise the expression?
That matrix product is a matrix. We want to maximise it, so that we have the largest
separation among the classes and the smallest class variances. If we wanted to
replace it by an appropriate scalar, we could use its trace.
• Can you verify the end points of the divergence and JM curves shown in the 7th slide
of this lecture?
Consider their formulas:
Divergence
𝑑!" (𝒙) = ½tr'(𝑪! − 𝑪" +(𝑪#$ #$ #$ #$
! − 𝑪" +, + ½tr{(𝑪! + 𝑪" )(𝒎! − 𝒎" +(𝒎! − 𝒎" )
%
𝟏 𝐂𝒊 0𝐂𝒋 #𝟏 𝟏 𝐂𝒊 0𝐂𝒋
in which 𝐵𝒊𝒋 (𝐱) = 𝟖
(𝒎𝒊 − 𝒎𝒋 )𝐓 6 𝟐
7 (𝒎𝒊 − 𝒎𝒋 ++ 𝟐 8 𝟏/𝟐 9
𝟐2𝐂𝒊 𝐂𝒋 2
The abscissa of the graphs is the distance between means. When that is zero, divergence
has a constant value equal to its first term, which is a function of the covariance matrices.
Likewise, for the JM distance 𝐵 is then just a function of the covariances, as is 𝐽, so that it
also has a constant value at the origin.
When the distance between the means is very large both divergence and 𝐵 increase
quadratically without bound. But the exponential term in the JM formula limits it to 2.
• Would the city block distance measure be preferable to Euclidean distance when
computing the weight adjustments in ReliefF?
The ReliefF algorithm, in common with most clustering techniques, is heavily reliant
on distance comparisons. The city block distance is faster to compute than Euclidean
distance
• If you did not know the class prior probabilities how would the within and among
class scatter matrix definitions be modified in NDA?
Equal priors could be assumed. Thus 𝑝(𝜔" ) = 1/𝐶.
• Is producer’s accuracy more important than user’s accuracy? You may wish to
answer this from two points of view—as the designer of a classification algorithm or
as a user interested in crop hectarages on a thematic map.
As the names imply, the user of a thematic map is interested in the map’s accuracy—
thus user’s accuracy is the key measure. On the other hand, the designer of a
classification strategy uses producer’s accuracy to see how well the algorithm and
strategy are working.
• If a classifier performed equally well on all classes would the user’s accuracies all be
the same?
Not necessarily; it depends on the errors of commission.
• Are there practical problems with cross validation? To answer this remember that a
classifier has to be trained as many times as there are partitions of the reference
data set.
5
The problem is one of cost (time to get an answer), which is directly related to the
number of classifications that have to be performed; in turn that is determined by
how many subsets the reference data is divided into.
• Explain carefully the third dot point in the summary of the previous slide
Assume, as a simple example, that a classifier has an accuracy of 100% on a water
class, but that class, although important, occupies only 1% of the scene. By
comparison, suppose a grassland class occupies the other 99% of the scene. Assume
the classifier has an accuracy of 80% on the grassland class. That means 20% of the
time it labels grassland as water (committing an error)—that’s equivalent to about
19.8% of the scene. Thus, the water class in the thematic map is therefore badly in
error—only 1 in 20 of the pixels labelled water are actually water (even though the
classifier got all the actual water pixels right!).
• Verify the average map accuracy figures in the table on the sixth slide of this lecture
From the slide following (the seventh) the map accuracy can be calculated as the
classifier accuracies on each class, weighted by the prior probabilities. Thus, for each
of the three sets of priors the calculations are, in order:
0.700*0.368+0.925*0.294+0.891*0.338=0.831
0.700*0.333+0.925*0.333+0.891*0.333=0.838
0.700*0.900+0.925*0.050+0.891*0.050=0.721
• In slide 11 of this lecture the number of testing pixels drops with the anticipated
level of map accuracy. Why?
Because fewer test pixels are need if the accuracy is high. For example, if the
accuracy were suspected of being 100% only 1 testing pixel is needed to verify that
fact. If the accuracy were about 50% then a large number would be needed so that
on random trials (samples) the 50% estimate would be reached reliably.
𝑁 is the number of pixels in the map and 𝑛 is the number of testing pixels used to
assess map accuracy.
If 𝑛 = 𝑁 the variance will be zero. The implication of this is as follows: There
will be uncertainty in the map accurate estimate derived from the n testing pixels,
which reduces as 𝑛 gets larger. In the ultimate, when 𝑛 = 𝑁—i.e. every map pixel is
checked—the real value of the map accuracy is found; in other words, there is then
no variance and thus no uncertainty.
• Explain the difference between confidence level and the tolerance on the estimated
accuracy 𝜀.
A given level of confidence tells us how sure we are that the map accuracy
determined from using reference samples lies within ±𝜀 of the true value.
• Verify the information class labels attached to the clusters in the ninth slide of this
lecture
8000
7000
6000
5000
4000
3000
2000
1000
0
0.511 0.634 0.847 1.617 2.153
wavelength in micrometres
The cluster centres are plotted above, whereupon they can be identified by a knowledge of
spectral response curves. In general, those cover types that are predominantly vegetation
have a response at 0.634 (red) lower than that at 0.511 (green), and a high response at
0.847 (near IR). Those cover types that are predominantly bare/buildings etc. have a
response at 0.634 higher than that at 0.511, and a high response at 0.847. Both vegetated
and bare surfaces have lower responses at 2.153 (mid IR). Water tends to be low at all
wavelengths and tends to drop with increasing wavelength.
7
• The following is a scatter diagram in the red near infrared space showing a range of
ground covers. Show how the simple vegetation index and NDVI would plot in such
a space.
7000
vegetation
6000
channel 29 (NIR) brightness
5000
4000
1000
water
0
0 2000 4000 6000 8000
channel 15 (vis red) brightness
6000 6000
channel 29 (NIR) brightness
5000 5000
4000 4000
3000 3000
2000 2000
1000 1000
0 0
0 2000 4000 6000 8000 0 2000 4000 6000 8000
channel 15 (vis red) brightness channel 15 (vis red) brightness
• Even though it is small there is some microwave energy naturally radiated by the
earth. If you wanted to form an image using natural microwave emissions what
would you have to consider in terms of pixels size?
In order to collect enough energy to create a measurable signal a very large pixel size
is needed.
8
500
450
400
ground range resolution
350
300
250
200
150
100
50
0
20 30 40 50 60 70
look angle
• What would be the vertical dimensions of a SAR antenna to achieve a swath width of
30km at an altitude of 900km and a look angle of 30 degrees and operating
wavelength of 20cm?
1
The beamwidth of an antenna is given by Θ = 2 rad. When projected onto the
1
ground at distance R this is gives a swath of 𝑆 = 𝑅 Θ= 2 𝑅. Thus, for this example, to
achieve a 30km swath width at 900km range, for 30 degrees look angle, the vertical
1
dimension of the antenna will be 𝑙= 3 𝑅=6.93m, where R=900km/cos30.
• For the same system as above what would be the values of the other system
parameters to achieve a resolution cell size of 30x30m?
The azimuth antenna length will be 60m (twice the azimuth resolution) and the
ranging pulse width would be 100ns.
9
• How would the resolution cell size change if the platform were at an altitude of
1100km?
Both the azimuth and range resolutions are independent of platform altitude, so
they don’t change.
• Refer to the slide in which we described radar cross section; would you expect 𝜎 to
change if the target were rotated so that it showed a different aspect to the
incoming wave front?
Radar cross section is defined such that the energy intercepted from the incoming
wave front is then re-radiated isotopically. Th energy intercepted will depend upon
the geometry of the object facing the incoming wave front. If it were a flat plate,
then in rotation it could go from looking like a large area intercepting energy from
the wave to an edge which would intercept virtually none. Thus, the orientation of
the target affects the measured radar cross section. Often the radar cross section of
an object will be displayed on polar coordinates, showing this dependence.
• With respect to your answer above, what about possible changes in 𝜎 4 with
incidence angle for earth surface scattering?
In general, the same applies. It depends, theoretically on the “shapes” of the
incremental scatterers that reside within a resolution cell. If they were all roughly
spherical, which night be the case for some vegetation canopies, then one would
expect only a weak dependence on incidence angle. If they were shaped, like facets
representing an undulating surface, then a stronger dependence would be seen
• By reference to the radar range equation, if the range is doubled how does that
affect the received power level?
Since the received power is proportional to the inverse fourth power of range, a
doubling of range leads to a sixteen-fold drop in received power.
• In the 6th slide of this lecture, describe how the pixel would appear in an image if it
included a large dominant scatterer, such as the building shown in slide 9
10
The radar cross section of the dominant scatterer will be larger than the scattering
coefficient multiplied by the size of the pixel, so that the pixel response becomes
essentially just that of the dominant scatterer. Among a group of pixels with the
same background, that pixel will show as a bright spot.
• The standard deviation of the speckle in a radar image, reduced by averaging over 𝑁
5(789)
looks, is given by 𝜎(𝑎𝑣𝑔𝑒) = , where 𝜎(𝑟𝑎𝑤) is the standard deviation of the
√<
speckle in the image as recorded (i.e. the raw image). Note here 𝜎 is the symbol for
standard deviation and not scattering coefficient.
The diagram below shows a pixel composed as the average of four raw recorded
pixels. Discuss what it implies, including the levels of speckle noise.
!# /4
!"
This diagram tells us that the radar system was designed as a four (azimuth) look
system. The azimuth resolution was designed to be four times better than the
ground range resolution, so that the four pixels above could be averaged in azimuth
to produce on pixel with a square shape. In doing so the four looks that are
averaged reduce the speckle noise (standard deviation) by one half.
• In the 5th slide the formula for reflection coefficient is given for both horizontal and
vertical polarisation. For vertical incidence show that polarization dependence
disappears.
%$ > $>! ?√>!
For 𝜃 = 0, 𝑐𝑜𝑠𝜃 = 1 𝑎𝑛𝑑 𝑠𝑖𝑛𝜃 = 0. Thus we have 𝜌= = %?√>! and 𝜌@ = > ! ? √> !
=
√ !
√>! $√>! ?% %$ >
= %?√>! . Thus, both reflection coefficients are the same, demonstrating
√>! √>! ?% √ !
that three is no polarization difference at vertical incidence.
• If you were interested in mapping surface roughness would you use small or large
incidence angles?
As seen in slide 8 there is a greater difference between the surface scattering
coefficients at larger angles of incidence. Therefore, larger angles would be
preferred. In addition, at smaller angles of incidence terrain distortion becomes a
problem which often masks (the small) surface roughness variations.
11
• Suppose the dielectric constant of a surface was close to unity. What does that say
about the reflection and transmission of the incident electric field (and thus power
density)?
*4,.$√%$,"-" .
If 𝜀7 = 1, then from the formulas on slide 5, 𝜌= = = 0 and 𝜌A =
*4,.?√%$,"-" .
$*4,.?√%$,"-" .
= 0. Thus there is no reflection from the surface; all incident energy
*4,.?√%$,"-" .
is transmitted into the medium.
• In the last slide before the summary above, fence lines are evident in the image.
Why?
They are wire fences, acting as strong scatterers. There is however a cardinal effect
evident; the fences running roughly north-south are easier to discern.
• If the total path of travel through dry sand was equivalent to four penetration
depths, what is the power density scattered back to the air-sand interface compared
with the incident level of power density just under the interface?
The power density would have dropped to 1[𝑒 B = 1[55 of its value just under the
interface.
• In slide 4 there seems to be other detail in the radar image adjacent to the paleo
river channel that also does not appear on the optical image. What might that detail
be?
Because that detail is not evident in the colour infrared image, which shows just the
sand sheet, it is below the sand. Typically, it would be the bedrock under the sand,
showing drainage patterns and surface roughness detail.
• In slide 5 show that the two expressions are the same when 𝜃=45 degrees.
Physically, why should that be the case?
C
For 𝜃=45 (i.e. B 𝑟𝑎𝑑𝑖𝑎𝑛𝑠) , 𝑠𝑖𝑛& (𝜃 + 𝜋/4) in the left-hand expression is 1. The 𝑠𝑖𝑛& 𝜃
% % %
term in the right-hand expression is = &. Thus, both expressions are the same.
√& √&
That happens because at 45 degrees, the ground reflection of the vertical flat plate
in the right-hand diagram then has the same dimensions as that vertical element.
• How would the shapes of the curves in slide 8 be altered if they were computed for a
ship at sea, where the side of the vessel and the horizontal ocean surface form a
dihedral corner reflector?
They would not drop off at the higher angles of incidence because there is no
equivalent of canopy attenuation.
12
• There is an implicit assumption in slides 5, 6 and 7— that is, that the incoming radar
beam is square on to the corner reflector facets. What would happen if that were
not the case?
The equivalent radar cross section would drop significantly. Ray tracing shows that
the return ray is then not parallel to the incident ray and will not be received by the
radar.
• The figure below shows the cross-section of a bridge over a smooth water body. In
radar imagery it often shows up as three reflections as indicated. Why?
direct scattering
(single bounce)
triple bounce
scattering
double bounce
scattering
• In slide 7 how would the back slope appear in a radar image if it was quite severe
such that it was hidden behind the summit of the mountain when viewed from the
radar?
The back slope would be in shadow and appear black in the radar image.
• In slide 7 how would the front slope appear in a radar image if was a vertical cliff?
Layover would occur, but with scattering from the backslope mixing with that from
the cliff.
• In slide 9 there is a bright patch on the bottom right-hand corner of the top Seasat
image. What might that be?
It is an urban region (Harrisburg), not easily seen on the second Seasat image
because of the direction of illumination (cardinal effect).
• Why are look angles in the mid-range of 35° - 50° good for most land applications in
radar remote sensing?
If the angle is too small, the variation of surface scatter with surface roughness is not
as good as it is at larger angles. Also, relief distortion is exaggerated at small angles.
At very large angles shadowing can be a problem, and all surface (and canopy)
responses fall off. So, a good compromise is mid-range angles.
vertical plate
horizontal
plate
reflected
ray
incident
ray
• A particular imaging radar operates with 100ns long ranging pulses and a look angle
of 45o. If an active radar calibrator has an overall time delay of 500ns, how far will it
be displaced in the range direction in the recorded image?
The ground range resolution at a look angle of 45 degrees will be approximately
64m. That corresponds to the reflections from the ranging pulse being 100ns apart.
A reflection received 500ns from the first reflection will correspond to 5 resolutions
cells later in the range direction, which is the displacement of the ARC response.
Thus, on reception we then know that that response came from an object placed 5
resolution cells earlier.
• How do the round-trip path lengths differ between ping pong and standard modes of
InSAR operation? Assume that the distance to the surface is sufficiently large that
the beams from both antennas could be considered to be parallel.
To answer this question, we examine the situation close to the antennas, as below.
The orthogonal
baseline is !" = !$%&'
• For the ERS-1 example on slide 4, what height variation corresponds to one full 2𝜋
cycle of phase variation?
From the result at the bottom of the slide 0.17rad of phase change corresponds to a
metre of height variation. Therefore 2𝜋 change in phase will be caused by a height
variation of 2𝜋/0.17 =37m.
• In the example on slide 2 what vertical change does the example represent, if the
phase change was due to vertical movement alone?
For a change in slant range of ∆𝑟7 = 14mm the corresponding change in ground
range is seen on the slide to be 14 sin23 = 5.5mm. The corresponding change in
height would be 14 cos23 = 12.9mm, if it were a height variation that led to the
change in slant range.
• How should the temporal baseline in along-track interferometry relate to the time of
occurrence of the change of interest?
The temporal baseline is the time between the two acquisitions used to form the
along track interferometer. The time change in the scene of interest has to have
occurred between the two acquisitions.
• Why are there the three images of the Sydney harbor bridge in slide 5
Each image corresponds to a reflection that arrives back at the radar receiver at a
given time. If the arrival times are different for different reflections from the same
target will appear several times in the same image. For the bridge, there are at least
three pathways for the incident radiation to take on its way to and from the bridge:
one is a direct reflection from the bridge itself—the shortest path; one is a double
bounce reflection from the bridge to the sea surface and then back to the receiver
(and vice-versa)— longer path; one is a triple bounce involving bridge-water-bridge
again on the way back to the receiver—the longest path.
• A set of logical rules for labelling a pixel can be put in the form
If radar imagery says X and optical imagery says Y then the pixel should be labelled Z
Devise suitable rules that involve both data types to differentiate among the
following cover types in the same scene. Assume the optical data consists of just a
visible red band and a near infrared band, and the radar data is HH polarized, L band
imagery.
Grassland
Desert sand
Smooth dry agricultural soil
Coniferous forest
Lake water