Unit 4 QB GIS
Unit 4 QB GIS
operator of areas with ‘potato’ cover with the areas having clay
soil),
• The fields that do not have potato as crop (take the
complement of the potato areas).
2. Neighbourhood functions.
Whereas overlays combine features at the same location,
neighbourhood functions evaluate the characteristics of an area
surrounding a feature’s location. A neighbourhood function
‘scans’ the neighbourhood of the given feature(s), and performs a
computation on it.
• Search functions allow the retrieval of features that fall within a
given search window. This window may be a rectangle, circle, or
polygon.
• Buffer zone generation (or buffering) is one of the bestknown
neighbourhood functions. It determines a spatial envelope
(buffer) around (a) given feature(s). The created buffer may have
a fixed width, or a variable width that depends on characteristics
of the area.
• Interpolation functions predict unknown values using the known
values at nearby locations. This typically occurs for continuous
fields, like elevation, when the data actually stored does not
provide the direct answer for the location(s) of interest.
Interpolation of continuous data was discussed in Section 5.4.2.
• Topographic functions determine characteristics of an area by
looking at the immediate neighbourhood as well. Typical
examples are slope computations on digital terrain models (i.e.
continuous spatial fields). The slope in a location is defined as the
plane tangent to the topography in that location. Various
computations can be performed, such as:
– determination of slope angle,
– determination of slope aspect,
– determination of slope length,
– determination of contour lines. These are lines that connect
points with the same value (for elevation, depth, temperature,
barometric pressure, water salinity etc).
3. Connectivity functions.
These functions work on the basis of networks, including road
Tilak Education Society’s
S.K College of Science and Commerce, Nerul
Question Bank, TYIT
Subject: GIS
UNIT IV
ANS These belong to the most frequently used functionsin a GIS application.
They allow the combination of two (or more) spatial data layers
comparing them position by position, and treating areas of overlap—and
of non-overlap—in distinct ways. Many GISs support overlays through an
algebraic language, expressing an overlay function as a formula in which
the data layers are the arguments. In this way, we canfind
• The potato fields on clay soils (select the ‘potato’ cover in the
cropdata layer and the ‘clay’ cover in the soil data layer and perform
anintersection of the two areas found),
• The fields where potato or maize is the crop (select both areas
Tilak Education Society’s
S.K College of Science and Commerce, Nerul
Question Bank, TYIT
Subject: GIS
UNIT IV
of‘potato’ and ‘maize’ cover in the crop data layer and take their
union),
• The potato fields not on clay soils (perform a difference operator
ofareas with ‘potato’ cover with the areas having clay soil),
• The fields that do not have potato as crop (take the complement of
the
potato areas).
ANS
When exploring a spatial data set, the first thing one usually
wants is to select certain features, to (temporarily) restrict the
exploration. Such selections can be made on geometric/spatial
grounds, or on the basis of attribute data associated with the spatial
features. We discuss both techniques below. Interactive spatial
selection In interactive spatial selection, one defines the selection
condition by pointing at or drawing spatial objects on the screen
Tilak Education Society’s
S.K College of Science and Commerce, Nerul
Question Bank, TYIT
Subject: GIS
UNIT IV
display, after having indicated the spatial data layer(s) from which to
select features. The interactively defined objects are called the
selection objects; they can be points, lines, or polygons. The GIS
Selection objects then selects the features in the indicated data
layer(s) that overlap (i.e. intersect, meet, contain, or are contained
in; see Figure) with the selection objects. These become the selected
objects. spatial data stored in a geodatabase is associated with its
attribute data through a key/foreign key link. Selections of features
lead to selections on the records. Vice versa, selection of records
may lead to selection of features. Interactive spatial selection
answers questions like “What is at . . . ?” In Figure, the selection
object is a circle and the selected objects are the red polygons; they
overlap with the selection object.
Area Perimeter Ward_id Ward_nam
65420380.0000 41654.940000 1 KUNDUCHI Kinondoni 22106 27212.00
Figure 6.2: All city wards that overlap with the selection object—
here a circle—are selected (left), and their corresponding attribute
records are high lighted (right, only part of the table is shown). Data
from an urban application in Dar es Salaam, Tanzania. Data source:
Dept. of Urban & Regional Planning and Geo-information
Management, ITC.
was computed from them. In some case stud- ies, zonated buffers
must be determined, for instance in assessments of traffic noise
effects. Most GISs support this type of zonated buffer computation.
An illustration is provided in Figure.
(a) (b)
In vector-based buffer generation, the buffers themselves
become polygon fea- tures, usually in a separate data layer, that can
be used in further spatial analysis.
Figure : Buffer zone generation: (a) around main and minor
roads. Dif- ferent distances were ap- plied: 25 metres for minor
roads, 75 metres for main roads. (b) Zonated buffer zones around
main roads. Three different zones were obtained: at 100 metres
from main road, at 200, and at 300 metres.
Buffer generation on rasters is a fairly simple function. The
target location or lo- cations are always represented by a selection of
the raster’s cells, and geometric distance is defined, using cell
resolution as the unit. The distance function ap- plied is the
Pythagorean distance between the cell centres. The distance from a
non-target cell to the target is the minimal distance one can find
between that non-target cell and any target cell.
Tilak Education Society’s
S.K College of Science and Commerce, Nerul
Question Bank, TYIT
Subject: GIS
UNIT IV
ANS Flow computations determine how a phenomenon spreads over the area,
in principle in all directions, though with varying difficulty or resistance.
Tilak Education Society’s
S.K College of Science and Commerce, Nerul
Question Bank, TYIT
Subject: GIS
UNIT IV
There are also cases where a phenomenon does not spread in all
directions, but moves or ‘flows’ along a given, least-cost path, determined
again by local terrain characteristics. The typical case arises when we want
to determine the drainage patterns in a catchment: the rainfall water
‘chooses’ a way to leave the area.
This principle is illustrated with a simple elevation raster, in Figure.
For each cell in that raster, the steepest downward slope to a
neighbour cell is computed, and its direction is stored in a new
raster. This computation determines the elevation difference
between the cell and a neighbour cell,and takes into account cell
distance—1 for neighbour cells in N–S or W–E direction, root(2) for
cells in NE–SW or NW-SE direction. Among its eight
neighbourDetermining flow directioncells, it picks the one with the
steepest path to it. The directions in raster (b), thus obtained, are
encoded in integer values, and we have ‘decoded’ them for the sake
of illustration. Raster (b) can be called the flow direction raster. From
raster (b), the GIS can compute the accumulated flow count raster, a
raster that for each cell indicates how many cells have their water
flow into the cell.
Cells with a high accumulated flow count represent areas of
concentrated flow, and thus may belong to a stream. By using some
appropriately chosen threshold value in a map algebra expression,
we may decide whether they do. Cells with an accumulated flow
count of zero are local topographic highs, and can be used to identify
ridges.
Tilak Education Society’s
S.K College of Science and Commerce, Nerul
Question Bank, TYIT
Subject: GIS
UNIT IV
ANS The determination of neighbourhood of one or more target locations may de- pend
not only on distance—cases which we discussed above—but also on direction and
differences in the terrain in different directions. This typically is the case when the
target location contains a ‘source material’ that spreads over time,referred to as
diffusion. This ‘source material’ may be air, water or soil pollution, Diffusion and
spreadcommuters exiting a train station, people from an opened-up refugee camp, a
water spring uphill, or the radio waves emitted from a radio relay station. In all
these cases, one will not expect the spread to occur evenly in all directions. There
will be local terrain factors that influence the spread, making it easier or more
difficult. Many GISs provide support for this type of computation, and we discuss
some of its principles here, in the context of raster data.
Diffusion computation involves one or more target locations, which are better
called source locations in this context. They are the locations of the source of
whatever spreads. The computation also involves a local resistance raster,
which for each cell provides a value that indicates how difficult it is for the
‘source material’ to pass by that cell. The value in the cell must be normalized:
i.e. valid Resistancefor a standardized length (usually the cell’s width) of
spread path. From the source location(s) and the local resistance raster, the
GIS will be able to compute a new raster that indicates how much minimal
total resistance the spread has witnessed for reaching a raster cell. This
process is illustrated in Figure
While computing total resistances, the GIS takes proper care of the path
lengths. Obviously, the diffusion from a cell csrc to its neighbour cell to the
east ce isshorter than to the cell that is √its northeast neighbour cne.
The distance ratiobetween these two cases is 1 : 2. If val(c) indicates the local
Tilak Education Society’s
S.K College of Science and Commerce, Nerul
Question Bank, TYIT
Subject: GIS
UNIT IV
resistance value
for cell c, the GIS computes the total incurred resistance for diffusion
from csrc to
ce as 1 (val(csrc) + val(ce)), while the same for csrc to cne is 1
(val(csrc) + val(cne)) ×
√ 22
2. The accumulated resistance along a path of cells is simply the sum
of these incurred resistances from pairwise neighbour cells.
Since ‘source material’ has the habit of taking the easiest route to
spread, we must determine at what minimal cost (i.e. at what
minimal resistance) it may
have arrived in a cell. Therefore, we are interested in the minimal
cost path. To Minimal cost path
determine the minimal total resistance along a path from the source
location csrc to an arbitrary cell cx, the GIS determines all possible
paths from csrc to cx, and then determines which one has the lowest
total resistance. This value is found, for each cell, in the raster of
Figure.
For instance, there are three paths from the green source location to its
northeast neighbour cell (with local resistance 5). We can define them as
path 1 (N–E),
Q12 Explain two types of classification:
Tilak Education Society’s
S.K College of Science and Commerce, Nerul
Question Bank, TYIT
Subject: GIS
UNIT IV
User-controlled classification:
• In user-controlled classification, a user selects the attribute(s)
that will be used as the classification parameter(s) and
defines the classification method.
• The latter involves declaring the number of classes as well as
the correspondence between the old attribute values and the
new classes. This is usually done via a classification table.
• It is rather typical for cases in which the used parameter
domain is continuous
(as in household income). Then, the table indicates value
ranges to be mapped to the same category.
• Another case exists when the classification parameter is
nominal or at least discrete.
• We must also define the data format of the output, as a spatial
data layer, which will contain the new classification attribute.
The data type of this attribute is always categorical, i.e. integer
or string, no matter what is the data type of the attribute(s)
from which the classification was obtained.
• Sometimes, one may want to perform classification only on a
selection of features. In such cases, there are two options for
the features that are not selected.
One option is to keep their original values, while the other is to
assign a null value to them in the output data set. A null value
is a special value that means that no applicable value is
present. Care must be taken to deal with these values
correctly, both in computation and in visualization.
Automatic classification:
User-controlled classifications require a classification table or
user interaction.
GIS software can also perform automatic classification, in
Tilak Education Society’s
S.K College of Science and Commerce, Nerul
Question Bank, TYIT
Subject: GIS
UNIT IV
We have discussed the notion that real world processes are often
highly complex. Models are simplified abstractions of reality
representing or describing its most important elements and their
interactions. Modelling and GIS are more or less inseparable, as GIS
is itself a tool for modelling ‘the real world’ (or al least some part of
it).
The solution to a (spatial) problem usually depends on a (large)
number of pa- rameters. Since these parameters are often
interrelated, their interaction is made more precise in an application
model.
Here we define application models to include any kind of GIS based
model (in- cluding so-called analytical and process models) for a
specific real-world appli- cation. Such a model, in one way or other,
describes as faithfully as possible how the relevant geographic
phenomena behave, and it does so in terms of the parameters.
The nature of application models varies enormously. GIS applications
for famine relief programs, for instance, are very different from
earthquake risk assessment applications, though both can make use
of GIS to derive a solution. Many kinds of application models exist,
and they can be classified in many different ways. Here we identify
five characteristics of GIS-based application models:
differential equations.
Rule-based models attempt to model processes by using local
(spatial) rules. Cellular Automata (CA) are examples of models in this
category. These are often used to understand systems which are
generally not well understood, but for which their local processes
are well known. For example, the characteristics of neighbourhood
cells (such as wind direction and vegetation type) in a raster- based
CA model might be used to model the direction of spread of a fire
over several time steps.
Agent-based models (ABM) attempt to model movement and
development of multiple interacting agents (which might represent
individuals), often using sets of decision-rules about what the agent
can and cannot do. Complex agent-based models have been
developed to understand aspects of travel behaviour and crowd
interactions which also incorporate stochastic components.
Scale refers to whether the components of the model are individual
or aggregate in nature. Essentially this refers to the ‘level’ at which
the model operates.
Individual-based models are based on individual entities, such as the
agent-based
models described above, whereas aggregate models deal with
‘grouped’ data, Individual and aggregate
such as population census data. Aggregate models may operate on
data at the level of a city block (for example, using population census
data for particular social groups), at the regional, or even at a global
scale.