Chapter 2 Feature Representation Introto GISand Spatial A
Chapter 2 Feature Representation Introto GISand Spatial A
-raster
To work in a GIS environment, real world observations (objects or events that can be recorded
in 2D or 3D space) need to be reduced to spatial entities. These spatial entities can be
represented in a GIS as a vector data model or a raster data model.
2.1.1 Vector
Vector features can be decomposed into three different geometric primitives: points,
polylines and polygons.
2.1.1.1 Point
Figure 2.2: Three point objects defined by their X and Y coordinate values.
1 of 8 11/14/2022, 5:59 AM
Chapter 2 Feature Representation | Intro to GIS and Spatial Analysis https://mgimond.github.io/Spatial/chp02_0.html#vector-vs.-raster
We seem capable of interpreting such symbols as points, but there may be instances when
such interpretation may be ambiguous (e.g. is a round symbol delineating the area of a round
feature on the ground such as a large oil storage tank or is it representing the point location of
that tank?).
2.1.1.2 Polyline
A polyline is composed of a sequence of two or more coordinate pairs called vertices. A vertex
is defined by coordinate pairs, just like a point, but what differentiates a vertex from a point is
its explicitly defined relationship with neighboring vertices. A vertex is connected to at least
one other vertex.
Like a point, a true line can’t be seen since it has no area. And like a point, a line is symbolized
using shapes that have a color, width and style (e.g. solid, dashed, dotted, etc…). Roads and
rivers are commonly stored as polylines in a GIS.
2.1.1.3 Polygon
2 of 8 11/14/2022, 5:59 AM
Chapter 2 Feature Representation | Intro to GIS and Spatial Analysis https://mgimond.github.io/Spatial/chp02_0.html#vector-vs.-raster
Figure 2.4: A simple polygon object defined by an area enclosed by connected vertices.
A polygon is composed of three or more line segments whose starting and ending coordinate
pairs are the same. Sometimes you will see the words lattice or area used in lieu of ‘polygon’.
Polygons represent both length (i.e. the perimeter of the area) and area. They also embody
the idea of an inside and an outside; in fact, the area that a polygon encloses is explicitly
defined in a GIS environment. If it isn’t, then you are working with a polyline feature. If this
does not seem intuitive, think of three connected lines defining a triangle: they can represent
three connected road segments (thus polyline features), or they can represent the grassy strip
enclosed by the connected roads (in which case an ‘inside’ is implied thus defining a polygon).
2.1.2 Raster
Figure 2.5: A simple raster object defined by a 10x10 array of cells or pixels.
A raster data model uses an array of cells, or pixels, to represent real-world objects. Raster
datasets are commonly used for representing and managing imagery, surface temperatures,
digital elevation models, and numerous other entities.
A raster can be thought of as a special case of an area object where the area is divided into a
regular grid of cells. But a regularly spaced array of marked points may be a better analogy
since rasters are stored as an array of values where each cell is defined by a single coordinate
pair inside of most GIS environments.
3 of 8 11/14/2022, 5:59 AM
Chapter 2 Feature Representation | Intro to GIS and Spatial Analysis https://mgimond.github.io/Spatial/chp02_0.html#vector-vs.-raster
Implicit in a raster data model is a value associated with each cell or pixel. This is in contrast to
a vector model that may or may not have a value associated with the geometric primitive.
Also note that a raster data structure is square or rectangular. So, if the features in a raster do
not cover the full square or rectangular extent, their pixel values will be set to no data values
(e.g. NULL or NoData ).
The traditional vector/raster perspective of our world is one that has been driven by software
and data storage environments. But this perspective is not particularly helpful if one is
interested in analyzing the pattern. In fact, it can mask some important properties of the entity
being studied. An object vs. field view of the world proves to be more insightful even though it
may seem more abstract.
An object view of the world treats entities as discrete objects; they need not occur at every
location within a study area. Point locations of cities would be an example of an object. So
would be polygonal representations of urban areas which may be non-contiguous.
A field view of the world treats entities as a scalar field. This is a mathematical concept in
which a scalar is a quantity having a magnitude. It is measurable at every location within the
study region. Two popular examples of a scalar field are surface elevation and surface
temperature. Each represents a property that can be measured at any location.
Another example of a scalar field is the presence and absence of a building. This is a binary
scalar where a value of 0 is assigned to a location devoid of buildings and a value of 1 is
assigned to locations having one or more buildings. A field representation of buildings may not
seem intuitive, in fact, given the definition of an object view of the world in the last section, it
would seem only fitting to view buildings as objects. In fact, buildings can be viewed as both
field or objects. The context of the analysis is ultimately what will dictate which view to adopt. If
4 of 8 11/14/2022, 5:59 AM
Chapter 2 Feature Representation | Intro to GIS and Spatial Analysis https://mgimond.github.io/Spatial/chp02_0.html#vector-vs.-raster
we’re interested in studying the distribution of buildings over a study area, then an object view
of the features makes sense. If, on the other hand, we are interested in identifying all locations
where buildings don’t exist, then a binary field view of these entities would make sense.
2.3 Scale
How one chooses to represent a real-world entity will be in large part dictated by the scale of
the analysis. In a GIS, scale has a specific meaning: it’s the ratio of distance on the map to
that in the real world. So a large scale map implies a relatively large ratio and thus a small
extent. This is counter to the layperson’s interpretation of large scale which focuses on the
scope or extent of a study; so a large scale analysis would imply one that covers a large area.
The following two maps represent the same entity: the Boston region. At a small scale
(e.g. 1:10,000,000), Boston and other cities may be best represented as points. At a large
scale (e.g. 1:34,000), Boston may be best represented as a polygon. Note that at this large
scale, roads may also be represented as polygon features instead of polylines.
Figure 2.6: Map of the Boston area at a 1:10,000,000 scale. Note that in geography, this is
considered small scale whereas in layperson terms, this extent is often referred to as a large
scale (i.e. covering a large area).
Figure 2.7: Map of the Boston area at a 1:34,000 scale. Note that in geography, this is
considered large scale whereas in layperson terms, this extent is often referred to as a small
scale (i.e. covering a small area).
5 of 8 11/14/2022, 5:59 AM
Chapter 2 Feature Representation | Intro to GIS and Spatial Analysis https://mgimond.github.io/Spatial/chp02_0.html#vector-vs.-raster
Raster data can also have attributes only if pixels are represented using a small set of unique
integer values. Raster datasets that contain attribute tables typically have cell values that
represent or define a class, group, category, or membership. NOTE: not all GIS raster data
formats can store attribute information; in fact most raster datasets you will work with in this
course will not have attribute tables.
• Nominal data which have no implied order, size or quantitative information (e.g. paved
and unpaved roads)
• Ordinal data have an implied order (e.g. ranked scores), however, we cannot quantify the
difference since a linear scale is not implied.
• Interval data are numeric and have a linear scale, however they do not have a true zero
and can therefore not be used to measure relative magnitudes. For example, one cannot
say that 60°F is twice as warm as 30°F since when presented in degrees °C the
temperature values are 15.5°C and -1.1°C respectively (and 15.5 is clearly not twice as
big as -1.1).
• Ratio scale data are interval data with a true zero such as monetary value (e.g. $1, $20,
$100).
6 of 8 11/14/2022, 5:59 AM
Chapter 2 Feature Representation | Intro to GIS and Spatial Analysis https://mgimond.github.io/Spatial/chp02_0.html#vector-vs.-raster
Another way to categorize an attribute is by its data type. ArcGIS supports several data types
such as integer, float, double and text. Knowing your data type and measurement level
should dictate how they are stored in a GIS environment. The following table lists popular data
types available in most GIS applications.
While whole numbers can be stored as a float or double (i.e. we can store the number 2 as
2.0) doing so comes at a cost: an increase in storage space. This may not be a big deal if the
dataset is small, but if it consists of tens of thousands of records the increase in file size and
processing time may become an issue.
While storing an integer value as a float may not have dire consequences, the same cannot be
said of storing a float as an integer. For example, if your values consist of 0.2, 0.01, 0.34, 0.1
and 0.876, their integer counterpart would be 0, 0, 0, and 1 (i.e. values rounded to the nearest
whole number). This can have a significant impact on a map as shown in the following
example.
7 of 8 11/14/2022, 5:59 AM
Chapter 2 Feature Representation | Intro to GIS and Spatial Analysis https://mgimond.github.io/Spatial/chp02_0.html#vector-vs.-raster
8 of 8 11/14/2022, 5:59 AM