Geo 321
Geo 321
KIBABII UNIVERSITY
FACULTY OF EDUCATION AND SOCIAL
SCIENCES(FESS)
GEO 321: GEOGRAPHIC INFORMATION SYSTEMS
Course Lecturer: Nelly Masayi
Purpose of the course:
This course provides students with skills in Remote Sensing and Geographical Information Systems
Course Content
Introduction to Maps and Cartography
Introduction to Arc GIS Desktop (Arc View, Arc Catalogue, Arc Toolbox)
Basic Components of GIS
Vector and Rastor Data
Adding Marginal information on maps Using Arc GIS 10.8
Data Analysis of Vector Data
a. Selection of Features in Arc map
b. Overlaying of Features
c. Clipping of a point feature
d. Clipping of a line feature
e. Clipping of an area image
f. Clipping of a raster image
g. Labelling of features
h. Overlaying of features
i. Buffering of Point Feature
j. Buffering of Area
k. Use of Symbology
CAT 1
Transport Analysis
a. Shortest Distance to a particular structure
Further Reading
1) Slocum, T. A., McMaster, R. B., Kessler, F. C. and Howard, H. H. (2009). Thematic
Cartography and Geovisualization. London: Prentice Hall.
2) Peterson, G. N. (2009). GIS cartography: a guide to effective map design. Boca
Raton, FL: CRC Press.
McMaster DN, (1978) Map reading for East Africa, Longman
Brewer, C. A. (2008). Designed Maps: A Sourcebook for GIS Users. Redlands, CA: Esri
Press
History of Cartography volume 1 & 2. Cartography in prehistoric, ancient, and medieval
Europe and the Mediterranean-Edited by J. B. Harley and David Woodward
Web. Links: en.wikipedia.org/.../History_of_cartography.www.maphistory.info/
academic.emporia.edu/.../h_map.htm
Geographic – 80% of government data collected is associated with some location in space
Information - attributes, or the characteristics (data), can be used to symbolize and provide further
insight into a given location
System – a seamless operation linking the information to the geography – which requires hardware,
networks, software, data, and operational procedures.
COMPONENTS OF GIS
GIS is an integration of five basic components
People-
People are user of Geographic Information System. They run the GIS software. Hardware
and software have seen tremendous development which made people easy to run t he GIS
software. These tasks may be creating simple map or performing advance GIS analysis.
The people are main component for the successful GIS. People develop procedures and define
the task of GIS.
Data
The most important and expensive component of the Geographic Information System is
Data which is generally known as fuel for GIS. GIS data is combination of graphic and
tabular data. GIS data can be stored as vector or raster. Both type of data can be created
in house using GIS software or can be purchased. The process of creating the GIS data from
the analogue data or paper format is called digitization. Digitization process involves
registering of raster image using few GCP (ground control point) or known
coordinates. This process is widely known as rubber sheeting or georeferencing. Polygon,
lines and points are created by digitizing raster image. Raster image itself can be registered
with coordinates which is widely known as rectifying the image. Availability of accurate data
affects the kind of results.
Hardware
Hardware is Computer on which GIS software runs. Nowadays there are a different range
of computer, it might be Desktop or server based. Arc GIS Server is server based computer
where GIS software runs on network computer or cloud based. For computer to perform well
all hardware component must have high capacity. Some of the hardware components are:
Motherboard, Hard driver, processor, graphics card, printer. These all component
function together to run a GIS software smoothly. Hardware capability affects processing
speeds, ease of use and type of output available. The parts of Hardware include
Motherboard: It is board where major hardware parts are installed or It is a place where all
components gets hooked up.
if the organization has an existing DBMS that is being used for to satisfy other business
requirements. Often it is desirable for the same DBMS to be utilized in the GIS applications.
3. Editing Data and Updating Data
The primary function in the data storage and retrieval subsystem involves the editing and updating
of data. Frequently, the following data editing capabilities are required:
interactive editing of spatial data;
the ability to add, manipulate, modify, and delete both spatial features and attributes
(independently or simultaneously) ; and the
Given a map, GIS is able to update the map by adding title, legend, scale and grid lines.
4. Spatial Analysis
Spatial analysis involves the interactions within a GIS to answer questions, support decisions, and
reveal patterns. Spatial analysis includes all of the transformations, manipulations, and methods that
can be applied to geographic data to turn them into useful information. Spatial analysis can range
from simple analysis to very sophisticated analysis. In the context of GIS, analysis is “Deriving new
information from existing data” It is also the manipulation of data to solve a problem e.g. identify
all areas within 500m of a lake. The data manipulation and analysis subsystem in GIS allows the
user to define and execute spatial and attribute procedures to generate derived information A GIS
must be able to answer questions regarding the interaction of spatial relationships between multiple
data sets. GIS carries out various analysis operations.
i. Simple Query
It includes the identification of objects and their attributes either by location or attribute query. They
are the most basic of analysis operations, in which the GIS is used to answer simple questions posed
by the user. No changes occur in the database and no new data are produced.
ii. Buffering involves the ability to create distance buffers around selected features, be it
points, lines, or areas. Buffers are created as polygons because they represent an area
around a feature. Buffering is typically used with point or linear features.
The generation of buffers for selected features is frequently based on a distance from that
feature, or on a specific attribute of that feature. Buffering is also referred to as corridor or zone
generation with the raster data model. Usually, the results of a buffering process are utilized in
a topological overlay with another data layer. For example, to determine the volume of timber
within a selected distance of a cutline, the user would first buffer the cutline data layer. They
would then overlay the resultant buffer data layer, a buffer polygon, with the forest cover data
layer in a clipping fashion. This would result in a new data layer that only contained the forest
cover within the buffer zone. Since all attributes are maintained in the topological overlay and
buffering processes, a map or report could then be generated. In simple terms, buffering involves
creation of an area of interest around an object. It is often used in proximity analysis and
environmental impact assessment. Buffering can be done both on a line image, point or area. It
can also be done on a raster image.
iii. Measuring
Measurements are simple numerical values that describe aspects of geographic data. They include
measurement of simple properties of objects, such as length, area, or shape, and of the relationships
between pairs of objects, such as distance or direction.
Measuring distance is a very important component of GIS. In geography, we measure the length of
a section of a River or Road. You can measure distance on maps using the Measure Distance
tool which is located in the expanded toolbox. It is important to make tool active, by clicking on
the desired start location. The latitude and longitude coordinates of your starting point are
automatically registered in the Information Panel. By adding another node you complete one line,
and the position of the second node and the distance between the two nodes is given.
On the Data toolbar, click the Measure tool . When you click the Measure tool,
the Measure window appears. This dialog box allows you to set different options for how you
measure, including whether to measure lines, areas, or features; use snapping; and set which units
are reported. Measurements are displayed inside the window, so it is easy to copy and paste them
into other applications. The Measure window contains tools for measuring distance and features. By
default, the distance (line) measurement tool is enabled until you choose a different option from the
Measure window. The measure window also gives one an opportunity to measure area and
perimeter.
iv. Clipping
In GIS, to clip is to overlay a polygon on one or more target features (layers) and extract from
the target feature (or features) only the target feature data that lies within the area outlined by
the clip polygon. In other words, the boundaries of the second polygon are imposed on the first
polygon.
Land Use of Eastern Province clipped from Kenyan Land Use(clipping of vector image)
Fig: Visualization by use of different colours and by use of graduated colours(same colour, different
shade)
iv. Overlaying: Superimposing one or more map so that you end up having a new map.
Overlaying involves superimposing two or more map layers to produce a new map layer.
Example: a new genetically engineered variety of wheat grows well in dry environments, with
long growing seasons and alkaline soils. Given the availability of data on the length of the
growing season, moisture regime and soil alkalinity, where is the best place to plant the wheat?.
overlaying (superimposing) several maps showing (separately) water-budget, growing season
length, soil pH, sodium content, and so on. The GIS analysis can establish the locations where
all the favorable soil conditions coincide, as the places where the wheat will grow best
5. Storing data `
Data storage and retrieval subsystem organizes the data, spatial and attribute, in a form which
permits it to be quickly retrieved by the user for analysis, and permits rapid and accurate updates to
be made to the database. This component usually involves use of a database management system
(DBMS) for maintaining attribute data. Spatial data is usually encoded and maintained in a
proprietary file format. In contemporary geographic information systems, geospatial data is stored
as digital data.
Digital data is stored on a variety of physical media, depending on how quickly the data needs to
be accessed, how much data needs to be stored, and whether the data needs to continue to exist when
the digital device is turned off or rebooted.
• Random access memory (RAM) is made with silicon transistors to quickly store and access
data that is being actively used. RAM is fast but more expensive than other forms of memory,
and the data is lost when the device is turned off or rebooted
• Magnetic hard disks are spinning platters coated with magnetic material that stores data in
magnetic patterns on the disk. Hard drives can store very large amounts of data (in the
terabytes), but this data takes longer to access than RAM. Hard drives are a reliable,
established technology. Data on a hard drive remains even after the hard drive is powered
down, but hard drives do not last forever and will eventually fail, often taking their data with
them
• Flash memory is made with transistors like RAM, but built with a special structure (floating-
gate MOSFET) that allows the data to persist even if the power is turned off. Flash memory
has become ubiquitous in consumer devices (SD cards, thumb drives, smartphones, etc)
because it has high capacity and has become inexpensive over the past decade. Flash memory
is slowly replacing magnetic hard disks with solid-state drives that are faster and use less
power. However, flash memory is limited in the number of times it can be written to, so
solid-state drives do not last as long as magnetic hard drives and are prone to unexpected
failures
• Optical disks as compact disks (CDs) and digital versatile disks (DVDs) store bits as
indentations in aluminum or chemical films that are then encased in plastic disks. Optical
disks have high capacity and are inexpensive to manufacture in bulk. However, they are
generally used only for data that will not change for extended periods of time, and they are
commonly used to archive and backup data from magnetic and flash drives. It is
uncertain how long data on a CD or DVD can be expected to last, and optical disks are rapidly
becoming obsolete
• Magnetic floppy disks store data in a similar manner to magnetic hard disks, except on a
removable plastic disk nestled in a protective case. You may occasionally encounter old data
stored on floppy disks, although this technology is obsolete and unreliable. You should
migrate any important data off these disks and onto a hard drive as soon as possible so the
data is not lost to physical degradation
• Magnetic tape is a roll of plastic film coated with a magnetic material and used to store bits
in a similar way as magnetic hard drives. Although tape is one of the oldest technologies for
storing digital data, tape drives are still used to back up hard drives for long-term storage
6. Displaying data
A gis must be able to have tools for visualizing geographic features using a variety of symbology.
7. Output
GIS must be able to display data/results in a variety of formats, such as maps reports, and graphs.
The data output subsystem allows the user to generate graphic displays, normally maps, and tabular
reports representing derived information products. Maps, Charts and Reports are part of the output.
DATA STRUCTURE
GIS data can be separated into two categories: spatially referenced data which is represented by
vector and raster forms (including imagery) and attribute tables which is represented in tabular
format. Within the spatial referenced data group, the GIS data can be further classified into two
different types: vector and raster. Most GIS software applications mainly focus on the usage and
Point data is most commonly used to represent nonadjacent features and to represent discrete data
points. Points have zero dimensions; therefore you can measure neither length nor area with this
dataset. Examples would be schools, points of interest, and in the example below, bridge and culvert
locations. Point features are also used to represent abstract points. For instance, point locations could
represent city locations or place name
Line (or arc) data is used to represent linear features. Common examples would be rivers, trails, and
streets. Line features only have one dimension and therefore can only be used to measure
length. Line features have a starting and ending point. Common examples would be road
centerlines and hydrology. Symbology most commonly used to distinguish arc features from one
another are line types (solid lines versus dashed lines) and combinations using colors and line
thicknesses. In the example below roads are distinguished from the stream network by designating
the roads as a solid black line and the hydrology a dashed blue line.
Polygons are used to represent areas such as the boundary of a city (on a large scale map), lake, or
forest. Polygon features are two dimensional and therefore can be used to measure the area and
perimeter of a geographic feature. Polygon features are most commonly distinguished using either a
thematic mapping symbology (color schemes), patterns, or in the case of numeric gradation, a color
gradation scheme could be used.
Both line and point feature data represent polygon data at a much smaller scale. They help reduce
clutter by simplifying data locations. As the features are zoomed in, the point location of a school is
more realistically represented by a series of building footprints showing the physical location of the
campus. Line features of a street centerline file only represent the physical location of the street.
An example of a thematic raster dataset is called a Digital Elevation Model (DEM). Each cell
contains one value representing the dominate value of that cell. Raster datasets are intrinsic to most
spatial analysis. Data analysis such as extracting slope and aspect from Digital Elevation Models
occurs with raster datasets. Spatial hydrology modeling such as extracting watersheds and flow lines
also uses a raster-based system. Spectral data presents aerial or satellite imagery which is then often
used to derive vegetation geologic information by classifying the spectral signatures of each type of
feature.
Raster clip
Cuts out a portion of a raster dataset. An existing raster or vector layer can be used as the clip extent.
Composite Band 5, 6, 4
ELEVATION SURFACES
Elevation surfaces define height values across the extent of a map or scene. For example, when you
create features, you can specify the x,y location as you edit, but the z-value of the feature can be
derived from an elevation surface.
Spatial analysis of elevation surfaces may include
a. Digital Elevation Model
A digital elevation model (DEM) is a 3D computer graphics representation of elevation data to
represent terrain, commonly of a planet, moon, or asteroid. DEM is an important tool for topographic
parameterization especially for erosion, drainage analysis, hill slope hydrology among many other
studies.
Contours
Contour lines (or isohypses) are isolines showing equal elevation. Contour lines connect points of
equal elevation, and are therefore isogonic lines. This is the most common way of numerically
showing elevation, and is familiar from topographic maps. The vertical distance (elevation
difference) between contour lines is called the contour interval. The choice of a contour interval
depends on the map scale, the degree of relief, and the needs of the user for whom the map is
prepared. Contour lines are defined as lines linking points on a map which are the same height above
a certain fixed level. This fixed level is often sea level. However sea level is variable, so mapping
authorities usually define a fixed level called a datum. Contour lines are abstract; they do not exist
in reality. However a large amount of relevant information can be derived from them. This includes:
• the locations of high points; and
• the shape and slope of the ground
• the position and nature of ridge lines and valleys
Characteristics of Contours
All contours have the same characteristics. These characteristics include.
• As with all types of isolines, the distance between adjacent contour lines indicates the
steepness of the slope. When contour lines are close together, they represent a steep slope,
whereas when lines are far from each other, they represent a gradual slope. On concave
surfaces, contours are closer near the top. On convex surfaces, contours are closer near the
bottom
• Every fifth or tenth contour starting at sea level is an index contour, which is drawn as a
heavier line and is labelled.
• As contour lines represent points of equal elevations on the ground, they cannot merge or
cross each other on a map, except on vertical surfaces such as cliffs, caves or walls. However,
these contours are seldom mapped.
• As contour lines represent level lines, they are perpendicular to the lines of steepest slope.
• Contours crossing flat man-made surfaces such as roads and sidewalks, or large rivers should
be perpendicular to the edges of this surface. This is the most violated principal of contouring
in computer generated maps, and is caused by lack of sufficient field data to accurately
contour the features. If the scale of the map is sufficiently large, then the crown of the road
surface should also appear as a slight curvature in the contour line when it crosses the road
at angles that are perpendicular to the edges.
• Contours are consistent. For instance, if the contour interval is 2-m, then successive contours
must be consecutive multiples of two. As an example, successive contours of 122, 124, 128,
130 can never occur since the interval of 126 was skipped.
• A single contour line cannot lie between two
surfaces of higher or lower elevation. This
would indicate a knife-edge in nature. Due to
erosion, this type of surface does not naturally
exist.
Hill shade
Hillshading is a technique used to create a realistic view of terrain by creating a three-
dimensional surface from a two-dimensional display of it. Used as a background, hillshades provide
a relief over which you can draw raster data or vector data. In Arc gis, hill shades can be generated
from digital elevation models.
REMOTE SENSING
In order to study large areas of the Earth’s surface geographers use devices known as remote sensors.
These sensors are mounted on platforms such as helicopters, planes, and satellites that make it
possible for the sensors to observe the Earth from above.
Types of Sensor
A passive sensor system needs an external energy source. In most cases this source is the
sun. These sensors generally detect reflected and emitted energy wave lengths from a
phenomenon. The source of the object illumination is independent of the sensor and it is
a natural source). Passive systems are much more common than active systems Examples
of passive remote sensors include film photography, infrared, charge-coupled devices,
and radiometers.
The sun provides a very convenient source of energy for remote sensing. The sun's energy
is either reflected, as it is for visible wavelengths, or absorbed and then re-emitted, as it
is for thermal infrared wavelengths. Passive sensors can only be used to detect energy
when the naturally occurring energy is available. For all reflected energy, this can only
take place during the time when the sun is illuminating the Earth. There is no reflected
energy available from the sun at night. Energy that is naturally emitted (such as thermal
infrared) can be detected day or night, as long as the amount of energy is large enough to
be recorded. Passive sensors detect sunlight radiation reflected from the earth and thermal
radiation in the visible and infrared of the electromagnetic spectrum. Most passive
sensors make use of a scanner for imaging, e.g. LANDSAT. Equiped with spectrometers
they measure signals at several spectral bands simultaneously, resulting in so-called
multispectral images which allow numerous interpretations.
➢ Energy Source or Illumination (A) - the first requirement for remote sensing is to have an
energy source which illuminates or provides electromagnetic energy to the target of interest.
➢ Radiation and the Atmosphere (B) - as the energy travels from its source to the target, it will
come in contact with and interact with the atmosphere it passes through. This interaction may
take place a second time as the energy travels from the target to the sensor.
➢ Interaction with the Target (C) - once the energy makes its way to the target through the
atmosphere, it interacts with the target depending on the properties of both the target and the
radiation.
➢ Recording of Energy by the Sensor (D) - after the energy has been scattered by, or emitted
from the target, we require a sensor (remote - not in contact with the target) to collect and
record the electromagnetic radiation.
➢ Transmission, Reception, and Processing (E) - the energy recorded by the sensor has to be
transmitted, often in electronic form, to a receiving and processing station where the data are
processed into an image (hardcopy and/or digital).
➢ Interpretation and Analysis (F) - the processed image is interpreted, visually and/or digitally
or electronically, to extract information about the target which was illuminated.
➢ Application (G) -the final element of the remote sensing process is achieved when we apply
the information we have been able to extract from the imagery about the target in order to
better understand it, reveal some new information, or assist in solving a particular problem.
Active sensors, on the other hand, provide their own energy source for illumination. The
sensor emits radiation which is directed toward the target to be investigated. The sensor
can be an active system (where the satellite or the aircraft provides the source of
illumination, this technique is used when no suitable natural source of radiation exists.
The radiation reflected or backscattered from that target is detected and measured by the
sensor. Advantages for active sensors include the ability to obtain measurements anytime,
regardless of the time of day or season. Active sensors can be used for examining
wavelengths that are not sufficiently provided by the sun, such as microwaves, or to better
control the way a target is illuminated. However, active systems require the generation
of a fairly large amount of energy to adequately illuminate targets. Some examples of
active sensors are a laser fluorosensor and a synthetic aperture radar (SAR).
Most sensors record information about the Earth’s surface by measuring the transmission
of energy from the surface in different portions of the electromagnetic (EM) spectrum.
Because the Earth’s surface varies in nature, the transmitted energy also varies. This
variation in energy allows images of the surface to be created. Human eyes see this
variation in energy in the visible portion of the EM spectrum. Sensors detect variations
in energy in both the visible and non-visible areas of the spectrum.
Electromagnetic spectrum
Traditional aerial
photographs were black and white pictures based on camera and film technology.
Such photographs related to one region of the EM spectrum. Satellite images are
generally captured using sensors and digital technology. A sensor often records
simultaneously in several different regions of the spectrum creating multi-images
taken at the same time. The regions of the spectrum scanned are called “bands.
A single band image shows features in various grey tones but if several images are
combined, they can form a color composite. A true color composite is based on using
the red, green, and blue portions of the visible region of the EM spectrum. This type
of composite relates to what the human eye would see if a person was on a satellite
or aircraft looking down at the Earth.
Other band combinations form false color composites. A false color composite
generally enhances certain features on an image, features that might not be as
apparent on a true color composite. Using different color composites is one way that
a remote sensing specialist detects features on the Earth.
Remote sensing can be used for applications in several different areas, including:
a. Geology and Mineral exploration
b. Hazard assessment
c. Oceanography
d. Agriculture and forestry
e. Land degradation
f. Environmental monitoring
g. Remote Sensing Emerges as an Important Tool for Habitat and Species Conservation.
1. Shape
It refers to the general configuration of the outlines. e.g., road has linear shape, building,
polygon and trees as random shape
2. Size- Space occupied by the object in the image. It helps distinguish objects by measuring
them. We can have the smallest
3. Shadow- Sense inclination angle on the object while recapturing image. It helps interpret
manmade features. Helps interpret shape, i.e. valleys hills and terrains
4. Color- It appears in multispectral image. Use of human image makes it easy to interpret
5. Tone- Change in tones of one colour. Short plants light, trees dark colour. Dry land, light
colour while wet land have dark colour
6. Texture- Repetition of intensity of colour in the image
7. Pattern- Repetance and arrangement of objects. Arranged patterns may shows human made
features
Unsupervised classification
It is where the outcomes (groupings of pixels with common characteristics) are based on the software
analysis of an image without the user providing sample classes. The computer uses techniques to
determine which pixels are related and groups them into classes. The user can specify which
algorism the software will use and the desired number of output classes but otherwise does not aid
in the classification process. However, the user must have knowledge of the area being classified
when the groupings of pixels with common characteristics produced by the computer have to be
related to actual features on the ground (such as wetlands, developed areas, coniferous forests, etc.).
Supervised classification
It is based on the idea that a user can select sample pixels in an image that are representative of
specific classes and then direct the image processing software to use these training sites as references
for the classification of all other pixels in the image. Training sites (also known as testing sets or
input classes) are selected based on the knowledge of the user. The user also sets the bounds for
how similar other pixels must be to group them together. These bounds are often set based on the
spectral characteristics of the training area, plus or minus a certain increment (often based on
“brightness” or strength of reflection in specific spectral bands). The user also designates the number
of classes that the image is classified into. Many analysts use a combination of supervised and
unsupervised classification processes to develop final output analysis and classified maps.