PGIS Unit 2 Crash Course Contents
PGIS Unit 2 Crash Course Contents
IT
Sem-6 Principles of Geographic Information Systems
Unit II:
Data Management and Processing Systems
Hardware and Software Trends
Geographic Information Systems: GIS Software, GIS Architecture and
functionality, Spatial Data Infrastructure (SDI)
Stages of Spatial Data handling: Spatial data handling and preparation,
Spatial Data Storage and maintenance, Spatial Query and Analysis, Spatial
Data Presentation.
Database management Systems: Reasons for using a DBMS, Alternatives
for data management, The relational data model, Querying the relational
database.
GIS and Spatial Databases: Linking GIS and DBMS, Spatial database
functionality.
GIS can be a data store (i.e. a system that stores spatial data), a toolbox, a
technology, an information source or a field of science. The main
characteristics of a GIS software package are its analytical functions that
provide means for deriving new geoinformation from existing spatial and
attribute data.
The use of tools for problem solving is one thing, but the production of
these tools is something quite different. Not all tools are equally well-
suited for a particular application, and they can be improved and perfected
to better serve a particular need or application. The discipline of
geographic information science is driven by the use of our GIS tools.
There is no GIS package which is necessarily ‘better’ than another one: this
depends on factors such as the intended application, and the expertise of
its user. ILWIS’s traditional strengths are in raster processing and scientific
spatial data analysis, especially in project-based GIS applications.
Intergraph, ESRI and MapInfo products have been known better for their
support of vector-based spatial data and their operations, user interface
and map production (a bit more typical of institutional GIS applications).
Standards exist for all facets of GIS, ranging from data capture to data
presentation. They are developed by different organizations, of which the
most prominent are the Inter-national Organization for Standardization
(ISO) and the Open Geospatial Consortium (OGC).
The functions for capturing data are closely related to the disciplines of
surveying engineering, photogrammetry, remote sensing, and the
processes of digitizing, i.e. the conversion of analogue data into digital
representations.
The data, once obtained in some digital format, may not be quite ready for
usei n the system. This may be because the format obtained from the
capturing process is not quite the format required for storage and further
use, which means that some type of data conversion is required.
The way that data is stored plays a central role in the processing and the
eventual understanding of that data. In most of the available systems,
spatial data is organized in layers by theme and/or scale.
Vector data types describe an object through its boundary, thus dividing
the space into parts that are occupied by the respective objects. The raster
approach subdivides space into (regular) cells, mostly as a square
tessellation of dimension two or three. These cells are called either cells or
pixels in 2D, and voxels in 3D. The data indicates for every cell which real
world feature it covers, in case it represents a discrete field. In case of a
continuous field, the cell holds a representative value for that field.
Raster encoding
This simple encoding scheme is known as row ordering. The header of the
raster file will typically inform how many rows and columns the raster has,
which encoding scheme is used, and what sort of values are stored for
each cell. Raster files can be quite big data sets. For computational reasons,
it is wise to organize the long list of cell values in such a way that spatially
nearby cells are also near to each other in the list.
Data maintenance
Updating spatial data stems from the requirements that the data users
impose, as well as the fact that many aspects of the real world change
continuously. These data updates can take different forms. It may be that a
complete, new survey has been carried out, from which an entirely new
data set is derived that will replace the current set.
The most distinguishing parts of a GIS are its functions for spatial analysis,
i.e. operators that use spatial data to derive new geoinformation. Spatial
queries and process models play an important role in this functionality.
One of the key uses of
GISs has been to support spatial decisions. Spatial decision support
systems(SDSS) are a category of information systems composed of a
database, GIS software, models, and a so-called knowledge engine which
allow users to deal specifically with locational problems.
In a GIS, data are usually grouped into layers (or themes). Usually, several
themes are part of a project. The analysis functions of a GIS use the spatial
and non-spatial attributes of the data in a spatial database to provide
answers
to user questions. GIS functions are used for maintenance of the data, and
for analyzing the data in order to infer information from it. Analysis of
spatial data
can be defined as computing new information that provides new insight
from the existing, stored spatial data.
Method Devices
Hard copy printer
plotter (pen plotter, ink-jet printer, thermal
transfer printer, electrostatic plotter)
film writer
Soft copy computer screen
Output of magnetic tape
digital data sets CD-ROM or DVD
the Internet
Q What is Database management systems? What are reasons of using
DBMS in GIS?
There are various reasons why one would want to use a DBMS for data
storage and processing.
A table or relation is itself a collection of tuples (or records). In fact, each table
is a collection of tuples that are similarly shaped
.An attribute’s domain is a (possibly infinite) set of atomic values such as the
set Of integer number values, the set of real number values,etc
When a relation is created, we need to indicate what type of tuples it will
store. This means that we must
The three query operators have some traits in common. First, all of them
require input and produce output, and both input and output are relations!
This guarantees that the output of one query (a relation) can be the input of
another query, and this gives us the possibility to build more and more
complex queries
The first operator is Tuple selection works like a filter: it allows tuples that
meet the selection condition to pass, and disallows tuples that do not meet
the condition
Our third query operator differs from the two above in that it requires two
input relations.
The join operator takes two input relations and produces one output
relation, gluing two tuples together (one from each input relation), to form
a bigger tuple, if they meet a specified condition.
SELECT
FROM TitleDeed, Parcel
WHERE TitleDeed.Plot = Parcel.PId
GIS software provides support for spatial data and thematic or attribute
data. GISs have traditionally stored spatial data and attribute data
separately. This required the GIS to provide a link between the spatial data
(represented with rasters or vectors), and their non-spatial attribute data.
The strength of GIS technology lies in its built-in ‘understanding’ of
geographic space and all functions that derive from this, for purposes such
as storage, analysis, and map production.
DBMSs offer much better table functionality, since they are specifically
designed for this purpose. A lot of the data in GIS applications is attribute
data, so it made sense to use a DBMS for it. For this reason, many GIS
applications have made use of external DBMSs for data support. In this
role, the DBMS serves as a centralized data repository for all users, while
each user runs her/his own GIS software that obtains its data from the
DBMS. This meant that a GIS had to link the spatial data represented with
raster’s or vectors, and the attribute data stored in an external DBMS.
Currently, GIS software packages are able to store spatial data using a
range of commercial and open source DBMSs such as Oracle, Informix, IBM
DB2, Sybase, and PostgreSQL, with the help of spatial extensions. Some GIS
software have integrated database ‘engines’, and therefore do not need
these extensions.
ESRI’s ArcGIS, for example, has the main components of the MS Access
data- Spatial DBMS base software built-in. This means that the designer of
a GIS application can
choose whether to store the application data in the GIS or in the DBMS
SELECT R.Name
FROM Restaurants AS R, Hotels as H
WHERE R.Type = “Thai” AND H.name = “Hilton” AND
ST Intersects(R.Geometry, ST Buffer(H.Geometry, 2000))
In this case the WHERE clause uses the ST Intersects function to perform a
spatial join between a 2000 m buffer of the selected hotel and the selected
subset of restaurants. The Geometry column carries the spatial data.