UNIT 3 RS & Gis Notes
UNIT 3 RS & Gis Notes
Basic data types such as integers and/or real numbers are extended into spatial data
Data are managed using two types of computer-based files, physical files and
logical files.
The different types of information required for a GIS require storage which
allows the information to be updated and queried for analysis by the user.
There are two types of information to be stored; spatial data and attribute
data.
Spatial data: Spatial data is usually stored as themes, layers, or coverage. The
georeferenced spatial data is displayed in a GIS in its proper place in relation to other
spatial data because of the georeference information attached to the data. An example
Prepared by , Dr. Kavita Singh, Associate
could be the latitude and longitude in decimal
Professor,Civil degrees
Engineering Department, 8
MRIT
The latitude and longitude of this school is embedded in the file so it will be
displayed in a GIS in its proper location in relation to other features in the city.
Themes can be anything that are currently located on a traditional paper map, such as
roads, rivers, cities, wells, forest boundaries, school district boundaries, etc., and they
are all georeferenced.
OVERLAYING
is a digital representation of
Graphical Maps
In order to perform these operations, the computer is able to store, locate, retrieve, analyze
and manipulate the raw data derived from a number of sources by using representational file
structures. In other words, each graphical identity must be stored explicitly, along with its
attributes, so that, we can retrieve and select the correct combinations of entities and
attributes in a reasonable time. GIS database comprise spatial or entity or graphical database,
nonspatial or attribute database, and a linkage mechanism for their topology, to show the
relationship between the spatial data and attribute data for further analysis.
An entity (either a point, or a line, or an area) has both spatial and attribute data to describe
it. Spatial data can be known as "where
Prepared bythings are"
, Dr. Kavita Singh,data and attribute data the 'what things
Associate
Professor,Civil Engineering Department, 22
are' MRIT
For example, a point entity, the Charminar, a monument in
Hyderabad, has the reference in terms of a latitude and
longitude, and to accompany this there would be an attribute
data about the nature of Remote Sensing and GIS the real world -
feature that the point represents. More clearly, in this example,
Entity type is the point Spatial data are longitude and latitude
Attribute data is the monument, Charminar. Nonspatial
(attribute) data can be stored in any conventional databases,
whereas spatial data, which is the dominant data in GIS, should
have the database' which is capable of handling spatial data.
data (in a database). Stern and Stern (1993) consider that a DBMS will allow users
to join, manipulate or otherwise access the data in any number of database files. A
DBMS must allow the definition of data and their attributes and relationships, as
well as providing security, and an interface between the end users and their
The first three tasks are most likely to be performed by the database professional,
while the fourth will be required by a variety of user types possessing a range of skills
and experience as well as variable needs or requirements in terms of frequency and
flexibility of access
between the high-level objects in the query language statement and the
physical location of the data on the storage device. These mappings are
made using the system- catalogue. Access to DBMS data is handled by the
stored data manager, which is called the operating system for control of
physical access to storage devices. The DBMS has a query complier which
may call the query optimizer to optimize the code, so that the performance
Entities are describable by means of their attributes (for example, the name,
boundary, and population of a district). Entities have explicit relationships with other
entities. Entities are grouped into entity types, where entities of the same type have
the same attribute and relationship structure. An ERA diagram representing the
structure of these data in the example database. Entity types are represented by
rectangles with offshoot attributes and connecting edges showing relationships.
Prepared by , Dr. Kavita Singh, Associate
Professor,Civil Engineering Department, 30
MRIT
Prepared by , Dr. Kavita Singh, Associate
Professor,Civil Engineering Department, 31
MRIT
GIS DATA MODELS
The construction of models of spatial form can be taken as a series of stages of
data abstraction. By applying this abstraction process the GIS designer moves from
the position of observing the geographical complexities of the real world to one of
simulating them in the computer.
i) Identifying the spatial features from the real world that are of interest in the
context of an application. (ii) Representing the conceptual model by an
appropriate spatial data model. This involves choosing between one of the two
approaches: raster or vector. (iii) Selecting an appropriate spatial data structure
to store the model within the computer. The spatial data structure is the physical
way in which entities are coded for the purpose of storage and manipulation.
In raster representation , the terrain is divided into a number of parcels or quantized the
space into units. A parcel or a unit is called a grid cell. Raster data structures do not
provide precise locational information because geographic space is now divided into
discrete grids, as much as we divide a checkerboard into uniform squares. Instead of
representing points with their absolute locations, they are represented as a single grid
cell.
In grid-based or raster GIS, there are two general ways of including attribute data for
each entity. The simplest is to assign a single number representing an attribute like a
class of land cover, for each grid cell location. By positioning these numbers, we,
ultimately, are allowing the position of the attribute value to act as the default location
for the entity. For example, if we assign a code number of 10 to represent water, then
list this as the first number in the X or column direction, and the first in the Y or row
direction, by default the upper left grid cell is the location of a portion of the earth
Prepared by , Dr. Kavita Singh, Associate
representing water Professor,Civil Engineering Department, 35
MRIT
Prepared by , Dr. Kavita Singh, Associate
Professor,Civil Engineering Department, 36
MRIT
Vector Data Representation
The second method of representing geographic space, called vector, allows us to give
specific spatial locations explicitly. In this method it is assumed that geographic space is
continuous , rather than being quantized as small discrete grids. This perspective is
interconnected lines whose first and last coordinate points are the same
Anything that has a single (X, Y) coordinate pair not physically connected to any other
store the shape of a spatial entity. In the vector world the point is the basic building
block from which all spatial entities are constructed. The simplest spatial entity,
the point, is represented by a single (x, y) coordinate pair. Line and area entities
The more complex the shape of a line or area feature, the greater the number of
construct an entity is one of the major problems in vector based GIS data
representation.
combine the entity data with associated attribute data kept in a separate file through a
In vector data structures, a line consists of two or more coordinate pairs, again storing
the attributes for that line in a separate file. This is explained in the next section under
vector models. For straight lines, two coordinate pairs are enough to show location
and orientation in space. More complex lines will require a number of line segments,
GRID Model
The first and foremost model for the representation of raster data is the GIRD
model. The method of storing, manipulating, and analyzing the grid based data
was first conceptualized by an attempt to develop GRID model.
In this method, each grid cell is referenced and addressed individually and is
associated with identically positioned grid cells in all other coverages, rather like a
vertical column of grid cells, each dealing with a separate theme. Comparisons
between coverage are therefore performed on a single column at a time.
Prepared by , Dr. Kavita Singh, Associate
Professor,Civil Engineering Department, 44
MRIT
ex. For example, to compare soil attributes in one coverage with vegetation attributes
in a second coverage, land use/land cover attributes in a third coverage, each X and Y
location must be examined individually So a soil grid cell at location must be examined
individually.
Like the raster spatial data model, there are many potential vector data models that can
Prepared by , Dr. Kavita Singh, Associate
Professor,Civil Engineering Department, 47
be used to store the geometric representation
MRITof entities in the computer.
A point is the simplest spatial entity that can be represented in the vector
world with topology. A point requires to be topologically correct with respect
to a geographical reference system which locates it with respect to other
spatial entities. To have topology a line entity must consist of an ordered set
of points a locus of number points, (known as an arc, segment, or chain)
with a defined start and end points (nodes). The simplest vector data
structure that can be used to reproduce a geographical image in the
computer is a file containing (x, y) coordinate pairs that represent the
location of individual point features.
Such a vector data structure for a car park near Hussain Sagar lake in
Hyderabad. Now, how a closed ring of coordinate pairs defines the
boundary of the polygon, is clear. The limitations of simple vector data
structures start emerging when more complex spatial entities are
considered.
There are two types of data model in GIS
Spaghetti Model
Topological Models
Prepared by , Dr. Kavita Singh, Associate
Professor,Civil Engineering Department, 48
MRIT
Spaghetti Model
The simplest vector data structure that can be used to reproduce a geographical image
in the computer is a file containing (x, y) coordinate pairs that represent the location of
individual point features.
Each graphic object can be represented with a piece of spaghetti. Each piece of
spaghetti acts as a single entity. The shortest spaghetti can be represented as a point,
collection of a number of point spaghettis for a line entity and collections of line
segments that come together at the beginning and ending of surrounding areas form an
area entity. Each entity is a single, logical record in the computer, coded as variable
length strings of (x, y) coordinate pairs.
Let us assume that two polygons lie adjacent to each other in a thematic coverage. These
two adjacent polygons must hake separate pieces of spaghetti for adjacent sides. That is,
no two adjacent polygons share the same string of spaghetti.
Each side of polygon is uniquely defined by its own set of lines and coordinate pairs. In
this model of representing vector data, all the spaghettis are recorded separately for
polygons. But in the .computer they should have the same coordinates.
This is done by explicitly recording adjacency information into the basic logical entity in
topological data structures, beginning and ending when it contacts or intersects another
line, or when there is a change in the direction of the line.
Each line then has two sets of numbers: a pair of coordinates. and an associated node
number. The node is the intersection of two or more lines, and its number is used to refer
to any line to which it is connected. In addition, each line segment, called a link, has its
own identification number that is used as a pointer to indicate the set of nodes that
represent its beginning and ending polygon. These links also have identification codes that
relate polygon numbers to see which two polygons are adjacent to each other along its
length. In fact, the left and right polygon are also stored explicitly, so that even this tedious
step is eliminated.
The topological models available in GIS ensure (a) that no node or line segment is
duplicated, (b) that line segments and nodes can be referenced to more than one
polygon, and (c) that all polygons can be adequately represented.
To understand the topological vector data structure, let us consider a network with 8
nodes encoded as n1 to n8. The links