Geography Project
Geography Project
SOURCES OF DATA
(A) Primary Data
Primary data is that type of data which provide first- hand evidence or data about an
organization, event or any person at work. People collect the data by going into a field and data
are collected for the very first time by individuals groups of individuals, institutions or
organization.
Sources of Primary Data:-
1. Personal Observation : - Personal observation refers to that type of observation through
which there is a direct observation in the field and is collected by individual and group of
individuals. Through a field survey, information about the relief feature, drainage patterns,
types of soil and natural vegetable, as well as population structure, sex, ratio, literacy rate,
means of transport and communication, urban and rural settlements etc., is collected.
2. Interview : Interview is another way of collecting primary data. In this step, there is a presence
of an interviewer and the interviewer asks the questions from an interviewee, it is generally in
oral terms and face – to – face conversation with the person.
3. Questionnaire: In this method, simple question and their possible answer are written on a
plain paper and the respondents have to use tick –mark the possible answers from the given
choices. The objectives of the survey should be clearly mentioned in the questionnaire,
Sometimes, a set of structured question are written and sufficient space is provide in the
questionnaire where the respondents can write their opinions.
4. Schedule: Similar to the questionnaire that contains the questions pertaining to the matter of
investigation is known as the schedule.
5. Other Methods: Apart from the above methods of collecting primary data there are some
other methods through which we can collect the data. For example, the data about the
properties of soil and water are collected directly in the field by measuring their characteristics
using soil and water quality kit; similarly, field scientists collect data about the health of the
crops and vegetation using different kinds of sensors.
(B) Secondary Data
Data collected from any published or unpublished sources are called Secondary data. In
secondary data, people do not go directly into the field for collecting the information rather
than they are dependent on primary data which is already been published or unpublished
Example – Government publications (Census of India) books, journals, etc.
Secondary Source of Data : -
PUBLISHED SOURCES
1. Government publications
The publications of the various ministers and the departments of the government of India, state
governments and the District are one of the most important sources of secondary, information.
These include the census of India published by the office of the register General of India.
Reports of the National sample survey, weather reports of Indian meteorological Department,
and statically abstracts published by state governments and the periodical reports published by
different commission.
2. Semi /Quasi – governments publications
The publications and reports of urban development authorities and municipal corporations of
various cities and towns, Jilaparishad (District councils), etc. fall under this category.
3. International publications
The international publications comprise year books reports and monographs published by
different agencies of the United Nations.
4. Newspapers and magazines.
The daily newspapers and the weekly. fortnightly and monthly magazines serve as easily
accessible source of secondary data.
5. Electronic media
The electronic media especially internet has emerged as a major source of secondary data in
recent times.
6. Private Publications :
Private publications include the year books, surveys, research reports and monographs
published by newspapers and private organization that fall under this category.
UNPUBLISHED SOURCES
1. Government Documents
The unpublished reports, monographs and documents are yet another source of secondary data
for example, the village level revenue records maintained by the patwaries of respective
villages serves as an important source of village-level information.
2. Quasi – Government Records
The periodical reports and the development plans prepared and maintained by different
municipal corporations, District councils, and civil services of village departments are included
in Quasi – Government records.
Tabulation and Classification of Data
The data collected from primary or secondary sources initially appear as a big dumbly of
information with the least of comprehension. This is known as raw data, to draw meaningful
inferences and to make them usable the raw data require tabulation and classification.
One of the simplest devices to summaries and present the data in the statistical table. It is a
systematic arrangement of data in columns and rows.
The purpose of table is to simply the presentation and to facilitate comparisons. This table
enables the reader to locate the purpose of table is to simplify the presentation and to facility
comparisons. This table enables the reader to locate the desired information quickly. The table make
it possible for the analyst to present a huge mess of data in an orderly member within a minimum of
space.
(ii) Percentage/Ratio: -
Some time date are tabulated in a ratio or percentage form that are computed form a common
parameters such as literacy rate or growth rate of population.
15-19 3 3
20-24 3 6
25-29 1 7
30-34 2 9
35-39 3 12
40-44 3 15
45-49 2 17
50-54 2 19
55-59 3 22
60-64 8 30
𝒇 = 𝟑𝟎
FREQUENCY POLYGON
Frequency polygon shows/represents frequency distribution through graph and is capable of showing
two or more than two frequency distribution.
Ogive
Ogives are graphs that are used to estimate how many numbers lie below or above a particular variable of
value in data. To construct an Ogive, firstly, the cumulative frequency of the variables is calculated using a
frequency table. It is done by adding the frequencies of all the previous variables in the given data set. The
result or the last number in the cumulative frequency table is always equal to the total frequencies of the
variables.
Ogive can be constructed by either of the following two methods, Less than & More than ogive.
Less than ogive: The frequencies of all preceding classes are added to the frequency of a class. This series is called
the less than cumulative series. It is constructed by adding the first – class frequency to the second- class frequency
and then to the third class frequency and so on. On plotting these frequencies on a graph paper we get a rising
curve shown as Less than Ogive.
Class Interval (Less than Ogive) Cumulative Frequency (cf)
Less than 20 3
Less than 25 6
Less than 30 7
Less than 35 9
Less than 40 12
Less than 45 15
Less than 50 17
Less than 55 19
Less than 60 22
Less than 65 30
More than ogive : The frequencies of the succeeding classes are added to the frequency of a class. This
series is called the more than or greater than cumulative series. It is constructed by subtracting the first
class, second class frequency from the total, third class frequency from that so on. The upward cumulating
result is greater than or more than the cumulative series. On plotting these frequencies on the graph paper
we get a declining curve which is known as ‘ More than Ogive’
Class Interval (Less than Ogive) Cumulative Frequency (cf)
More than 15 30
More than 20 27
More than 25 24
More than 30 23
More than 35 21
More than 40 18
More than 45 15
More than 50 13
More than 55 11
More than 60 8
Combined ‘Less than ‘ and ‘More than’ Ogives
When ‘less than’ and ‘more than’ Ogives are combined together we get combined ‘less than’ and ‘more
than’ ogive. This is helpful in determine the median of a series.
Class Interval Less Than More Than
15-20 3 30
20-25 6 27
25-30 7 24
30-35 9 23
35-40 12 21
40-45 15 18
45-50 17 15
50-55 19 13
55-60 22 11
60-65 30 8
PRACTICAL 2
Data Processing
Measures of Central Tendency
The statistical techniques used to find out the contra of distributions are referred as measures of central
tendency. The number denoting the central tendency is the representative figure for the entire date set
because it is the point about which items have a tendency to cluster. Measures of Central Tendency are
also known as averages. These are classified as mean, median and the mode.
Mean
The mean is the value desired by summing all the values and dividing it by the value number of observation. It is
the simple arithmetic average of the different values of a variable. Mean can be calculated by direct or indirect methods,
both for grouped and ungrouped data.
Direct Method - The following is to used compute the mean using direct method :
∑𝒙
𝒙=
𝑵
where,
𝒙 = 𝑴𝒆𝒂𝒏
𝑵 = 𝑵𝒖𝒎𝒃𝒆𝒓 𝒐𝒇 𝑴𝒆𝒂𝒔𝒖𝒓𝒆𝒔
Indirect Method
The following formula is used in computing the mean using indirect
∑𝒅
𝒙−𝑨+
𝑵
Where,
A = Subtracted constant
Where
𝒙 = Mean
f= frequencies
x = midpoints of class intervals
N= number of observations (it may also be defined as ∑ 𝒇)
Indirect method
The following formula can be used for the indirect method for grouped data.
The principles of this formula are similar to that of the indirect method given for ungrouped data. It
is expressed as under.
∑ 𝒇𝒅
𝒙=A
𝒏
Where,
A = midpoint
f = frequency
d = deviation from the assumed mean group (A)
N = sum of cases of ∑ 𝒇
i = Interval width
Calculation of Mean Rainfall
∑
X=A +
X= 900+
= 900+ 166.85
X= 1066.85mm.
Computing Mean from Grouped Data
The mean can also be computed for the grouped data using either direct or indirect method. Direct
method: We can calculate mean by direct method using the formula given below.
∑
X=
Where
→ =Mean
f= Frequencies
x= Mid-points of class intervals
N= Number of observations (it may be defined as ∑ 𝑓)
Note : To find out mid- points, add the values of classes and divide it by two.
Question 2 : Compute the average wage rate of mines workers using data given in table below.
Wage Rate 150-200 200-250 250-300 300-350 300-350 400-450
(Rs/Day)
Number of 10 20 30 35 40 10
Workers
Computation of Mean
Classes Frequency(f) Mid- Fx d=x-a(x- Fd U=(x-a) fu
points (x) 325) /h
Where N= ∑ f = 145
A= assumed average
H= difference in class interval
Table 2.2 provides the procedure for calculating the mean for ground data both using direct and
indirect methods.
(i) Formula for Grouped Data (Direct Method )
∑ 𝑓𝑥 45.125
→= = 311.21
𝑁 145
(ii) Formula for Grouped Data (Indirect Method )
∑ 𝑓𝑑 2000
→= = 325 −
𝑁 145
= 325 − 13.79 = 311.21
Median
Median is a positional average. It may be defined “as the point in a distribution with an equal
number of cases or each side of it: The median is expressed using symbol M.
Computing Median for ungrouped data
When the Score are ungrouped. These are arranged or ascending or descending order . Media
can be formed by berating the central observation or value in the arranged series the central value
may be located from either end of the series arranged in ascending or descending order the following
equation is used to compute the median.
Example : Calculate median height of mountains packs in parts of the following Himalayas using the
following
8, 126 m, 8, 611m, 181,m, 8,172,m 8,076m, 8,848, 8,598 m
Computation : Median (M) May be calculated in the following steps :
(i) Arrange the given data in ascending or descending order.
(ii) Apply the formula for locating the central value in the series
Thus Value of ( ) th Piexn
=( ) th itam
=( ) th item
4 th item in the arranged series will be the median.
Arrangement of data in ascending order :
7,817, 8,076, 8,126, 8, 172 , 8,598, 8,611, 8,84
4thitem
Hence , M- 8, 172m
Computing Median for grouped Data
When the scores are ground we have to find the value of the point where an individual or
observation is centrally located. In the group.
It can be computed using the following formula
𝑀 =1+ ( − 𝐶) / 𝑀 =4+ 𝑥
Where
M= Median for grouped data
L= Lower limit of the median class
I = Interval
F= frequency of the median class.
N= Total number of frequencies or number of observation.
C= Cumulative frequency of the pre- median Class .
Example : Calculate the median for the following distribution :
Class 50-60 60-70 70-80 80-90 90-100 100-110
f 3 1 11 16 8 5
Computation of Mediam
Class Frequency Cumdative frequency (f) Calculation of mediam class
50-60 3 3
60-70 7 10
70-80 11 21c
80-90 16 f 37 𝑁
(Median group) 𝑀=
2
90-100 8 45 50
=
2
𝑓 𝑜𝑟 =25
𝑁 = 50
The median is computed from the following steps : -
(i) Cumulative frequencies are obtained first by adding each normal frequency of the following groups.
(ii) ∑ 𝑓 𝑜𝑟 total frequencies are calculated ie N and then ½ is obtained.
(iii) Here N- 50 do N/2 =25 which is more than 21 and less than 37 in the column of cumulative frequency.
(iv ) So, (80-90 ) is the median class, where c.f. 3 f lies.
The frequency of the medium class, where c.f. 3 Lies.
(V) And the cumulative frequency of the free medium class is 21 ( fc).
Then ,
(Medium ) M=4+ 𝑋 1
25 − 21
= 80 + 𝑋 10
16
𝑀
= 80 + 𝑋 10
164
10
= 80 +
4
= 80 + 2.5
= 82.5
Mode
The value that occurs most frequently in a distribution is referred to as mode. It is symbolized
as z or mo. Mode is a measure that is less widely compared to mean and medium there can be more
that one type.
Computing mode for ungrouped Data:-
While computing mode from the given data sets all measures are first arranged in ascending or
descending order. It helps in identifying the most frequently occurring measure easily.
Example: Calculate mode for the following test score in geography for the students.
61, 10, 88, 31, 61, 72, 55, 67, 46, 22
Computation: To Find the mode the measures are arranged in as ascending given below :-
10, 22, 37, 46, 55, 61, 61, 61, 72, 88,
The measures 61 occurring three times in the series is the mode in the given dataset. As no other
number is in the similar way in the data it , it processes the property of being unimodal.
Example 2 = Calculate the mode using a different sample of ten other students , who soured :
82, 11, 57, 82, 08, 11, 82, 95, 41, 11,
Computation: Arrange the given measures in an ascending order as shown below : =
08, 11, 11, 11, 41, 57, 82, 82, 82, 95.
It can easily be observed that measures of 11 and 82 both are occurring these times in the
distribution. The data set, therefore, is bimodal in appearance.
If there values have equal and highest frequency the series is trimmed similarly, a recurrences of
many measures in a series makes it multimodal. However, when there is no measure being repeated
in a series it is design mated as without mode.
Merits of Mode
1. Simple and popular
2. Less effect of marginal values
3. Graphic presentation
4. Best representative
Demerits of Mode
Following are the various demerits of mode.
1. Uncertain and unclear: Mode is an uncertain and unclear measure of central tendency.
2. Not capable of algebraic treatment: Unlike mean, mode is not capable of further algebraic
treatment.
3. Difficult: With frequencies of all items being identical, it is difficult to identify by modal value .
Comparison Between Mean, Median and mode
Mean Median Mode
Mean is the average value that Median is the central value of Mode is the most repetitive
is equal to the ratio of sum of given set of values when value of a given set of value.
values in a data set and total arranged in an order whether
number of values. ascending or descending
Mean =
For example, if we have set of values = 2, 2, 3, 4, 5 then
Mean = ( 2+2+3+4+5)/5=3.2 Median = 3 Mode = 2
Prominence
Global
Peaks Other names and meaning Region
Rank
(metres)
1. Selection of a Suitable Method: Data reflect a variety of topics such as temperature, rainfall, population
growth and dispersion, production and distribution, among others, A suitable graphical method must be used to
display the data’s characteristic Given the variety of ways available, we must select the most appropriate graphical
method. For example, if we want to show the increase of any element, we should use a line graph. When displaying
different components of a single element, a pie chart is the best option. The population density will be depicted by
using a choropleth map.
2. Selection of Suitable Scale : For data representation on diagrams and maps, the scale is utilized as a measure of
the data. As a result, selecting a suitable scale for the provided data sets should be done with care, taking into
account the complete data set to be represented. The scale must not be too big or too small.
3. Design: Graphical presentation of the data must be attractive. Map design must have legibility, visual contrast
figure- ground and balance. These should be carefully shown on a map or diagram.
4. Heading of the Diagram: There should be a title for each diagram. As far as feasible, the heading should be
complete and succinct. There are four elements that must be included in the heading.
(i) The thing depicted in the diagram’s name. (ii) The area’s name.
(iii) The year or period of time. (iv) The data’s unit of measurement.
5. Legend/ Index : A legend or index is an essential part of any graph or map. It describes the map and map/
contents. Furthermore, it must be appropriately positioned. A legend is usually displayed on the map sheet’s lower
left or lower right side.
6. Direction : The maps, being a representation of the part of the earth’s surface need to be oriented to the
directions Hence, the direction symbol, I.e. North, should also be drawn and properly placed on the final map.
7. Accuracy : The accuracy of the diagram Should be carefully considered . In other words, the diagram should be
able to appropriately depict the relevant data.
8. Simplicity : Diagrams should be simple as far as possible so that the reader can understand them properly.
Construction of Diagrams
The diagrams and the maps that are drawn to represent these data related characteristics may be grouped into the
following types : -
Line Graph
The Line graphs are usually drawn to represent the time series data related to the temperature. Rainfall,
Population growth birth rates and the death rates.
Polygraph
Polygraph is a line which two or more than two variables are shown by an equal number of
lines for an immediate comparison such as the. Growth rate of different crops like rice, wheat, pulses
or the birth rates, death rates and life expectancy or sex ratio in different states or countries.
Example :Construct a polygraph to compare the growth of sex ratio in different states.
Table : Sex – Ratio L female per 1000 male ) of selected states – 1961-2011
States/UT 1961 1971 1981 1991 2001 2011
Delhi 785 801 808 827 821 866
Haryana 868 867 870 860 846 871
Uttar 907 876 882 876 898 908
Pradesh
Bar Diagram
The bar diagrams are drawn through columns of equal width it is also called a column
diagram. Following rules should be observed while constructing a bar diagram.
(a)The width of all the bars or columns should be similar .
(b) All the bars should be placed on equal intervals , distance.
(c)Bars may be showed With shows color or patterns to make them distinct and attractive.
Simple Bar Diagram
A simple bar diagram is constructed for an immediate comparison , it is advisable to arrange the given data
set in as ascending or descending order and plot the data variables accordingly. However. Time series data are
represented according to the sequencing of the time period.
Months J F M A M J J A S O N D
Rainfall in 2.3 2.1 3.7 10.6 20.8 35.6 22.3 14.6 13.18 27.3 20.6 7.5
cm
Line and Bar Graph
The line and bar groups as drawn separately may also be combined to depict the data rllated to
some the closely associated characteristics such as the climatic data of mean monthly temperature
and rainfall .In doing so, a simple diagram is drawn in which months are represented on X-axis
while temperature and rainfall data are shown on y- axis at both sides of the diagram.
Example :Construct a line graph and bar diagram to represent the average monthly rainfall and
temperature data of Delhi.
Multiple bar diagrams are constructed to represent two or more than two variables for the purpose of comparison.
Example :Construct a suitable bar diagram to show decadal literacy rate in India during 1951- 2011
When different components are grouped in one set of variable or different variable of one component are put
together, their representation is mode by a compound bar diagram. In this method, different variables are shown in
a single bar with different rectangles.
Example : Construct a compound bar diagram to depict the data as given in table below : -
Pie Diagram is another graphical method of the express of data . It is drawn to depict the total value of the given attribute using
a circle. Dividing the circle into corresponding degrees of angle then represent the subsets of the data. Hence, it is also called as
If the data is given in percentage form, the angles are calculated using the given formula
𝑷𝒆𝒓𝒄𝒆𝒏𝒕𝒂𝒈𝒆 𝒐𝒇 𝑿 𝟑𝟔𝟎
𝟏𝟎𝟎
For example :
A Pie diagram may be drawn to show total population of India along with the proportion of the reualand urban
popular in this case. The circle of an appropriate radio sps drawn to represent the total population and its sub-
division rural and urban population are shown by corresponding degrees o angle.
Example : Represent the data as given in table (a) with a suitable diagram.
Total 100
Table ( b) Calculation for pie – India’s export to Major Regions of the world in 2010-11
Countries % Calculation
Europe 20.2 20.2 x 3.6 = 72.72 730
Africa 6.5 6.5 x 3.6 = 23. 4 230
America 14.8 14.8 x 3.6 =53.28 530
Asia and Asian 56.2 56.2 x 3.6 = 208 .32 2030
Others 2.3 2.3 x 3.6 = 8.28 80
Total 100 360
FLOW MAPS/CHART
These are dynamic diagrams because the phrase flow communicates the impression of movement. The flow of
buses, railroads and other modes of transportation is depicted using these diagrams. They also depict the movement
of people and goods, as well as the flow of water in rivers and canals.
Features of Flow Maps:
(ii) The number of vehicles, people, commodities required, and other items transported from one
place to another.
(i) A map of the selected area is produced, showing transportation lines and notable locations.
(i) Data of people or goods movement from point of origin to point of destination is compiled.
(iii) A suitable scale is chosen to represent the number or quantity to be displayed by a line thickness
proportional to the number or quantity to be displayed.
Example 6: Construct a flow map to represent the number of trains running in Delhi and the adjoining
areas.
Number of trains running between selected routes of Delhi and adjoining areas:
(i) Take an outline map of Delhi and the surrounding areas that include the railway line and nodal stations.
(ii)Choose a scale for the number of trains. The maximum and minimum numbers are 50 and 6, respectively. The
maximum and minimum numbers will be depicted on the map by a strip of 10 mm thick lines and 1.2 mm thick
lines, respectively, if we choose a scale of 1 cm = 50 trains.
(iii) Draw a graph of the thickness of each route strip between the two railway lines .
(iv) Create a legend with a terraced scale and a separate sign or symbol to represent the nodal points(stations)
inside the strip
THEMATIC MAPS:
Graphs and diagrams are excellent for getting a sense of how internal variances within data of various qualities
compare. They frequently fail to establish a regional viewpoint, which necessitates the drawing and comprehension
of regional distributions or spatial variations. Distribution maps are another name for these types of thematic maps.
(i) State/district data pertaining to the theme for which the map is to be created.
(i) Thematic maps must be meticulously designed before being drawn. The finished map should include the
following elements:
Choropleth Maps
Dot Map
Isopleth Map
DOT MAP:
The dot method of showing distribution on a map is the most essential, practical, and straightforward method. This
technique is especially handy for displaying absolute values. On the map, dots rrepresent
epresent the distribution of the
worry. A unique value is assigned to each dot. The dots are placed in accordance with the actual distribution of the
element being displayed. This approach is used to display people, livestock, crops, minerals, and industri
industrial
distributions.
(b) Statistical data for the chosen administrative units on a specific issue, such as total population, cattle, and so on.
(a) The lines dividing administrative units should not be too thick or bold.
(i) Because of its precision, this method outperforms all other methods of displaying distribution. The distribution
of dots resembles the commodity that they have displayed.
(ii) It has a better visual impression and is more useful than the other distribution-display approaches.
(iii) By counting the dots, we may get an approximation of the figures on the map.
(iv) We do not need to consult the index on a regular basis (as in the case of choropleth maps). The dotmap is
simple to read.
(v) Using the multiple dot method, more than one element can be displayed on a single map.
(ii) It is difficult to count and calculate actual figures when there are a lot of dots.
(iv) To illustrate the distribution clearly, the size of the dots must be properly chosen.
Example 7: Construct a dot map to represent population data of 2011 .
Table 3.8: Population of Indian states (as per 2011 Census)
(i) Determine the number of dots in each state using the given scale. For example, number of dots in
(iii) Place the dots in each state as per the determined number in all states.
CHOROPLETH MAP
Choropleth maps are frequently used to show the data properties concerning to the administrative units.
These maps are used to show population density, literacy/growth rates, gender ratios, and so on.
Procedures to be Followed
(b) Divide the data into five categories: very high, high, medium, low and very low. Concentrations in the low and
very low ranges.
(c) The distance between the categories can be calculated using the formula below. Range/5 and Range =
maximum value - minimum value are examples of equations.
(d) Patterns, shades, or colours are used to represent the selected categories. It should be numbered in ascending or
descending order.
Example 8: Construct a choropleth map to represent the literacy rates in India .
S.No. States/ Union territories Literacy S.No. States/ Union territories Literacy
rate(%) rate(%)
1 Andaman & Nicobar 86.63% 19 Madhya Pradesh 69.32%
islands
2 Andhra Pradesh 67.02% 20 Maharashtra 82.34%
3 Arunachal Pradesh 65.38% 21 Manipur 76.94%
4 Assam 72.19% 22 Meghalaya 74.43%
5 Bihar 61.80% 23 Mizoram 91.33%
6 Chandigarh* 86.05% 24 Nagaland 79.55%
7 Chhatisgarh 70.28% 25 NCT of Delhi* 86.21%
8 Dadra & Nagar Haveli* 76.24% 26 Odisha 72.87%
9 Daman & Diu* 87.10% 27 Puducherry* 85.85%
10 Goa 88.70% 28 Punjab 75.84%
11 Gujarat 78.03% 29 Rajasthan 66.11%
12 Haryana 75.55% 30 Sikkim 81.42%
13 Himachal Pradesh 82.80% 31 Tamil Nadu 80.09%
14 Jammu & Kashmir 67.16% 32 Tripura 87.22%
15 Jharkhand 66.41% 33 Uttar Pradesh 67.68%
16 Karnataka 75.36% 34 Uttrakhand 78.82%
17 Kerela 94.00% 35 West Bengal 76.26%
18 Lakshadweep* 91.85%
(B) Data on Literacy in India as arranged in ascending order
S.No. States/ Union territories Literacy S.No. States/ Union territories Literacy
rate(%) rate(%)
1 Bihar 61.80% 19 Gujarat 78.03%
2 Arunachal Pradesh 65.38% 20 Uttrakhand 78.82%
3 Rajasthan 66.11% 21 Nagaland 79.55%
4 Jharkhand 66.41% 22 Tamil Nadu 80.09%
5 Andhra Pradesh 67.02% 23 Sikkim 81.42%
6 Jammu & Kashmir 67.16% 24 Maharashtra 82.34%
7 Uttar Pradesh 67.68% 25 Himachal Pradesh 82.80%
8 Madhya Pradesh 69.32% 26 Puducherry* 85.85%
9 Chhatisgarh 70.28% 27 Chandigarh* 86.05%
10 Assam 72.19% 28 NCT of Delhi* 86.21%
11 Odisha 72.87% 29 Andaman & Nicobar 86.63%
islands
12 Meghalaya 74.43% 30 Daman & Diu* 87.10%
13 Karnataka 75.36% 31 Tripura 87.22%
14 Haryana 75.55% 32 Goa 88.70%
15 Punjab 75.84% 33 Mizoram 91.33%
16 Dadra & Nagar Haveli* 76.24% 34 Lakshadweep* 91.85%
17 West Bengal 76.26% 35 Kerela 94.00%
18 Manipur 76.94%
Steps of Construction
(b) Determine the data range. Bihar (61.80%) and Kerala (94.00%) have the lowest and the highest literacyrates in
the current scenario, respectively. As a result, the range is 94.00-61.80 = 32.2 (or = 32).
(c) Divide by 5 to get categories ranging from very low to very high.
(d) Determine the number of categories as well as each category's range. As before, add 6.0 to the lowest value of
61.80,
(e) Assign shades/patterns to each category ranging from lower to higher hues.
(g) Complete the map with respect to the attributes of map design.
Advantages of Choropleth Map
(i) It makes it simple to see how a measurement changes over time in a certain area.
(ii) They're good for displaying differences in land use, such as how much recreational land is covered.
(i) They create the illusion of a sharp change at the shaded unit's limits.
ISOPLETH MAP
Isopleth refers to an imaginary line that connects points of equal value. Isotherms, isobars, isohyets, isonephs, and
isohels are examples of isopleth maps.
Requirement
(ii) Accurate data on temperature, pressure, rainfall, and other variables throughout a specific time. (iii) Drawing
tools, particularly French curves, etc.
INTERPOLATION
Interpolation is a technique for inserting intermediate values between observed values at two stations or locations,
such as temperature recorded at Chennai and Hyderabad or spot heights at two sites. Interpolation is the process of
connecting sites with the same value by drawing isopleths.
Method of Interpolation
(i) First, figure out what the maximum and minimum ranges are.
(iii) Determine the interval in a whole number, such as 5, 10, 15, etc., based on Range.
Formula of Isopleth: The exact point of drawing an Isopleth is determined by using the following formula:
The interval is the difference between the interpolated value and the real value on the map.
PRACTICAL 4
Spatial Information
Technology
The word spatial is derived from space. It refers to live the features and the phenomena distributed over a
geographically definable space. Thus having physically measurable dimensions. The Spatial information
technology related to the use of the technological inputs in collecting , storing , retrieving , displaying
manipulating, managing and analyzing the spatial , GPS, GIS, Digital Cartography and Database management
systems.
A System for capturing, storing, checking, integrating, manipulating, analyzing and displaying data, which are
spatially reference to the earth.
This is normally considered to involve a spatially reference computer database and appropriate applications
software. It is an amalgamation of computer assisted cartography and database management system and draws
conceptual and methodological from both spatial and allied science such as computer science, statistics,
cartography remote sensing, Database Technology, Geography, Geology, Hydrology ,Resource
The two types of the data represent the geographical information. These are spatial and non – spatial data.
The spatial are characterized by their potential, linear and areal forms of appearances.
Those describe the spatial date are called as Non – Spatial or attract data the spatial data are the most important
pre- requisites a spatial or geographical information system. In a GIS Core, it could be built in several ways. These
are :
The choice of a source of geographical data for a GIS application is, however largely governed by :
For Many users, the most common source of spatial data is topographical or thematic maps in hard copy (paper)
and soft
A definite scale which provides relationship between the map and the surface it resents.
Use of symbols and colour which define attributes of entities mapped, and .
An agreed coordinate system. Which define the location of entities on the earth’s surface.
Limitation of Manual Methods
1. Users can interrogate displayed spatial features and retrieve associated attribute information for analyses.
3. Spatial operations (polygon overlay or buffering) can be applied and integrated database to generate new sets
of information.
9. Different items of attribute data can be associated with on another through shared location code.
COMPONENTS OF GIS
GIS constitutes five key components. These are :
(i) Hardware: The machine that a GIS run on is called hardware. GIS run on is called hardware. GIS
software can now be run on a wide range of hardware form centralized computer servers to stand – alone
on networked desktop computers.
(ii) Software : A GIS must have application software that includes the following functional modules :
(a) Software related to data entry, editing and maintenance.
(b) Software related to analysis/ transformation / manipulation.
(c) Software related to data display and output.
(iii) Data: All GIS software is built to work with spatial data. Information regarding location, linkages with
other elements, and specifics of non-spatial qualities characterize spatial data. Geographical data spatial
reference is critical and should be considered from the start of every GIS project.
(iv) People: Hardware and software engineers as well as resources and environment scientists, policymakers,
and monitoring and implementing agencies, are all GIS users. GIS is used by a diverse group of people to
develop a decision support system and solve real – time challenges, No GIS exists outside of the context of
the company. And humans must constantly be present to develop, execute, and run the system, as well as
make decisions based on the output.
(v) Procedures: Data will be retrieved, entered into the system , saved, managed, transformed, analyzed, and
eventually displayed in a final output, according to the procedures.
A graphical characteristic is represented by a pattern of grids of squares in raster data, whereas vector data is
represented by a set of lines or arcs drawn between specified points in vector data. Consider a diagonal line drawn
on a piece of paper.
This image would be represented as raster file by subdividing the paper into a matrix of small rectangles.
Comparable to the cells on a sheet of graph paper. Each cell in the data file is given a position and a value based on
the attribute at that location . Its row and column coordinates can be used to identify any single pixel . The user
can simply rebuild or visualize the original image using this data format.
The relationship between cell size and the number of calls is expressed as the resolution of the raster .
Applications of Raster File Formats:
The followings are the most common uses raster file formats :
(i) For digital representations of aerial photographs, satellite images, scanned paper maps , etc.
(ii) When the map does not require analysis of individual map features.
(iii) In raster data, each cell contains an attribute that aids in the representation of continuous natural features such
as land use and soil types.
(i) The vast majority of spatial data, whether in the form of topographical maps of thematic maps, is available as line maps
requiring no data conversion.
(iii) The system needs less memory because boundary lines and points are only stored.
The following sequence of the activities are involved in GIS- related work :
(i) Acquiring digital data sets from a data supplies: Digital data, ranging from small –scale maps to large scale
blueprints, is now freely available thanks to modern data sources. Such data are an important source for many local
governments and business organizations, as they liberate users from the costs of digitizing of collecting their own
data.
Although it is appealing and time- saving to use pre – existing, data sets, careful consideration must be given to
data compatibility when data from many sources/ suppliers is merged in on project. Differences in projection, scale
base level and attribute description could cause issues. To guarantee that the data is compatible with the
programme on a practical level. Users must consider the following qualities of the data:
(ii) Creating digital data sets by manual input : The manual input of data to a GIS involves four main stages :
(a) Entering the spatial data . (b) Entering the attribute data.
The most common ways of inputting spatial data into a GIS are digitization and scanning.
(a) Digitization: Geographic data is represented by points, lines, and/or polygons defined by a set of coordinates in
the entity model. These can be found by using the map’s geographic referencing systems. The usage of digitizers
and scanners reduces the amount of time and effort required to write down coordinates.
(b) Scanners: These are used to transform analogue data from scanners into digital grid – based images . They aid
in the acquisition of spatial data by converting a line map to high – resolution raster images that can be used
directly or processed further to provide vector topology. Following are two types of scanners.
Attribute data obtained from sources such as public records, official censuses, primary surveys, or spreadsheets can
be manually entered or imported into a GIS database using a common transfer format.
The following are the types of errors that can occur during the capture of spatial and attribute data:
(a) Spatial data are incomplete or double: Omissions in the input of points, lines, or polygons/ area of manually
entered data result in spatial data that is incomplete. Omissions in scanned data normally appear as gaps
between lines. Indicating that the raster vector conversion process failed to combine all parts of a line.
(b) Spatial data at wrong scale: The digitizing at the wrong scale produce input spatial data at a wrong scale. In
scanned data, the problems usually arise during the geo – referencing process when incorrect values are used.
(c) Spatial data are distorted: If the base maps used for digitizing aren’t scaled right. The spatial data may be
affected as well. Because of lens distortions. Relief, and till displacements, aerial pictures, in particular, are
characterized by erroneous scale, Most GIS software has editing and updating functions that can be used to
remedy these issue. The procedure is lengthy and engaging and it can take longer than the data entry itself.
Minor location errors in a vector database may be corrected by moving the spatial entity through the screen
cursor. In some GIS computer commands may be used directly to move, rotate, erase, insert, stretch or
truncate the graphical entities are required. Where excess coordinates define a line these may be removed
using ‘weeding’ algorithms. Attribute values and spatial errors in raster data must be corrected by changing
the value of the faulty cells. Once the spatial errors have been corrected. The topology of vector line and
topology of vector line and polygon networks can be generated.
(i) Overlay analysis operation Overlay operations are a defining feature of GIS. An essential analysis
function is the integration of many layers of maps utilizing overlay technique. To put it another way.
GIS allows you to combine two or more thematic layers of maps from the same area to create a new
map layer.
GIS overlay procedures are comparable to sieve mapping, which involves. Overlaying tracings of maps on
light table to make comparisons and generate on output map.
Use of map overlay is:
(a) A map overlay can be used to compare and contrast changes in land use/cover two time periods and
analyses land transformations.
(b) By overlaying the two maps, the changes in urban land use can be shown, and the urban sprawl may be
visualized through time.
(c) Overly analysis is also important in determining if a particular land use is suitable for prospective land
uses.
(ii) Buffer Operation: A buffer of a certain length can be constructed along a point, a line, or an area. It
can be used to locate areas/ populations that have access to or have been refused access to facilities
and services such as schools, colleges, universities, hospital, post offices, roads, parks and so on. It
can also be used to investigate the consequences of point sources of pollution such as air, water, land,
and noise
Arc View /Arc GIS, Geometric, and all other GIS software give point line, and area attributes using
proper software commands. Buffers of 2, 4, 6, 8, 10 km. can be created around cities with large health
or education centers. In general, neighborhoods close to these institutions received better service than
those further away.
_________________________________________________________________________________________