Unit 2
Unit 2
2.1 Introduction
2.4 Assumptions
2.1 INTRODUCTION
Population study is dynamic in nature of research and population data is
changing over the period of time. Population data are providing by either
census operation or by periodical survey. Since most of the modern-day
measurements are discrete in nature, we normally do not have an analytic
function to demonstrate these set of measurement and interpolation and
extrapolation are useful to find that function. Interpolation is an important
statistical technique to estimate the unknown value from known given values
whereas extrapolation is the process of calculating unknown values beyond
the given data points. The central idea here is to find an analytic function that
passes through given points to interpolate or extrapolate for any arbitrary
point. For instance, we are given the steel production figures of a country for
the years 1931, 1941, 1951, 1961, 1971, 1981, 1991, 2011 etc., then to find
125
Estimation of out the production figure for any year falling between 1931 and 2011 can be
Population termed as interpolation. If we compute the production figure for any year
outside the interval 1931 to 2011 it will be termed as extrapolation. Keep in
mind that population is in a constant state of change, both in terms of size and
characteristics. We know that periodic census enumeration obtain data on the
size and composition of the population at the time of census. For many
practical purposes, it is important to know the number and characteristics of
people during the period between the two censuses and even after the last
census. That is why population interpolation and extrapolation are important
in population studies.
Definition of Interpolation
Based on the definition we can say that interpolation and extrapolation are
the same techniques, but interpolation is related to the past trends whereas
extrapolation gives us the future trends of population figures or forecast or
number of events. Following objectives of interpolation and extrapolation-
b) Check your answer with possible answers given at the end of the
unit.
127
Estimation of One of the easiest ways to understand these differences is to understand the
Population prefix of each term. Extra- refers to "in addition to," while inter- means "in
between." Thus, extrapolation indicates a user is trying to find a value in
addition to existing values, while interpolation means that they want to
determine a new value in between existing value. The basic differences
between these two terms are given in above table.
2.4 ASSUMPTIONS
Despite the fact there are number of dependable methods of interpolation and
extrapolation to find out most possible values of the past and future but even
then, there are certain conditions that need to be met to get reliable outcomes.
Such as-
We can say that interpolation and extrapolation are the population estimates
and population projection techniques. If the situation is somewhat different
for post-census estimates; that is in the case of estimates for dates subsequent
to the most recent census for which data is available. First, there is no later
census to indicate the upper limit of the range within which the true figures
usually lie. Second, post census estimates most commonly are made for dates
so recent that vital statistics and other data bearing on population change
during the period since the last census may not yet be available for the entire
period.
For example, suppose we have a regression line y = 30x + 40. We know that,
to produce this "best-fit" line, the value of x must between 0 and 10. Suppose
we choose x = 6. Based on this best-fit line and equation, we can estimate the
value of y as the following:
y = 30(6) + 40 = 220
Our x value (6) is within the range of acceptable x values used to make the
line of best fit, so this is a valid y value, which we have calculated by
interpolation.
For example, suppose a data set consists of four given values: 1, 3, 5 and 7. If
these values were plotted on a graph and the line was expected to continue in
the same way, the fifth value could be extrapolated as 9.
In curve fitting problems, the constraint that the interpolant has to go exactly
through the data points is relaxed. It is only required to approach the data
points as closely as possible (within some other constraints). This requires
parameterizing the potential interpolants and having some way of measuring
the error. In the simplest case this leads to least squares approximation.
b) Check your answer with possible answers given at the end of the
unit.
Built on the Binomial Theorem, this method is easy to understand and apply.
The basic requirement of this particular method is that the intervals of
dependent variables must be equal throughout the data and missing value, or
values of the independent variables must correspond to these intervals.
For instance, this method is applicable when the data looks like this-
Marks obtained 10 20 30 40 50
Number of students 5 23 ? 45 75
o Then find ∆ =0 or 1 = 0.
o To get another missing value, use the same equation by doing small
alteration that is raising y suffix by 1.
Problem: Estimate the steel production for the year 2001 and 2005 from the
following data.
132
Solution: Let X be the year and Y be the steel production. Interpolation and
Extrapolation using
Growth Rate Methods
X 1995 1997 1999 2001 2003 2005
Y 20 (y0) 40 (y1) 70 (y2) ? (y3) 130 (y4) ? (y5)
The number of known values is n= 4.
∆ =0 or 1 =0
1 = 0.
=410 – 4y3 = 0
= -4y3 = -410
=y3 = 102.5
y5 – 145 = 0
y5 = 145.
133
Estimation of I- Finding the average increment per year by dividing the total
Population increase in population by the number of year between the last two
censuses.
Population in 2010 if the population growth rate had been constant from 2001
to 2011:
the same procedures are used for extrapolation as for interpolation. The only
difference between the two, as explained earlier, is that interpolation in
between two known values and extrapolation is for years later than the period
between the two known values.
2.7.5 Forecasting
Extrapolation is the best possible tool for forecasting. All economic planning,
policy formulations etc. are nothing but future estimations. The methods
provide scientifically based formulae for predictions and forecasting.
b) Check your answer with possible answers given at the end of the
unit.
135
Estimation of
Population
2.8 LET US SUM UP
In this unit we have learnt that interpolation and extrapolation are very
powerful statistical tools that are useful for estimating missing number in a
given data series and forecasting a number outside a given data points. We
also discussed the basic concepts, assumptions and methods involved in
interpolation and extrapolation, followed by their real-world application in
various fields. We have therefore dealt with the subject adequately. Solving
an interpolation problem leads to a problem in linear algebra where we have
to solve a system of linear equations. Using a standard monomial basis for
our interpolation polynomial we get the very complicated Vander monde
matrix. By choosing another basis, the Newton basis, we get a system of
linear equations with a much simpler lower triangular matrix which can be
solved faster.
Population Growth Rate (1991 to 2001 Census) =0.1*LN (Population 2001/Population 1991)
137