Sample Assignment Project
Education Index
1. Introduction
1.1 Education Index Defined
The Education Index is a component of the Human Development Index (HDI), published every year
by the United Nations Development Programme (UNDP). The UNDP derives the Education Index from
two measures: the mean years of schooling for adults aged twenty-five and over, and the expected
years of schooling for children of school-entering age. The Education Index ranges from 0, the
lowest score a country can receive, to 1, the highest.
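The report does not state how the two schooling measures are combined. The sketch below assumes the convention described in the UNDP technical notes: each measure is divided by a maximum (15 years for mean schooling, 18 years for expected schooling) and the two results are averaged. The function name, parameter names and the example inputs are illustrative only.

```python
def education_index(mean_years, expected_years, max_mean=15.0, max_expected=18.0):
    """Average of the two normalized schooling measures (assumed UNDP convention)."""
    # Assumption: each measure is capped at its maximum before normalizing.
    mean_index = min(mean_years, max_mean) / max_mean
    expected_index = min(expected_years, max_expected) / max_expected
    return (mean_index + expected_index) / 2


# Hypothetical country: 12 mean years and 15 expected years of schooling.
print(round(education_index(12.0, 15.0), 3))  # 0.817
```

Under these assumptions the result always lies between 0 and 1, which matches the range given above.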
2. Comparative Analysis
2.1 Descriptive Statistics
The table above compares the Education Index values of the four geographic regions. Of all the
measures presented, we are particularly interested in comparing the values of the:
1. Mean
2. Median
3. Standard Deviation
4. Skewness
5. Coefficient of Variation
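The five measures listed above can be reproduced outside Excel. The following is a minimal sketch, assuming the sample (n - 1) standard deviation and the adjusted sample skewness formula that Excel's SKEW function uses. The sample values are hypothetical and are not taken from the report's data.

```python
import statistics


def describe(values):
    """Return the five descriptive measures for a list of at least 3 numbers."""
    n = len(values)
    mean = statistics.mean(values)
    median = statistics.median(values)
    std_dev = statistics.stdev(values)  # sample standard deviation (n - 1)
    # Adjusted sample skewness: n / ((n - 1)(n - 2)) * sum of cubed z-scores.
    skewness = (n / ((n - 1) * (n - 2))) * sum(
        ((x - mean) / std_dev) ** 3 for x in values
    )
    # Coefficient of variation: spread relative to the mean.
    cv = std_dev / mean
    return {
        "mean": mean,
        "median": median,
        "std_dev": std_dev,
        "skewness": skewness,
        "cv": cv,
    }


# Hypothetical Education Index values for one region.
sample = [0.42, 0.48, 0.51, 0.55, 0.60, 0.63, 0.71]
for name, value in describe(sample).items():
    print(f"{name}: {value:.4f}")
```

The coefficient of variation is the most useful of the five for comparing regions with different means, because it expresses the standard deviation as a fraction of the mean.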
2.2 Histograms
2.2.1 Americas
Figure 1: Histogram of Education Index, Americas
2.2.2 Africa
Figure 2: Histogram of Education Index, Africa
The histogram above indicates that the distribution of Education Index values across African
countries is approximately symmetric, or very slightly positively skewed. More than half of the
countries have an Education Index below 0.5, which suggests that most of Africa has poor
learning infrastructure in place. No country in Africa has an Education Index above 0.8. This
means that even the best education systems in Africa fall short of the typical standard in
Europe.
2.2.3 Asia
Figure 3: Histogram of Education Index, Asia
At first glance, the distribution above looks heavily negatively skewed. However, the mean lies
at the center of the 0.6-0.7 class interval, so the distribution is approximately symmetric,
although less symmetric than those of the Americas, Africa and Europe. The modal class is
0.6-0.7, which contains 12 countries, and no country has an Education Index greater than 0.9.
There are, however, many countries that deliver subpar education: 14 out of 45 countries have
an Education Index of less than 0.5, which is a poor score.
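The class-interval counts quoted in this section come from grouping values into bins of width 0.1. The following is a minimal sketch of that grouping, assuming each class includes its lower bound and excludes its upper bound. The input values are hypothetical.

```python
import math
from collections import Counter


def bin_counts(values):
    """Count values per class of width 0.1, keyed by the lower class bound."""
    counts = Counter()
    for value in values:
        # round() guards against floating-point noise at class boundaries.
        index = math.floor(round(value * 10, 9))
        counts[index / 10] += 1
    return dict(sorted(counts.items()))


# Hypothetical Education Index values.
sample = [0.35, 0.48, 0.62, 0.65, 0.68, 0.71, 0.83]
for lower, count in bin_counts(sample).items():
    print(f"{lower:.1f}-{lower + 0.1:.1f}: {count}")
```

The class with the largest count is the modal class. It does not necessarily contain most of the observations.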
2.2.4 Europe
Figure 4: Histogram of Education Index, Europe
The x-axis of the histogram for European countries shows that Europe has the best education
systems of the four regions analyzed in this report. The distribution is seen to be
approximately symmetric. No country in Europe has an Education Index lower than 0.7, and 12
countries have excellent education systems, with an Education Index of more than 0.9.
2.3 Box and Whisker Plot
Figure 5
A box and whisker plot is a convenient way to compare data sets visually on a common scale.
The diagram above shows Europe leading in quality of education, followed by the Americas, Asia
and finally Africa. The plot also displays measures such as the mean, the median and the
interquartile range of each data set. Asia has the greatest interquartile range, which is
consistent with it having the highest standard deviation for Education Index. Africa follows
Asia in second place in this regard. The data for Asia is also the least symmetric of the four
geographic regions. The Americas and Africa have approximately symmetric distributions, as
shown by the similar distances from the median to the first and third quartiles. It is
interesting to note that the Americas also have outliers. This is because the range of this
data is much larger than the interquartile range, and the distribution has a large, sharp peak
at its center.
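The report does not say which rule was used to mark outliers on the plot. The sketch below assumes the common convention of flagging any value more than 1.5 interquartile ranges beyond the first or third quartile, and assumes the "inclusive" quartile method. Both are assumptions, and the sample values are hypothetical.

```python
import statistics


def box_plot_summary(values):
    """Quartiles, interquartile range and outliers under the 1.5 x IQR rule."""
    # Assumption: quartiles are interpolated with the "inclusive" method.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    outliers = [v for v in values if v < lower_fence or v > upper_fence]
    return {"q1": q1, "median": median, "q3": q3, "iqr": iqr, "outliers": outliers}


# Hypothetical region with a tight central cluster and one low value.
sample = [0.30, 0.66, 0.68, 0.70, 0.71, 0.72, 0.74, 0.76, 0.78]
print(box_plot_summary(sample))
```

A data set with a narrow box and a wide overall range, as described for the Americas, is the case in which this rule is most likely to flag points.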
3. Regression Analysis
In this section of the report, the relationship between the Education Index of a country and
two other variables (one at a time) is investigated, and two separate models are formulated
and tested. The two variables are HDI (Human Development Index) rank and GDP (Gross Domestic
Product) per capita. It is important to note that Education Index values, HDI ranks and GDP
per capita figures are all taken from the year 2019.
Regression Statistics
Multiple R           0.952703903
R Square             0.907644727
Adjusted R Square    0.907116983
Standard Error       0.052802642
Observations         177

ANOVA
             df     SS             MS             F             Significance F
Regression   1      4.795164905    4.795164905    1719.85662    1.89885E-92
Residual     175    0.487920824    0.002788119
Total        176    5.283085729

             Coefficients   Standard Error   t Stat          P-value        Lower 95%       Upper 95%       Lower 95.0%     Upper 95.0%
Intercept    0.939498046    0.007885565      119.1414981     1.8689E-169    0.923934996     0.955061096     0.923934996     0.955061096
HDI Rank     -0.00299486    7.22155E-05      -41.47115407    1.89885E-92    -0.003137385    -0.002852334    -0.003137385    -0.002852334
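The coefficients and R Square in the output above were produced by Excel. As a rough illustration of what such a fit computes, the sketch below implements ordinary least squares for a single predictor. The function name and the four data points are hypothetical and are not the report's data.

```python
def fit_line(xs, ys):
    """Least-squares slope, intercept and R-squared for one predictor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squares and cross-products about the means.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # R-squared: share of the total variation that the line explains.
    ss_total = sum((y - mean_y) ** 2 for y in ys)
    ss_residual = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    r_squared = 1 - ss_residual / ss_total
    return slope, intercept, r_squared


# Hypothetical (HDI rank, Education Index) pairs.
ranks = [1, 50, 100, 150]
indices = [0.93, 0.80, 0.62, 0.50]
slope, intercept, r_squared = fit_line(ranks, indices)
print(f"slope={slope:.5f} intercept={intercept:.4f} r_squared={r_squared:.4f}")
```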
The Regression Line:
3.1.2 Interpretation
The R-squared value is 0.907644727. Hence, approximately 90.76% of the variation in Education
Index is explained by HDI rank in 2019. The square root of R-squared is 0.9527, and the
gradient of the regression line is negative, so the correlation coefficient is -0.9527. This
is a very strong, negative linear correlation. Hence, HDI rank is a very good predictor of
Education Index.
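The step from R-squared to the correlation coefficient can be written out as a short check. This shortcut holds only for a regression with a single predictor, where the correlation coefficient is the square root of R-squared carrying the sign of the gradient. The function name is illustrative.

```python
import math


def correlation_from_r_squared(r_squared, slope):
    """Square root of R-squared, signed to match the slope (one predictor only)."""
    return math.copysign(math.sqrt(r_squared), slope)


# R Square and the HDI Rank coefficient from the regression output.
r = correlation_from_r_squared(0.907644727, -0.00299486)
print(round(r, 4))  # -0.9527
```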
The equation of the regression line is:
y = -0.003x + 0.9395
This implies that for every one-unit increase in HDI rank (which corresponds to one step down
the HDI list) in 2019, the predicted Education Index of a country falls by 0.003 units. A
country with an HDI rank of 0 in 2019 would have a predicted Education Index of 0.9395. This
intercept has no practical interpretation, since the ranking starts at position 1.
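To make the interpretation concrete, the regression line can be evaluated at a few ranks. The sketch below uses the rounded coefficients from the equation above. The function name and the chosen ranks are illustrative.

```python
def predict_education_index(hdi_rank, slope=-0.003, intercept=0.9395):
    """Predicted Education Index from the fitted line y = -0.003x + 0.9395."""
    return intercept + slope * hdi_rank


# Illustrative ranks; rank 1 is the highest position on the HDI list.
for rank in (1, 50, 100, 150):
    print(rank, round(predict_education_index(rank), 4))
```

Each additional 100 places down the list lowers the predicted Education Index by 0.3, which is 100 times the gradient.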
Elasticity of Education Index with respect to HDI rank
= Gradient x Average of all HDI Ranks /Average of all Education Indices
= -0.003 x 94.3559322/0.656915254
= -0.431
Thus, a one percent increase in the HDI rank of a country is associated with a decrease in
Education Index of 0.431% on average.
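The elasticity above is evaluated at the sample means, so it can be checked with a single line of arithmetic. The function name is illustrative; the inputs are the figures quoted in the calculation.

```python
def elasticity_at_means(slope, mean_x, mean_y):
    """Point elasticity of y with respect to x, evaluated at the sample means."""
    return slope * mean_x / mean_y


# Gradient, average HDI rank and average Education Index from the report.
hdi_elasticity = elasticity_at_means(-0.003, 94.3559322, 0.656915254)
print(round(hdi_elasticity, 3))  # -0.431
```

Because it is evaluated at the means, this figure describes a typical country. The elasticity differs at other points along the line.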
ANOVA
             df     SS             MS             F              Significance F
Regression   1      1.829862862    1.829862862    92.73250325    7.03888E-18
Residual     175    3.453222867    0.019732702
Total        176    5.283085729

                        Coefficients   Standard Error   t Stat         P-value        Lower 95%      Upper 95%      Lower 95.0%    Upper 95.0%
Intercept               0.589326297    0.012678606      46.48194885    2.1278E-100    0.564303643    0.614348951    0.564303643    0.614348951
GDP per Capita/ $1000   0.004417328    0.000458716      9.629771713    7.03888E-18    0.003512001    0.005322655    0.003512001    0.005322655
The Regression Line:
3.2.2 Interpretation
The R-squared value is 0.3464. Hence, approximately 34.64% of the variation in Education Index
is explained by GDP per capita in 2019. The square root of R-squared is 0.589, and the
gradient of the regression line is positive, so the correlation coefficient is +0.589. This
is a moderate, positive linear correlation.
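The R-squared value quoted here can be recovered from the sums of squares in the ANOVA table for this model. The sketch below is a consistency check on the figures in the table, with variable names chosen for illustration.

```python
# Sums of squares and degrees of freedom from the ANOVA table.
ss_regression = 1.829862862
ss_residual = 3.453222867
df_regression = 1
df_residual = 175

# R-squared is the explained share of the total sum of squares.
ss_total = ss_regression + ss_residual
r_squared = ss_regression / ss_total

# F is the ratio of the regression mean square to the residual mean square.
f_statistic = (ss_regression / df_regression) / (ss_residual / df_residual)

print(round(r_squared, 4), round(r_squared ** 0.5, 3), round(f_statistic, 2))
# 0.3464 0.589 92.73
```

With a single predictor, the F statistic also equals the square of the t statistic for the gradient (9.629771713 squared is approximately 92.73).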
The equation of the regression line is:
y = 0.0044x + 0.5893
This implies that for every $1000 increase in GDP per capita in 2019, the predicted Education
Index of a country increases by 0.0044. A country with a GDP per capita of $0 in 2019 (the
lowest possible value) would have a predicted Education Index of 0.5893. This is not
practically possible, as any country that exists must have some Gross Domestic Product.
Elasticity of Education Index with respect to GDP per Capita in Thousands of Dollars
= Gradient x Average of GDP per Capita in Thousands of Dollars /Average of all Education
Indices
= 0.0044 x 15.3008692/0.656915254
= 0.102
Thus, a one percent increase in the GDP per capita of a country is associated with an increase
in Education Index of 0.102% on average.
4. Conclusion
From the analysis conducted in this report, Europe has the best education systems of all the
geographic regions analyzed, with not only the greatest average Education Index but also the
least variation. Africa, on the other hand, is a long way from establishing good education
systems, for the opposite reasons. The Americas and Asia were about average in their
performance, which may be due to the diversity present in those regions.
The regression analysis showed that Education Index and HDI rank are very strongly negatively
correlated: countries placed higher on the HDI list (lower rank numbers) tend to have higher
Education Index values. One might expect GDP per capita to greatly affect Education Index, but
the relationship found here is only moderate. Income inequality may be one reason for this.
Other factors such as political stability, investment and health systems may also play a big
role in determining how educated a country is.
5. Data Sources and Software Used
HDI rank and Education Index data for 2019 has been gathered from the United Nations:
http://hdr.undp.org/en/data#
List of countries and corresponding continents has been taken from:
https://github.com/dbouquin/IS_608/blob/master/NanosatDB_munging/Countries-Continents.csv
GDP per Capita data for 2019 has been obtained from:
https://unstats.un.org/unsd/snaama/Downloads
HDI (Human Development Index) Definition:
https://databank.worldbank.org/metadataglossary/africa-development-indicators/series/UNDP.HDI.XD#:~:text=The%20Human%20Development%20Index%20(HDI,a%20decent%20standard%20of%20living.
GDP per Capita Definition:
https://databank.worldbank.org/metadataglossary/statistical-capacity-indicators/series/5.51.01.10.gdp#:~:text=Metadata%20Glossary&text=Long%20definition-,GDP%20per%20capita%20is%20the%20sum%20of%20gross%20value%20added,GDP%20data%20in%20local%20currency.
Data analysis has been carried out using Microsoft Excel.