Term Project - Stats 1E
Term Project - Stats 1E
Term Project - Stats 1E
Analysis of Health
Budget and Life
Expectancy in
Countries
Statistics for Decision-Making
End-Term Project
Group 1-E
Section 1
Table of content
The US spends more, so why did the US face difficulties during COVID-19?
But is government spending the correct metric to judge? How about the percent contribution from GDP?
India spent 2.1% of its GDP on healthcare in 2022-23, while the US spent 16%
So, is there any relation between the percent GDP contribution to a country’s health and its
citizens? How about a statistical analysis?
Data Extraction
To analyse the above question, we considered the two parameters:
Life expectancy & % contribution of GDP to the healthcare system by a country.
Life expectancy shows the health of a country in a broader sense and considers all factors
related to the health system.
We extracted data from World Health Organisation open datasets to determine a country's
life expectancy year.
https://www.who.int/data/gho/data/indicators/indicator-details/GHO/life-expectancy-at-birth-(years)
We received data for the past 15 years about life expectancies in different countries.
We extracted data for the spending on healthcare by a country from World Bank Datasets.
https://data.worldbank.org/indicator/SH.XPD.CHEX.GD.ZS
The data for the same past 15 years was collated for it also.
Data Cleaning
Data received was not complete and was very vast for our analysis.
Incomplete datasets were not considered for our Analysis.
In our sample, we have taken 159 countries.
We considered 2012, 2013, and 2014 for our analysis.
Why so?
These years had no impact of COVID or any other major outbreak in
the world
An unbiased analysis between Life expectancy and percent spent of GDP
was possible.
Data Visualization
As evident from the increasing mean, the average life expectancy rose in these years globally.
The kurtosis across the years is negative, indicating a flatter bell curve closer to the normal distribution.
The negative skewness shows the left skewed data that is more data is concentrated towards the higher end of Life
expectancy.
The increase in maximum from 2012 to 2014 shows the shift towards higher life expectancy with the advent
of technology and growth in medicinal science.
Descriptive Statistics(Health Budget)
The average global percent contribution of GDP was highest in 2013 and shows a generally increasing trend.
The negative kurtosis- data in the distribution has fewer extreme values and is more spread out than a normal
distribution.
The skewness is positive; more global countries do not contribute much of their GDP to healthcare expenditure.
Regression Analysis is done between life expectancy and Percent contribution of GDP towards
health expenditure across the three years to find the degree of relatedness of the variables
Regression Analysis
Anirban Roy 2023PGP005
Mayank Sharma 2023PGP036
Ayesha Fatima 2023PGP017
Soumee Guha 2023PGP059
Mahima Barate 2023PGP035
Thank You
Himanshu Dixit 2023PGP025
Vartika Razathani-2023PGP067