0% found this document useful (0 votes)

47 views16 pages

Income Prediction Analysis

This document presents a statistics project report focused on income prediction analysis in the US, examining factors such as education, gender, race, and experience that influence wages. The authors utilize both linear and logistic regression models to analyze a dataset from the US Bureau of Labor Statistics, aiming to identify significant predictors of income and trends in earnings. The findings suggest that education is the most impactful predictor, while the logistic regression model is preferred for its classification capabilities in predicting income levels.

Uploaded by

cbsiva2003

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

47 views16 pages

Income Prediction Analysis

Uploaded by

cbsiva2003

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/378152705

STATISTICS PROJECT REPORT

Thesis · October 2023

DOI: 10.13140/RG.2.2.33401.24162

CITATIONS READS
0 1,745

3 authors, including:

Yashkumar Kalariya
Mercer University
36 PUBLICATIONS 0 CITATIONS

SEE PROFILE

All content following this page was uploaded by Yashkumar Kalariya on 13 February 2024.

The user has requested enhancement of the downloaded file.

Benita Dadson, Yash Kumar Kalariya, Sai Vamshidhar Reddy Salla

Dr. Arnab Nayak

BDA 610: Advanced Business Statistics

October 14, 2023

Income Prediction Analysis

1 Introduction

Income in the US is determined by several factors including age, sex, occupation, and
educational status. Individual decisions may affect a person's earnings in the US. For example,
whether a person chooses to receive a Master's or a Doctorate degree can affect the income level
of the individual. Having a good understanding of the determining factors to earning a high
income is crucial for informed decision

In this research paper, we will explore the variables that predict wages for an adult in the labor
force. We will be highlighting certain factors (independent variables) such as years on education,
sex, race, age, and hours per week. The dependent variable from our dataset will be income.

We will also examine which of the independent variables has a more significant impact in
determining income levels in the US and identify any existing trends in income based on the
factors listed.

2 Literature Review

Much research across different fields has been conducted to study the relationship between
income and race, gender, marital status, skillset and many more. Predicting a person's income
has many benefits including determining the right educational and career paths or forecasting an
individual's financial situations. The prediction model we will be developing for this research
will assess the variation and the significance of the variables that affect income levels while
predicting individual potential earnings for informed decision making.
The goal of the data sources is to examine trends in earnings based on several factors including,
age, gender, race, educational level, etc. We have found supporting research and articles that
identify the difference in earnings between men and women. According to Blau, F. D., & Kahn,
L. M., 2017, there is a clear pay gap between men and women in society. From their research,
they highlight gender being an influencing factor in finding income and earnings in society.

The dataset supplies a detailed income prediction and the variables that affect it. However, with
most datasets, there is evidence of limitations which may call into question the accuracy of the
proposed model. For example, there are variables that highly correlate with each other. 'Age` and
`Experience` is a perfect example of this. As an individual ages, the chances of him/her gaining
more experience that may lead to an increase in wages are high.

However, the Income dataset does not account for every occupation in the US that earns more or
less than income. For example, minimum wages vary across different states in the US which the
dataset does not account for. In most datasets, there will be some biases detected. Although the
model has been proven to be effective in showing the relationship or the trends between income
and gender, and educational attainment, it is important that once there are omitted variables, it
leaves room for biases in the prediction model. (Analytics Vidhya, 2022)

3 About the Dataset

The dataset that we will be using in this project originated from the US Bureau of Labor
Statistics. It contains 9799 observations with 23 variables. The dataset is collected from a survey
that highlights the factors that may affect an individual's wage in the US. Some of these factors
include hours worked, educational and work experience, age, location, race, and marital status.
The summary statistics of the dataset is described in table 1.

3.1 Data Dictionary

We have a total of 23 variables in this dataset. 22 of these are factors that affect wages in the US
according to the Current Population Survey.

The data dictionary of the variables in this dataset is in Table 2.

3.2 Data Visualization

According to Jacob A. Mincer, (1974), there’s evidence of a correlation between higher
education and higher lifetime income earnings. He uses machine learning techniques and logistic
regression to develop a statistical summary and analysis of education and the impact it has on
earnings. He explores the growth of the human capital through the educational experience of
people in the labor force (Mincer, 1974). This scatterplot in Figure 1 illustrates the relationship
between wages and education. From the regression line, it shows an upward slope. We can
conclude from the diagram that an increase in educational years will lead to an increase in
wages. This shows the relationship between wage and education

These data visualization shows the Distribution of wage, age, educational experience, work
experience and family income. The purpose of creating distribution charts is to visualize the
distribution of the variables to gain a proper understanding of them for informed decision
making.

The data visualization in Figure 2 shows the Distribution of wage, age, educational experience,
work experience and family income. The purpose of creating distribution charts is to visualize
the distribution of the variables to gain a proper understanding of them for informed decision
making. Each chart represents the distribution of the variables, wage, age, experience, and family
income

We also plotted the plots for the binary variables in our dataset. The plots in Figure 3 help us
further visualize our dataset. It gives an illustration and count of individuals in the labor force of
different races (i.e., Asian, black, white), the gender count (i.e., no. Of people that identify as
male or female), hours worked, the count of married individuals, number of children, married or
unmarried individuals and the various locations of the individuals used in the sample of our
dataset.

4 Data Cleaning Process

The above data did not include any missing or ’N/A’ variables. However, for this project, we
may exclude a few of the variables that have been listed as factors that influence wages in the
US. For example, the information on Medicaid, Insurance and Medicare can be excluded from
the dataset. We can analyze the data to see if there are indeed any missing variables. We will also
analyze the dataset to see if there are any existing outliers that may affect the accuracy of our
model. Figure 4 shows an Illustration of the Continuous Variables in the Dataset for Outlier
Detection

5 Income Prediction Analysis

Can we predict income based on our Predictor Variables in this Dataset? The prediction model is
meant to be used in real life situations. Here are a few instances in which the prediction can be
applied to: Navigating educational paths and career choices: Students or individuals in the labor
force can use this model to steer them in the right career path that would be more beneficial to
them financially. Incorporating future financial planning: Individuals can use the model to
predict their future earnings based on their qualifications for future planning and budgeting.
Encouraging fair and appropriate hiring practices: Businesses and corporations may utilize the
model to identify which candidates deserve a higher income in a competitive setting. For
example, all other factors being constant, a candidate with more years of educational experience
will earn more than a fellow candidate with less educational experience. In addition to this,
companies can adopt this model in identifying gender bias when it comes to wages. The accuracy
of the income prediction model in this case depends on the relationship between the predictor
variables and income.

5.1 Models for Income Prediction

In this project, we will be using the regression models to estimate the correlation between wages
and the independent variables listed in our dataset using Linear and Logistic regression models.

5.1.1 Methodology for Income Prediction

This method aims to predict income based on a set of predictor variables using two regression
models: Linear Regression and Logistic Regression. The goal is to develop right predictive
models to help individuals and organizations in estimating income levels effectively. After the
data collection and cleaning process, we can explore our predictor variables appropriately.
5.1.2 Linear Regression

The above results show information on a linear regression model that predicts ‘wages 'based on
the predictor variables. The coefficients represent the impact of the predictor variables on wage.
Significant predictors (p-value ¡ 0.05) include ‘educ ‘, ‘exper‘, ‘faminc‘, ‘nchild‘, ‘black ‘,
‘female ‘, ‘metro ‘, ‘Midwest ‘, and ‘south ‘. The R-squared value 0.2362 shows that the model
explains 23.62 The larger the absolute value of the coefficient, the stronger the impact of the
variable on wages. In this case, Education has the highest positive coefficient (2.62), indicating
that, on average, each added unit of education is associated with an increase in wage by 2.62
units. Therefore, ‘educ‘is the most efficient predictor of wages in this model. Table 3 gives a
summary of the linear regression model. Table 4 shows the VIF Values of the predicting
variables from the linear regression model.

In addition to ‘educ ‘(Education), which is the most efficient predictor of wages based on its
coefficient magnitude, there are several other predictors that are statistically significant and have
meaningful coefficients, indicating their efficiency in predicting wages.

These predictors include:

exper (Experience): With a coefficient of 0.20, experience is positively related to wages. On

average, as experience increases, wages tend to increase as well.

faminc (Family Income): The coefficient for faminc is positive (0.000016), suggesting that
higher family income is associated with higher wages. While the coefficient is small, it is
statistically significant.

nchild (Number of Children): The coefficient for nchild is 1.11, indicating that having more
children is associated with higher wages, on average.

metro (Metropolitan Area): Living in a metropolitan area has a positive impact on wages, as
indicated by the coefficient of 3.097.

female (Gender): The coefficient for female is -4.440, indicating that being female is associated
with lower wages, on average.

Midwest (Region - Midwest): Residing in the Midwest region has a negative impact on wages, as
shown by the coefficient of -2.104.
south (Region - South): Similarly, living in the South region is associated with lower wages, as
indicated by the coefficient of -0.791.

5.1.3 Logistic Regression

The second prediction model we will be using to predict income is the Logistic Regression
Model. To do this, the response variable wage would have to be a binary variable. Table 5 gives
a summary of the logistic regression model. Table 6 shows the VIF Values of the predicting
variables from the logistic regression model.

Intercept: The intercept represents the estimated log-odds of the binary outcome variable (wage
binary) when all predictor variables are set to zero. In this case, it’s approximately -7.60548. The
negative sign indicates that the log-odds of the binary outcome are negative when all predictors
are zero. Predictor Variables:

educ: For a one-unit increase in education (e.g., one additional year of education), the log-odds
of the binary outcome are expected to increase by approximately 0.42135. The p-value (0.00000)
suggests that education is highly statistically significant.

exper: For a one-unit increase in experience, the log-odds of the binary outcome are expected to
increase by approximately 0.02908. The p-value (0.00000) indicates that experience is highly
statistically significant.

faminc: The coefficient is close to zero (0.00000), suggesting that changes in family income have
a negligible effect on the log-odds of the binary outcome. However, it has a statistically
significant p-value (0.00028).

Hrswork: For a one-unit increase in hours worked, the log-odds of the binary outcome are
expected to increase by approximately 0.00424. The p-value (0.14612) suggests that hours
worked are not statistically significant at a typical significance level of 0.05.

nchild: For a one-unit increase in the number of children, the log-odds of the binary outcome are
expected to increase by approximately 0.15371. The p-value (0.00000) indicates that the number
of children is highly statistically significant.
Asian: The coefficient is negative (-0.02795), suggesting that being Asian is associated with a
decrease in the log-odds of the binary outcome, but the effect is not statistically significant (p-
value: 0.79560).

black: For individuals who are Black, the log-odds of the binary outcome are expected to
decrease by approximately -0.49815. The p-value (0.00000) suggests that race (being Black) is
highly statistically significant.

divorced: Being divorced is associated with an increase in the log-odds of the binary outcome by
approximately 0.07800, but the effect is not statistically significant (p-value: 0.29488).

female: Being female is associated with a decrease in the log-odds of the binary outcome by
approximately -0.65940. The p-value (0.00000) suggests that gender (being female) is highly
statistically significant.

metro: Living in a metropolitan area is associated with an increase in the odds of the binary
outcome by approximately 0.52112. The p-value (0.00000) shows that metro status is highly
statistically significant.

Midwest: Living in the Midwest is associated with a decrease in the log odds of the binary
outcome by approximately -0.16256. The p-value (0.01903) suggests that living in the Midwest
is statistically significant.

northeast: Living in the Northeast is associated with a slight increase in the log-odds of the
binary outcome by approximately 0.05265, but the effect is not statistically significant (p-value:
0.46397).

south: Living in the South is associated with a decrease in the log-odds of the binary outcome by
approximately -0.06954, but the effect is not statistically significant (p-value: 0.29559)

Here are the VIF Values from the Logistic Regression

6 Conclusion

In conclusion, both the linear and logistic regression models have shown promise in predicting
income. Table. 7 illustrates both the summary for the linear and logistic models. However, when
choosing between the two, the logistic regression model appears as the preferred choice. This
decision is based on the problem. Also, the logistic regression model gives a more realistic result
based on the predicting variables given in our dataset. For example, in our logistic model, the
results show is positive coefficient for the number of hours worked, meaning that there is indeed
a positive effect on income when you work more hours. Predicting income levels often involves
classifying individuals into income categories, making it a classification problem. Logistic
regression is well-suited for such tasks as it gives a clear probability-based classification, making
it easier to interpret and act upon. Therefore, due to its suitability for the income prediction
problem and its ability to give more insights, the logistic regression model is our preferred
choice.

References

“Income and Poverty in the US: 2020” US Census Bureau, URL:

https://www.census.gov/library/publications/2022/demo/p60-276.html

Aamir Ali A. “Adult Census Income-Analysis” Medium. November 11, 2019, URL:
https://medium.com/data-warriors/eda-of-adult-census-income-dataset-cc9ac1a3d552

“Is Adult Income Dataset Imbalanced” Analytics Vidhya, URL:

https://www.analyticsvidhya.com/blog/2022/06/is-adult-income-dataset-imbalanced/

Mincer, Jacob A., “Introduction to Schooling, Earnings and Income,” National Bureau of Economic
Research, January 1974, URL: https://www.nber.org/books-and-chapters/schooling-experience-and-
earnings/introduction-schooling-experience-and-earnings

Blau, Francine D., and Lawrence M. Kahn. 2017. "The Gender Wage Gap: Extent, Trends, and
Explanations." Journal of Economic Literature, 55 (3): 789-865.DOI: 10.1257/jel.20160995 URL:
https://www.aeaweb.org/articles?id=10.1257/jel.20160995
Dataset: cpsdata_PoEbook.csv

Appendix

Tables
Figures
View publication stats

DPWH Memorandum Summary
No ratings yet
DPWH Memorandum Summary
1 page
Epa Probit Analysis Program
100% (1)
Epa Probit Analysis Program
5 pages
Nagpur Gondia
No ratings yet
Nagpur Gondia
2 pages
Test Bank For Strategic Management: Theory and Cases: An Integrated Approach, 13th Edition, Charles W. L. Hilldownload
100% (3)
Test Bank For Strategic Management: Theory and Cases: An Integrated Approach, 13th Edition, Charles W. L. Hilldownload
32 pages
Adult Census Income Prediction
100% (1)
Adult Census Income Prediction
31 pages
Princ Ch19 Presentation7e
100% (1)
Princ Ch19 Presentation7e
33 pages
Biostatistics - Data and Its Types
No ratings yet
Biostatistics - Data and Its Types
11 pages
National Economics University Advanced Educational Program: Advanced Finance 64A Ph.D. Nguyen Manh The
No ratings yet
National Economics University Advanced Educational Program: Advanced Finance 64A Ph.D. Nguyen Manh The
25 pages
Econ 251 PS2 Solutions
No ratings yet
Econ 251 PS2 Solutions
11 pages
Admission of A Partner MCQs 2024
No ratings yet
Admission of A Partner MCQs 2024
4 pages
Settlement Management
No ratings yet
Settlement Management
6 pages
Pertemuan 1
No ratings yet
Pertemuan 1
32 pages
Understanding Data
No ratings yet
Understanding Data
64 pages
1 s2.0 S0014292121000660 Main
No ratings yet
1 s2.0 S0014292121000660 Main
29 pages
Final Project
No ratings yet
Final Project
22 pages
Salary Data Analysis - Phase 1
No ratings yet
Salary Data Analysis - Phase 1
5 pages
Recitation 3 ENG CLASS
No ratings yet
Recitation 3 ENG CLASS
12 pages
Micro ch19 Presentation6e (2013)
No ratings yet
Micro ch19 Presentation6e (2013)
33 pages
Nguyễn Trí Dũng - AU09HN - Elementary Statistics - Final Exam
No ratings yet
Nguyễn Trí Dũng - AU09HN - Elementary Statistics - Final Exam
10 pages
Ecnometrics 8775
No ratings yet
Ecnometrics 8775
6 pages
Iin Rahmah Fadhillah
No ratings yet
Iin Rahmah Fadhillah
6 pages
Spending Data Script
No ratings yet
Spending Data Script
4 pages
Assign Docs
No ratings yet
Assign Docs
20 pages
Ali Zubair
No ratings yet
Ali Zubair
4 pages
Test Metrics
No ratings yet
Test Metrics
10 pages
PBM - S2022 (4549211) (Gturanker - Com)
No ratings yet
PBM - S2022 (4549211) (Gturanker - Com)
2 pages
Surigao Del Sur State University
No ratings yet
Surigao Del Sur State University
5 pages
Lesson 1
No ratings yet
Lesson 1
76 pages
BE As 3 (Fixed)
No ratings yet
BE As 3 (Fixed)
13 pages
Chapter 7 - Autocorelation
No ratings yet
Chapter 7 - Autocorelation
36 pages
NFPA 13-2019 Handbook 60
No ratings yet
NFPA 13-2019 Handbook 60
1 page
Srilanka Tandc
No ratings yet
Srilanka Tandc
4 pages
Report
No ratings yet
Report
5 pages
Black and White Grey Modular Abstract Strategy Deck Business Presentation
No ratings yet
Black and White Grey Modular Abstract Strategy Deck Business Presentation
20 pages
ECON1203 Business Economics and Statistics
No ratings yet
ECON1203 Business Economics and Statistics
4 pages
Financial Statement For Igp
No ratings yet
Financial Statement For Igp
21 pages
Statement 22399810
No ratings yet
Statement 22399810
1 page
RUIZAHIROVIpaper UseofCarbonfiber 7 ENG A4
No ratings yet
RUIZAHIROVIpaper UseofCarbonfiber 7 ENG A4
16 pages
Adult Income Prediction
No ratings yet
Adult Income Prediction
9 pages
BB 107 Fall 2020 Group Assignment Minkyu
No ratings yet
BB 107 Fall 2020 Group Assignment Minkyu
6 pages
Lec 1 Part 2
No ratings yet
Lec 1 Part 2
19 pages
Reviewer Final Exam
No ratings yet
Reviewer Final Exam
12 pages
Understanding Wage Characteristics
No ratings yet
Understanding Wage Characteristics
13 pages
MA SecA Group7
No ratings yet
MA SecA Group7
20 pages
LOI кукуруза PDF
No ratings yet
LOI кукуруза PDF
1 page
Theory of Quants For 11-12-2024
No ratings yet
Theory of Quants For 11-12-2024
30 pages
Probability Theory 1st Week
No ratings yet
Probability Theory 1st Week
44 pages
DataAnalysis 101
No ratings yet
DataAnalysis 101
3 pages
Economic Growth-WPS Office
No ratings yet
Economic Growth-WPS Office
16 pages
AI Report
No ratings yet
AI Report
16 pages
Nature of EMCT
No ratings yet
Nature of EMCT
4 pages
Afroplast Energy Audit Report 25112017
No ratings yet
Afroplast Energy Audit Report 25112017
62 pages
Simulation LC Transfer 01
No ratings yet
Simulation LC Transfer 01
3 pages
MAE 301: Applied Experimental Statistics
No ratings yet
MAE 301: Applied Experimental Statistics
10 pages
Introduction To Econometrics: Temesgen Worku T.w.bezabih@vu - NL
No ratings yet
Introduction To Econometrics: Temesgen Worku T.w.bezabih@vu - NL
60 pages
The Narrowing Male-Female Unemployment Differential
No ratings yet
The Narrowing Male-Female Unemployment Differential
20 pages
SimpleRegression Transcript
No ratings yet
SimpleRegression Transcript
4 pages
Conditional Deed of Sale
No ratings yet
Conditional Deed of Sale
3 pages
Report 1 AI17C DBM302m KhaiHoan BaoChau VanThu
No ratings yet
Report 1 AI17C DBM302m KhaiHoan BaoChau VanThu
6 pages
JobCard Invoice List BISWAJIT MAHATA PDF
100% (1)
JobCard Invoice List BISWAJIT MAHATA PDF
3 pages
Kaushik Project
No ratings yet
Kaushik Project
13 pages
Stats Project Reportv1.0
No ratings yet
Stats Project Reportv1.0
14 pages
Wooldridge 7e Ch01 IM
No ratings yet
Wooldridge 7e Ch01 IM
8 pages
2022bbe1052 Ecotrix Merged
No ratings yet
2022bbe1052 Ecotrix Merged
18 pages
KRA 2 Ok
No ratings yet
KRA 2 Ok
12 pages
Project3 1
No ratings yet
Project3 1
2 pages
Stats Ch.13 Linear Regression
No ratings yet
Stats Ch.13 Linear Regression
42 pages
Nguyen Final Project Report
No ratings yet
Nguyen Final Project Report
10 pages
Technical Specification Cranefrigor WDV Qvo 465
No ratings yet
Technical Specification Cranefrigor WDV Qvo 465
4 pages
Final AB 19-21 PIM3 Basics of Business Statistics
No ratings yet
Final AB 19-21 PIM3 Basics of Business Statistics
37 pages
Geopolitics N Geoeconomics
No ratings yet
Geopolitics N Geoeconomics
6 pages
What Is Statistics
No ratings yet
What Is Statistics
7 pages
Solution Manual For Introductory Econometrics 6th Edition by Woolridge
0% (3)
Solution Manual For Introductory Econometrics 6th Edition by Woolridge
7 pages
DS Practical 01
No ratings yet
DS Practical 01
9 pages
2.1 Descriptive Statistics (Tabular and Graphical)
No ratings yet
2.1 Descriptive Statistics (Tabular and Graphical)
8 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
45 pages
Indian Standard: (First Revision)
No ratings yet
Indian Standard: (First Revision)
8 pages
Charles Business Plan
No ratings yet
Charles Business Plan
29 pages
4.2 Solution
No ratings yet
4.2 Solution
4 pages
Get Pay Right: How to Achieve Pay Equity That Works
From Everand
Get Pay Right: How to Achieve Pay Equity That Works
Kent Plunkett
No ratings yet
Business Statistics I Essentials
From Everand
Business Statistics I Essentials
Louise Clark
5/5 (5)
Statistics: Practical Concept of Statistics for Data Scientists
From Everand
Statistics: Practical Concept of Statistics for Data Scientists
John Slavio
No ratings yet
Essentials of Data Analysis
From Everand
Essentials of Data Analysis
Agasti Khatri
No ratings yet
Strategic Employee Surveys: Evidence-based Guidelines for Driving Organizational Success
From Everand
Strategic Employee Surveys: Evidence-based Guidelines for Driving Organizational Success
Jack Wiley
No ratings yet
All About Data Science: Learn Data Science from scratch
From Everand
All About Data Science: Learn Data Science from scratch
Devi Prasad
No ratings yet
Strategies to Explore Ways to Improve Efficiency While Reducing Health Care Costs
From Everand
Strategies to Explore Ways to Improve Efficiency While Reducing Health Care Costs
Calvin Tchatchoua
No ratings yet
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
From Everand
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
Peter Bradley
No ratings yet
"Data Analysis" Basic Concepts and Applications
From Everand
"Data Analysis" Basic Concepts and Applications
Sukanta Bhattacharya
No ratings yet
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Notes on Population Health: The Healthcare Guys
From Everand
Notes on Population Health: The Healthcare Guys
The Healthcare Guys
No ratings yet

Income Prediction Analysis

Uploaded by

Income Prediction Analysis

Uploaded by

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

STATISTICS PROJECT REPORT

Thesis · October 2023

The user has requested enhancement of the downloaded file.

Dr. Arnab Nayak

BDA 610: Advanced Business Statistics

Income Prediction Analysis

3 About the Dataset

3.1 Data Dictionary

The data dictionary of the variables in this dataset is in Table 2.

3.2 Data Visualization

4 Data Cleaning Process

5 Income Prediction Analysis

5.1 Models for Income Prediction

5.1.1 Methodology for Income Prediction

These predictors include:

exper (Experience): With a coefficient of 0.20, experience is positively related to wages. On

5.1.3 Logistic Regression

Here are the VIF Values from the Logistic Regression

“Income and Poverty in the US: 2020” US Census Bureau, URL:

“Is Adult Income Dataset Imbalanced” Analytics Vidhya, URL:

You might also like