Ba Report
Ba Report
The dataset includes variables such as age, gender, entry exam scores, work
experience, and known salary details. Using statistical techniques such as
hypothesis testing, regression analysis, and descriptive statistics, the
analysis is done to make predictions about the placement and salary of
individuals whose data may be incomplete or missing.
In this report, the dataset provided included several missing values, marked
as either “NA” (Not Available) or “NP” (Not Provided), which were removed or
addressed before conducting further analysis.
The logistic regression model was built using the following variables:
Exam Score: Higher entry exam scores were also positively correlated
with placement probability, indicating that students with higher
academic performance were more attractive to employers.
Gender: Gender was not found to have a statistically significant
impact on placement probability in this analysis.
Results:
The p-value obtained from the t-test was 0.23, which is higher than the
alpha level of 0.05. This result indicates that there is no statistically
significant difference between the salaries of males and females in the
dataset. Therefore, the null hypothesis cannot be rejected, suggesting that
gender alone does not explain salary variations in this sample.
Work experience was another variable tested for its impact on salary. We
hypothesized that individuals with more work experience would have higher
salaries. An ANOVA (Analysis of Variance) test was conducted to
determine whether the mean salaries differ significantly among different
levels of work experience.
The ANOVA test was conducted across various experience ranges (e.g., 0-2
years, 3-5 years, 6+ years).
Results:
The ANOVA test resulted in a p-value of 0.01, which is below the 0.05
threshold. This indicates that there is a statistically significant difference in
salaries based on work experience. Specifically, individuals with more work
experience tend to have higher salaries, confirming that experience is a
strong determinant of salary outcomes.
A similar analysis was conducted to explore whether entry exam scores have
a significant effect on salary. Higher scores could be expected to correlate
with better job placements and higher salaries.
The ANOVA test was conducted by grouping the data into ranges of exam
scores (e.g., below 600, 600-700, above 700).
Results:
The regression model was trained using the available data of individuals
whose salary information was known. The resulting coefficients from the
model were applied to predict the salaries of individuals with missing salary
data. Key findings include:
For example: