Advanced Business Analytics Project: Prepared By: Group 10 Lohith Kumar Vamshi Aparna Samarth
OBJECTIVE
• The objective of this project is to predict the probability of attrition for each employee, in order to
identify who is more likely to leave the organization
• The data set is an employee survey from IBM that records whether or not each employee left (attrition).
It contains 1470 entries
• Given the limited size of the data set, the model should be expected to provide only a modest
improvement in identifying attrition over a random allocation of attrition probabilities
SCOPE
• Organizations can use this prediction model to reduce attrition and to plan the hiring of new employees in advance
• The scope of this project can be extended to multiple industries
DATASET
The dataset contains information on employee satisfaction, income, seniority, and some demographics. It
includes data on 1470 employees.
Steps
1. Data collection and cleaning: the data was already structured and required no cleaning before analysis
2. To perform logistic regression, the Attrition column was converted to binary form (1, 0)
3. Similarly, all categorical variables were converted to factor variables
4. Next, the data was split in an 80:20 ratio into training and testing sets, respectively
5. Logistic regression models were then fitted to the training set, and the model with the lower AIC score
(a lower AIC indicates a better trade-off between fit and complexity) was chosen for testing
6. Once the model was chosen, its accuracy was evaluated on the testing set
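The encoding, splitting, and accuracy steps above can be sketched in a few lines. This is only an illustrative sketch, not the project's actual R code: it is written in Python, the helper names (encode_attrition, train_test_split, accuracy) are hypothetical, the "Yes"/"No" labels are an assumption about how the Attrition column is stored, and the toy rows merely stand in for the 1470-row survey.

```python
import random

def encode_attrition(label):
    """Map an assumed Yes/No attrition label to 1/0."""
    return 1 if label.strip().lower() == "yes" else 0

def train_test_split(rows, train_percent=80, seed=42):
    """Shuffle a copy of the rows, then split them into train and test parts."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * train_percent // 100  # integer arithmetic avoids float rounding
    return shuffled[:cut], shuffled[cut:]

def accuracy(actual, predicted):
    """Fraction of predictions that match the actual 0/1 labels."""
    matches = sum(a == p for a, p in zip(actual, predicted))
    return matches / len(actual)

# Hypothetical toy rows standing in for the survey data.
rows = [{"Attrition": "Yes" if i % 6 == 0 else "No"} for i in range(1470)]
for row in rows:
    row["Attrition"] = encode_attrition(row["Attrition"])

train, test = train_test_split(rows)
print(len(train), len(test))
```

The fixed seed makes the split reproducible, so the two candidate models are trained and tested on the same rows.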
EXPLORATORY DATA ANALYSIS
TABLEAU INSIGHTS
OBSERVATIONS
Model 1: In this model, all the variables were considered as independent variables
Model 2: In this model, only significant variables were considered as independent variables
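A comparison between two such models is commonly made with the Akaike information criterion, AIC = 2k - 2 ln(L), where k is the number of fitted parameters and L is the maximized likelihood; by the usual convention the model with the lower AIC is preferred. The sketch below is a Python illustration only: the outcomes, fitted probabilities, and parameter counts are hypothetical and are not results from this project.

```python
import math

def log_likelihood(actual, probabilities):
    """Bernoulli log-likelihood of 0/1 outcomes given predicted probabilities."""
    return sum(
        math.log(p) if y == 1 else math.log(1.0 - p)
        for y, p in zip(actual, probabilities)
    )

def aic(log_lik, n_parameters):
    """Akaike information criterion: 2k - 2 ln(L)."""
    return 2 * n_parameters - 2 * log_lik

# Hypothetical outcomes and fitted probabilities, for illustration only.
actual = [1, 0, 0, 1, 0]
full_model_probs = [0.8, 0.2, 0.1, 0.7, 0.3]      # stands in for Model 1
reduced_model_probs = [0.7, 0.3, 0.2, 0.6, 0.3]   # stands in for Model 2

aic_full = aic(log_likelihood(actual, full_model_probs), n_parameters=6)
aic_reduced = aic(log_likelihood(actual, reduced_model_probs), n_parameters=3)

# Lower AIC is preferred: the penalty term 2k discourages extra variables.
preferred = "Model 2" if aic_reduced < aic_full else "Model 1"
print(preferred)
```

In this toy case the reduced model fits slightly worse but uses fewer parameters, which is the kind of trade-off the criterion is designed to weigh.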
We have learned how to analyze employee attrition using logistic regression with the help of R
software. With only a few lines of code and a suitable data set, a company can identify which
areas need attention to make the workplace more comfortable for its employees and to retain its
workforce for a longer period