Business Analytics Project

This document presents a summary of a dataset on employee absenteeism at a Brazilian courier company from 2007 to 2010. It contains 20 attributes related to employee demographics, health issues, and expenses. Summary statistics, visualizations, linear regression, and hypothesis tests were performed to analyze relationships between attributes like distance from work and expenses, age and service time, and the effect of education level on absenteeism.

Uploaded by

Aurva Bhardwaj

Business Analytics

Project

Submitted to-
Dr. S. Maheswaran

By-
Aurva Bhardwaj-201922066
Koushik G-201922077
Muzammil Quazi-201922083
Neerav Bhardwaj-201922084
Tulika Das-201922105
Overview
The database was created with records of absenteeism at work from
July 2007 to July 2010 at a courier company in Brazil.
Relevant Information:
The data set allows for several new combinations of attributes and
attribute exclusions, or the modification of the attribute type
(categorical, integer, or real) depending on the purpose of the
research. The data set (Absenteeism at work - Part I) was used in
academic research at the Universidade Nove de Julho - Postgraduate
Program in Informatics and Knowledge Management. The data capture
various attributes and their effect on employee absenteeism,
including factors such as age, distance from residence to work,
and transportation expenses. The dataset also records the reason for
each absence, such as the various kinds of diseases that might affect
the employees.
The reason-for-absence attribute is described by the following
categories:
1) Certain infectious and parasitic diseases
2) Neoplasms
3) Diseases of the blood and blood-forming organs and certain disorders involving the immune mechanism
4) Endocrine, nutritional and metabolic diseases
5) Mental and behavioural disorders
6) Diseases of the nervous system
7) Diseases of the eye and adnexa
8) Diseases of the ear and mastoid process
9) Diseases of the circulatory system
10) Diseases of the respiratory system
11) Diseases of the digestive system
12) Diseases of the skin and subcutaneous tissue
13) Diseases of the musculoskeletal system and connective tissue
14) Diseases of the genitourinary system
15) Pregnancy, childbirth and the puerperium
16) Certain conditions originating in the perinatal period
17) Congenital malformations, deformations and chromosomal abnormalities
18) Symptoms, signs and abnormal clinical and laboratory findings, not elsewhere classified
19) Injury, poisoning and certain other consequences of external causes
20) External causes of morbidity and mortality
21) Factors influencing health status and contact with health services
The dataset contains both real and integer values, for attributes
such as education and age.

Description of Data
Data Set Characteristics: Multivariate, Time series
No. of Instances: 740
Attribute Characteristics: Integer, Real
No. of Attributes: 20
Associated Tasks: Classification, Clustering
Missing Values: N/A

Dataset Review
• Out of the total of 740 instances, a sample of 350 entries has
been taken.
• The dataset consists of both ordinal and nominal data.
• Quantitative attributes such as age, weight, height and body mass
index are present.
• A total of 20 attributes are present.
The dataset is multivariate and can be analysed using both descriptive
and inferential statistics. Summary statistics provide measures of
central tendency (mean, median and mode) for the various attributes.
Measures of variation, such as the standard deviation, describe how
widely the data are spread around the mean.
Visual statistics can also be used to present the data in a more
comprehensible manner, using tools such as pie charts, histograms and
box plots.

Statistical Analysis Tools
Summary Statistics

Summary statistics for transportation expense:
Mean 226.7020057
Standard Error 3.575000623
Median 231
Mode 179
Standard Deviation 66.78652319
Sample Variance 4460.43968
Kurtosis -0.455562906
Skewness 0.195363311
Range 270
Minimum 118
Maximum 388
Sum 79119
Count 349

For example, the mean transportation expense is about 226.7 and the
median is 231. The minimum transportation expense is 118 and the
maximum is 388.
Similarly, the summary statistics for distance from work give a mean
of 36.3 kilometres and a median of 36. The minimum distance is 27
kilometres and the maximum is 58.
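The measures in the summary table can be reproduced with a few lines of Python. The sketch below is only illustrative: the `summarize` helper and the `sample` values are assumptions introduced here, not the project's data or tooling (the report itself used Excel and Statcraft). The last lines check that the reported standard error and sample variance are consistent with the reported standard deviation and count.

```python
import math
import statistics


def summarize(values):
    """Return the descriptive measures listed in the summary table."""
    n = len(values)
    sd = statistics.stdev(values)  # sample standard deviation (divides by n - 1)
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "std_dev": sd,
        "variance": statistics.variance(values),
        "std_error": sd / math.sqrt(n),
        "range": max(values) - min(values),
        "min": min(values),
        "max": max(values),
        "sum": sum(values),
        "count": n,
    }


# Hypothetical transportation expenses, NOT the project's data.
sample = [118, 179, 179, 225, 231, 260, 289, 388]
stats = summarize(sample)

# Consistency checks on the figures reported in the table:
# standard error = standard deviation / sqrt(count),
# sample variance = standard deviation squared.
reported_std_error = 66.78652319 / math.sqrt(349)
reported_variance = 66.78652319 ** 2
```

Here `reported_std_error` comes out at about 3.575 and `reported_variance` at about 4460.44, in line with the table.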

Visual Statistics

The histogram shows the number of employees by level of education.
Employees who studied up to high school form the largest group.
The pie chart shows absenteeism by day of the week. Monday has the
highest level of absenteeism.
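The counts behind a histogram or pie chart of this kind can be sketched as a simple frequency tally. The records below are hypothetical and the `shares` helper is an assumption made for illustration; neither comes from the project.

```python
from collections import Counter

# Hypothetical absence records as (weekday, education level); NOT the project's data.
records = [
    ("Monday", "High school"), ("Monday", "High school"),
    ("Tuesday", "Graduate"), ("Wednesday", "High school"),
    ("Monday", "Postgraduate"), ("Friday", "High school"),
]

# Frequency of each category, as plotted in a histogram.
by_weekday = Counter(day for day, _ in records)
by_education = Counter(level for _, level in records)


def shares(counts):
    """Convert raw counts to percentage shares, as a pie chart would show."""
    total = sum(counts.values())
    return {key: 100 * n / total for key, n in counts.items()}


weekday_shares = shares(by_weekday)
top_day, top_count = by_weekday.most_common(1)[0]
```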
Linear Regression
[Figure: scatter plot "Distance from Residence to Work" with a linear trend line, y = 0.0581x + 16.772, R² = 0.0687. Horizontal axis: Distance (0 to 500). Vertical axis: Travel Expense (0 to 60).]

The linear model above shows the relation between distance from
residence to work and transportation expense. With R² = 0.0687, only
about 6.9% of the variation in expense is explained by distance from
residence to work, so the linear relationship is weak.
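A trend line and its R² can be computed by ordinary least squares. The following is a minimal sketch under stated assumptions: `fit_line` and the `xs`/`ys` values are hypothetical and are not the project's data; only the coefficients in `reported_line` are taken from the chart.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept, plus R squared."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # R squared = 1 - (residual sum of squares / total sum of squares)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return slope, intercept, 1 - ss_res / ss_tot


# Hypothetical (x, y) pairs, NOT the project's data.
xs = [100, 150, 200, 250, 300, 350]
ys = [22, 25, 31, 28, 36, 38]
slope, intercept, r_squared = fit_line(xs, ys)


def reported_line(x):
    """Prediction from the trend line printed on the chart."""
    return 0.0581 * x + 16.772
```

An R² close to 0, such as the reported 0.0687, means the fitted line explains very little of the spread in the data.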

Regression Statistics
Multiple R 0.663525156
R Square 0.440265633
Adjusted R Square 0.438652565
Standard Error 2.960793027
Observations 349

The linear regression model above shows the relation between service
time and age. The correlation coefficient (Multiple R) of 0.66
indicates that age and service time are moderately and positively
correlated.
The coefficient of determination (R Square) is 0.44, that is, about
44% of the variation in the dependent variable is explained by the
regression model.
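The figures in the Regression Statistics table are tied together by two standard formulas: R Square is the square of Multiple R, and Adjusted R Square corrects R Square for the number of predictors. The sketch below checks this, assuming a single predictor (`k = 1`), which is an assumption made here rather than something the report states.

```python
multiple_r = 0.663525156  # Multiple R from the Regression Statistics table
n = 349                   # observations
k = 1                     # number of predictors (assumed: simple regression)

# R Square is the squared correlation coefficient.
r_square = multiple_r ** 2

# Adjusted R Square penalises R Square for the number of predictors.
adjusted_r_square = 1 - (1 - r_square) * (n - 1) / (n - k - 1)
```

Both values agree with the table (about 0.4403 and 0.4387) to the precision shown there.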

Parameters Coefficients
Age -2.79337902
Service time 0.418098451

The scatter plot shows the fitted regression line together with its
equation.
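Correlation Significance

                                  Distance from residence to work   Service time
Distance from residence to work   NA                                0.00924438
Service time                      0.00924438                        NA

The significance value of 0.0092 is below the 0.05 level, so the
correlation between distance from residence to work and service time
is statistically significant. The table reports the significance of
the correlation rather than its strength, so it shows that the two
attributes are associated, not that they are highly correlated or
that distance from residence to work affects service time.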
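The strength of a correlation and its significance are two different quantities. As a rough sketch, the Pearson coefficient r measures strength, and the t statistic below (with n - 2 degrees of freedom) is what a significance value is derived from. The data and function names are hypothetical and are not taken from the project.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def correlation_t_stat(r, n):
    """t statistic for H0: no correlation, with n - 2 degrees of freedom."""
    return r * math.sqrt((n - 2) / (1 - r * r))


# Hypothetical values, NOT the project's data.
distance = [5, 11, 16, 26, 36, 51]
service_time = [3, 9, 8, 14, 13, 18]

r = pearson_r(distance, service_time)
t = correlation_t_stat(r, len(distance))
```

With a large sample, even a weak correlation can produce a small significance value, which is why the two should be reported separately.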

Inferential Statistics
Hypothesis Testing
T-test for one sample mean

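Ho: the mean age is less than or equal to 35 years

H1: the mean age is more than 35 years

Age
Mean 36.30
Variance 39.33
Observations 349.00
Hypothesized Mean Difference 0.00
df 348.00
t Stat 108.13
P(T<=t) one-tail 0.00
t Critical one-tail 1.65
P(T<=t) two-tail 0.00
t Critical two-tail 1.97

Since the p-value for the test is less than the 0.05 level of
significance, we reject the null hypothesis and conclude that the
mean age of employees is more than 35 years. Note that the reported
t Stat of 108.13 uses a hypothesized mean of 0. Tested against 35
years, the statistic is (36.30 - 35) / sqrt(39.33 / 349), which is
approximately 3.87. This still exceeds the one-tail critical value of
1.65, so the conclusion is unchanged.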
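The t statistic for a one-sample test can be recomputed from the mean, variance and count in the Age table. In this sketch the `one_sample_t` helper is a name introduced for illustration. It evaluates the statistic twice: once with a hypothesized mean of 0, which is the setting shown in the table, and once with 35, which is the value named in the hypotheses.

```python
import math


def one_sample_t(mean, variance, n, hypothesized_mean):
    """t statistic for a one-sample test of the mean."""
    std_error = math.sqrt(variance / n)
    return (mean - hypothesized_mean) / std_error


# Figures from the Age table (rounded to two decimals there).
mean_age, var_age, n = 36.30, 39.33, 349
t_critical_one_tail = 1.65

t_vs_zero = one_sample_t(mean_age, var_age, n, 0)  # setting used in the table
t_vs_35 = one_sample_t(mean_age, var_age, n, 35)   # hypothesis as stated

# One-tailed decision: reject Ho when t exceeds the critical value.
reject_h0 = t_vs_35 > t_critical_one_tail
```

`t_vs_zero` is about 108.13, as reported, and `t_vs_35` is about 3.87. Both are above the critical value, so the decision to reject Ho holds either way.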
ANOVA

Ho: There is no difference in mean absenteeism between the education
groups of employees
HA: There is a difference in mean absenteeism between the education
groups of employees

Anova: Single Factor

SUMMARY
Groups              Count   Sum    Average    Variance
Highschool          582     4323   7.427835   212.1867
Graduate            46      294    6.391304   45.62126
master and doctor   4       21     5.25       10.25
postgraduate        74      405    5.472973   67.0472

ANOVA
Source of Variation   SS            df    MS         F          P-value    F crit
Between Groups        293.9294517   3     97.97648   0.528023   0.663157   2.61759
Within Groups         130258.6215   702   185.5536

Total                 130552.551    705

Since the F-value (0.528) is less than the F-critical value (2.618),
and the p-value (0.663) is greater than 0.05, we fail to reject the
null hypothesis: there is no evidence of a difference in mean
absenteeism between the education groups.
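The single-factor ANOVA table can be rebuilt from the group counts, sums and variances in the SUMMARY table. The sketch below does this with plain arithmetic. The variable names are introduced here for illustration, and the critical value is copied from the table rather than computed.

```python
# (count, sum, variance) per education group, from the SUMMARY table.
groups = {
    "Highschool": (582, 4323, 212.1867),
    "Graduate": (46, 294, 45.62126),
    "master and doctor": (4, 21, 10.25),
    "postgraduate": (74, 405, 67.0472),
}

total_n = sum(n for n, _, _ in groups.values())
grand_mean = sum(s for _, s, _ in groups.values()) / total_n

# Between-groups sum of squares: spread of group means around the grand mean.
ss_between = sum(n * (s / n - grand_mean) ** 2 for n, s, _ in groups.values())
# Within-groups sum of squares: pooled spread inside each group.
ss_within = sum((n - 1) * var for n, _, var in groups.values())

df_between = len(groups) - 1
df_within = total_n - len(groups)

f_stat = (ss_between / df_between) / (ss_within / df_within)
f_critical = 2.61759  # value reported in the ANOVA table
reject_h0 = f_stat > f_critical
```

The recomputed sums of squares and F statistic (about 0.528) agree with the table, and `reject_h0` is `False`.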
References-
• www.kaggle.com
• http://archive.ics.uci.edu/ml/datasets/Absenteeism+at+work
• www.statcraft.com

Tools Used
• Microsoft Excel
• Statcraft
