Descriptive Analysis
Descriptive Analysis
PRESENTS
RESEARCH WEEK
26th April to 30th April 2024
ORGANIZED BY
B.K. SCHOOL OF MANAGEMENT
2
Analysis vs. Analytics
Business Analytics for Competitive Advantage -> Develops Data-Driven Approaches to Managerial Decisions
using these key Analytics capabilities.
Comprehensive Approach
• method to discover a pattern in large data sets using databases
Text Analytics or data mining tools.
• shows "Why did it happen?" by finding the cause from the insight
Diagnostic Analytics found in Statistical Analysis.
Data Analysis
Optimization
Univariate Bivariate Multivariate
Techniques
Univariate
Descriptive
Analysis
Measures of
Measures of Measures of Measures of
Central
Dispersion Skewness Kurtosis
Tendency
7
Univariate Analysis: Summary Measures
8
9
Descriptive Analysis of Univariate Data:
KURTOSIS
The height and sharpness of the peak relative to the rest of the data or the slope of the curve and the
frequency height is called Kurtosis. Balanda and Mac Gillivary say that increasing Kurtosis is associated with
the movement of the probability mass from the shoulders of a probability distribution into its center and
tails
10
Descriptive Analysis of Univariate
Data: Kurtosis
11
Kurtosis
Measurement Scales
Scale Descriptive Measures Inferential Measures
Nominal Percentage, Mode, Chi-Square, Binomial
Count test
Ordinal Quartiles, Percentiles, Spearman’s Rank
Deciles, Median Correlation, Friedman,
ANOVA
Interval Mean, Range, Product-moment
Standard Deviation correlation, t tests,
regression.
Ratio Geometric Mean, All Statistical tests can
Harmonic Mean, be performed.
Coefficient of
Variation
13
Now we will run Descriptive Statistics in
SPSS
Click on OK.
14
BIVARIATE ANALYSIS
Bivariate data is used for little complex analysis than as compared with univariate data.
Bivariate data is the data in which analysis are based on two variables per observation
simultaneously.
Purpose: Determining empirical relationship between two variables
Bivariate analysis is a simple (two variable) special case of multivariate analysis.
Bivariate analysis can be helpful in testing simple hypotheses.
It can help to determine at what extent it becomes easier to know and predict a value for one
variable (possibly a dependent variable) if we know the value of the other variable (possibly
the independent variable).
Cross classification, correlation, analysis of variance, simple regression etc. are some applications
of bivariate analysis.
15
Tools and Techniques for Bivariate Analysis
Bivariate
Analysis
Correlation coefficient—number used to describe the strength and direction of association between
variables
▪ Very strong = .80 through 1
▪ Moderately strong = .60 through .79
▪ Moderate = .50 through .59
▪ Moderately weak = .30 through .49
▪ Very weak to no relationship 0 to .29
Regression analysis examines associative relationships between a metric dependent variable and one or
more independent variables in the following ways:
Determine whether the independent variables explain a significant variation in the dependent variable:
whether a relationship exists.
Determine how much of the variation in the dependent variable can be explained by the independent
variables: strength of the relationship.
Determine the structure or form of the relationship: the mathematical equation relating the independent
and dependent variables.
Predict the values of the dependent variable.
Control for other independent variables when evaluating the contributions of a specific variable or set of
variables.
Regression analysis is concerned with the nature and degree of association between variables and does
not imply or assume any causality.
21
Simple Linear Regression Model
y = β0 + β1x + ε
Linear component Random Error
component
24
Population Linear Regression
y y = β0 + β1x + ε
Observed
Value of y for
xi
εi Slope =
Predicted β
Random Error 1
Value of y for
xi for this x value
Intercept =
β0
xi x
25
Estimated Regression Model
b0 and b1 are obtained by finding the values of b0 and b1 that minimize the sum of the squared residuals
e 2
= (y −ŷ) 2
= (y − (b 0 + b1x)) 2
The Least Squares Equation
b1 =
( x − x )( y − y )
sy
(x − x) 2
b1 = r
algebraic equivalent:
sx
xy − n x y and
b1 = b0 = y − b1 x
x 2
−
( x ) 2
n
Chap 13-27
28
Interpretation of the Slope and the Intercept
The coefficient of determination is the portion of the total variation in the dependent variable that is
explained by variation in the independent variable
SSR
R = 2
SST where 0 R 1 2
30
Coefficient of Determination
(continued)
R2
Coefficient of determination
R =r 2 2
where:
R2 = Coefficient of determination
r = Simple correlation coefficient
31
Examples of Approximate R2 Values
y
R2 = 1
Perfect linear relationship
between x and y:
x
R2 = 1
y 100% of the variation in y
is explained by variation in
x
x
R2 = +1
32
Examples of Approximate R2 Values
y
0 < R2 < 1
x
33
Examples of Approximate R2 Values
y R2 = 0
No linear relationship
between x and y:
Click on the Statistics button and select: Estimates, Confidence Intervals, Model fit, Descriptives, Part and
Partial correlations and Collinearity diagnostics.