Oup 5
Oup 5
By
JITHESH K, P221821
JITHIN CHANDRA, P221822
KADEEJA RINOSHA, P221823
K HRUTHIK KUMAR, P221824
KENDRE KOMAL, P221825
August 2023
Submitted to
Dr. S.VISALAKSHMI
Assistant Professor
Departmentof Management
CENTRAL UNIVERSITY OF TAMIL NADU
Thiruvarur–610005
1
TABLE OF CONTENTS
1 FREQUENCIES 3-8
2
FREQUENCIES
3
4
5
6
7
INTERPRETATION:
For the "Height" variable:
Similar to height, there are 10 valid cases in total, with no missing values.
The most common weight value is 116, which appears once, making it the mode.
The mean weight is 133.
The median weight is 132.
The weights have a standard deviation of about 13.416, indicating a relatively higher spread
compared to the mean.
The range of weights is 37, with the smallest value being 116 and the largest being 153.
8
DESCRIPTIVE STATISTICS 1
9
INTERPRETATION:
From these descriptive statistics, we can infer that the GDP values in the
dataset vary significantly, with a wide range from a minimum of 496.1 to a
maximum of 10243.6. The mean GDP is around 3573, and the distribution has
a positive skewness and slightly heavier tails than a normal distribution. This
suggests that there might be some countries with very high GDP values that
are contributing to the skewness and kurtosis of the distribution. It would be
helpful to visualize the data using histograms or other graphs to better
understand the distribution.
10
DESCRIPTIVE STATISTICS 2
11
INTERPREATION:
This indicates that the average number of employees is around 30.5, with a
relatively high variability. The variance represents the spread of data points
around the mean. The average age is approximately 37.683, with a standard
deviation of 10.934. This suggests that the ages are relatively close to the
mean, with a smaller spread compared to the "Number of Employees"
variable. The average age is approximately 37.683, with a standard deviation of
10.934. This suggests that the ages are relatively close to the mean, with a
smaller spread compared to the "Number of Employees" variable. Overall,
these statistics provide insight into the central tendency and variability of the
data for both variables.
12
SIMPLE CORRELATION:
13
STEP 3 : VIEW THE DATA WHICH IS IMPORTED IN VARIABLE VIEW.
14
STEP 5: SELECT THE FLITER OF YOUR CHOICE IN FILTERS AND
SELECTS THE DIFFERENT TYPES OF METHODS AND DATA
VIEWS.
15
Correlations:
Correlation is a statistical measure that expresses the extent to which
two variables are linearly related (meaning they change together at a
constant rate). It’s a common tool for describing simple relationships
without making a statement about cause and effect.
Simple correlation:
Simple linear correlation is a measure of the degree to which two
variables vary together, or a measure of the intensity of the association
between two variables. • Correlation often is abused. You need to show
that one variable actually is affecting another variable.
[DataSet1]
Descriptive Statistics
Std.
Mean Deviation N
Income 56.00 11.121 7
Expendit
53.57 8.886 7
ure
Correlations
Income Expenditure
Income Pearson Correlation 1 .830*
Sig. (2-tailed) .021
Sum of Squares and
742.000 492.000
Cross-products
Covariance 123.667 82.000
N 7 7
Expenditure Pearson Correlation .830* 1
Sig. (2-tailed) .021
Sum of Squares and
492.000 473.714
Cross-products
Covariance 82.000 78.952
N 7 7
16
*. Correlation is significant at the 0.05 level (2-tailed).
Interpret:
The descriptive statistics and correlation coefficients:
Descriptive Statistics:
Income:
Correlations:
The Pearson correlation coefficient measures the strength and direction
of a linear relationship between two variables. In this case, it is used to
assess the relationship between "Income" and "Expenditure."
17
significant, suggesting that the observed correlation is unlikely to have
occurred by random chance.
Nonparametric Correlations
Correlations
Incom Expendit
e ure
Kendall's IncomeCorrelation
1.000 .619
tau_b Coefficient
Sig. (2-tailed) . .051
N 7 7
Expendit Correlation
.619 1.000
ure Coefficient
Sig. (2-tailed) .051 .
N 7 7
Spearman's Income Correlation
1.000 .750
rho Coefficient
Sig. (2-tailed) . .052
N 7 7
Expendit Correlation
.750 1.000
ure Coefficient
Sig. (2-tailed) .052 .
N 7 7
Nonparametric correlations are used when the data does not meet the
assumptions required for parametric correlations (such as Pearson's
18
correlation) due to non-normality or when the relationship is not linear.
Kendall's tau_b:
Correlation between Income and Expenditure:
Correlation Coefficient: 0.619
Significance (p-value): 0.051 (not significant at the 0.05 level)
The Kendall's tau_b correlation coefficient of 0.619 suggests a moderate
positive correlation between Income and Expenditure. However, since
the p-value (0.051) is slightly above the typical significance level of 0.05,
the correlation is not considered statistically significant. This means that
we do not have strong evidence to conclude that the observed
correlation is beyond what could be expected by random chance.
Spearman's rho:
19
MULTIPLE CORRELATION:
20
STEP 3 : VIEW THE DATA WHICH IS IMPORTED IN VARIABLE VIEW.
21
STEP 5: SELECT THE FLITER OF YOUR CHOICE IN FILTERS AND
SELECTS THE DIFFERENT TYPES OF METHODS AND DATA
VIEWS.
Correlations
[DataSet1]
Descriptive Statistics
22
Mean Std. Deviation N
GDP 1142153.22 233846.634 37
GOVTEXP 125973.30 32913.541 37
CONS 674818.65 136232.151 37
INV 374394.76 88811.183 37
Correlations
GDP GOVTEXP CONS INV
GDP Pearson Correlation 1 .928** .964** .977**
Sig. (2-tailed) .000 .000 .000
Sum of Squares and Cross- 1968632943898 257135841240. 1105983777189 730298878187.
products .271 622 .811 946
Covariance 54684248441.6 7142662256.68 30721771588.6 20286079949.6
19 4 06 65
N 37 37 37 37
** **
GOVTEXP Pearson Correlation .928 1 .885 .872**
Sig. (2-tailed) .000 .000 .000
Sum of Squares and Cross- 257135841240. 38998841699.7 142890139079. 91754715923.6
products 622 30 865 76
Covariance 7142662256.68 1083301158.32 3969170529.99 2548742108.99
4 6 6 1
N 37 37 37 37
CONS Pearson Correlation .964** .885** 1 .952**
Sig. (2-tailed) .000 .000 .000
Sum of Squares and Cross- 1105983777189 142890139079. 668131158636. 414552161462.
products .811 865 432 838
Covariance 30721771588.6 3969170529.99 18559198851.0 11515337818.4
06 6 12 12
N 37 37 37 37
INV Pearson Correlation .977** .872** .952** 1
Sig. (2-tailed) .000 .000 .000
Sum of Squares and Cross- 730298878187. 91754715923.6 414552161462. 283947343806.
products 946 76 838 811
Covariance 20286079949.6 2548742108.99 11515337818.4 7887426216.85
65 1 12 6
N 37 37 37 37
**. Correlation is significant at the 0.01 level (2-tailed).
23
INTERPRETATION:
interpret provided descriptive statistics and correlation coefficients for
the variables: GDP, GOVTEXP, CONS, and INV.
Descriptive Statistics:
GDP:
Correlations:
The Pearson correlation coefficient measures the strength and direction
of a linear relationship between two variables.
24
Pearson Correlation Coefficient: 0.977
Significance (p-value): < 0.01
Correlation between GOVTEXP and CONS:
25
Nonparametric Correlations
Correlations
GDP GOVTEXP CONS INV
Kendall's tau_b GDP Correlation Coefficient 1.000 .766** .859** .874**
Sig. (2-tailed) . .000 .000 .000
N 37 37 37 37
GOVTEXP Correlation Coefficient .766** 1.000 .703** .676**
Sig. (2-tailed) .000 . .000 .000
N 37 37 37 37
CONS Correlation Coefficient .859** .703** 1.000 .793**
Sig. (2-tailed) .000 .000 . .000
N 37 37 37 37
INV Correlation Coefficient .874** .676** .793** 1.000
Sig. (2-tailed) .000 .000 .000 .
N 37 37 37 37
Spearman's rho GDP Correlation Coefficient 1.000 .922** .965** .972**
Sig. (2-tailed) . .000 .000 .000
N 37 37 37 37
** **
GOVTEXP Correlation Coefficient .922 1.000 .877 .864**
Sig. (2-tailed) .000 . .000 .000
N 37 37 37 37
CONS Correlation Coefficient .965** .877** 1.000 .946**
Sig. (2-tailed) .000 .000 . .000
N 37 37 37 37
** ** **
INV Correlation Coefficient .972 .864 .946 1.000
Sig. (2-tailed) .000 .000 .000 .
N 37 37 37 37
**. Correlation is significant at the 0.01 level (2-tailed).
INTERPRETATION:
26
GOVTEXP, CONS, and INV.
Kendall's tau_b:
Spearman's rho:
27
Correlation between GOVTEXP and CONS:
Interpretation:
All the correlation coefficients (both Kendall's tau_b and Spearman's rho)
are positive, indicating a positive monotonic relationship between the
variables.
The significance levels (p-values) for all correlations are less than 0.01,
indicating that these correlations are statistically significant at the 0.01
level. This means that the observed relationships are unlikely to have
occurred by random chance.
28
SIMPLE REGRESSION
29
30
INTERPRETATION:
The dependent variable is "Maintenance cost." The independent variable is
"Age of a car." All requested variables have been included in the analysis. In
summary, the data suggests a strong positive correlation between the age of a
car and its maintenance cost. As the age of a car increases, its maintenance
cost tends to increase as well. The regression analysis helps quantify and
understand this relationship, and the high correlation coefficient along with
the low p-value suggest that the relationship is statistically significant.
31
MULTIPLE REGRESSION
32
33
INTERPRETATION:
Mean, Std. Deviation, and N (sample size) values provide basic information
about the data distribution for each variable. The Correlations table shows
Pearson correlation coefficients between the variables. The closer the value is
to 1, the stronger the positive correlation. It seems that GDP and CONS, as
well as GDP and INV, have strong positive correlations.
In brief, this analysis suggests that GDP is strongly correlated with
Consumption and Investment, while also being influenced by Government
Expenditure. The model aims to predict GDP using all the specified variables.
Keep in mind that further context and domain knowledge are needed for a
comprehensive interpretation. In brief, this analysis suggests that GDP is
strongly correlated with Consumption and Investment, while also being
influenced by Government Expenditure. The model aims to predict GDP using
all the specified variables. Keep in mind that further context and domain
knowledge are needed for a comprehensive interpretation.
34
CHI SQUARE TEST
35
INTERPRETATION:
The Chi-Square analysis appears to be evaluating the association between
smoking and cancer. The low p-values in the Chi-Square tests suggest that
there may be a statistically significant relationship between smoking and the
occurrence of cancer. However, the "Exact Sig (2-sided)" value of 0.982
suggests that the association might not be statistically significant when
considering exact probabilities. In summary, the data indicates a potential
relationship between smoking and cancer, but the exact significance level
suggests caution in interpreting the significance of this relationship.
36
ONE SAMPLE T TEST
37
38
INTERPRETATION:
The data suggests that the scores are significantly different from the
hypothesized mean of 30.04. The p-values (both one-sided and two-sided) are
both less than 0.001, indicating a highly significant difference. The effect sizes,
represented by Cohen's d and Hedges correction, provide information about
the practical significance of the observed difference in scores compared to the
hypothesized mean. Overall, this analysis demonstrates a statistically
significant difference in scores compared to the hypothesized mean, and the
effect sizes provide insights into the magnitude of this difference.
39
PAIRED SAMPLE T TEST
40
INTERPRETATION:
The mean "Before Sales" is higher (14.50) than the mean "After Sales"
(10.00). The correlation between "Before Sales" and "After Sales" is 0.841,
suggesting a strong positive relationship. The significance values for the
correlation test indicate statistical significance. The paired samples test
indicates a mean difference of -3.000, with a statistically significant difference.
The effect size measures (Cohen's d and Hedges' correction) show the
magnitude of the difference between the two conditions. Overall, the data
suggests a significant difference between Before Sales and "After Sales, with
effect size measures indicating the practical significance of this difference.
41
42