ESM 507 Statistical Analysis B
ESM 507 Statistical Analysis B
The first statistical task is to do a descriptive analysis of all variables collected. In this
analysis it is necessary to present results obtained for each type of variable. Qualitative
variable results must be presented as frequencies and percentages while quantitative variables are
presented as means and deviations. The next statistical analysis depends on the objectives of the
study. The objective of a survey is to obtain information about certain specific situations
of the population, the analysis maybe to access the attribute of a specific variable or the
association between variables. You can use software packages like SPSS, EPInfo, STATA,
Minitab, Open Epi, Graph pad and many others.
This is the act of summarizing every data collected to describe the general picture of what is
been researched on. It provides an overview of the situation and provide information that shed
greater light to the real investigation been made, procedures for this analysis includes;
1. Frequency counts: Charts and graphs: this is the counting and grouping of a population
according to a particular scheme with a deliberate act of categorization for the purpose of
frequency analysis such as percentages, ratios.
2. Measures of Central Tendency: Mode, Median, and Mean help to condense data, and
summarize it to a single value. Such a value is usually somewhere in the center and
represent the entire data set.
3. Variability: Standard deviation, Variance, range, inter-quartile range, summarizes the
spread or dispersion of distribution.
1
Inferential statistical analysis
This provides a way of drawing inductive inferences from data and distinguishing the signal (the
phenomenon of interest) from the noise (statistical fluctuations) present in the data. This involves
more complex statistical procedures. The form of the analysis is determined by the specific
qualitative approach taken and the form of the data. Tests conducted to deduce inference may
include:
1. Chi-square test: A chi-square test is used when you want to see if there is a relationship
between two categorical variables (it is used for testing difference between paired
variables).
2. One-way ANOVA: A one-way analysis of variance (ANOVA) is used when you have a
categorical independent variable (with two or more categories) and a normally distributed
interval dependent variable and you wish to test for differences in the means of the
dependent variable broken down by the levels of the independent variable.
3. Paired t-test: A paired (samples) t-test is used when you have two related observations
(i.e. two observations per subject) and you want to see if the means on these two
normally distributed interval variables differ from one another.
4. Correlation: A correlation is useful when you want to see the linear relationship between
two (or more) normally distributed interval variables. Although it is assumed that the
variables are interval and normally distributed, we can include dummy variables when
performing correlations. \
5. Simple linear regression: Simple linear regression allows us to look at the linear
relationship between one normally distributed interval predictor and one normally
distributed interval outcome variable.
2
Table 1: Shows the factual differences between descriptive statistics and inferential statistics
Arrange, analyze and reflect the data in a Correlate, test and anticipate future
2
meaningful mode. outcomes.
Concluding outcomes are represented in the Final outcomes are the probability
3
form of charts, tables and graphs. scores.
Conclusion
It is important to always ensure data integrity and make accurate and appropriate research
analysis. Improper statistical analyses distort scientific findings, mislead casual readers and may
negatively influence public perception. Integrity issues are just as relevant to analysis as they are
in all other works of life.