Statistics 2
Statistics 2
methods of social investigations. This has been orchestrated by global challenges that require
the application of positivist methodologies in scientific inquiries. With the aid of relevant
examples:
1. Distribution: Descriptive statistics involve studying the distribution of data, which refers
to the frequency of each value in the data set. Example: In a survey of 200 people, 50
people said they visited the library 0-4 times in the past year, 40 people visited 5-8
times, and so on.
2. Central tendency: This refers to the average of the data set, which can be measured
using the mean, median, or mode. Example: The mean number of library visits is 9.5, the
median is 7.5, and the mode is 3.
3. Mean: This is the sum of all values divided by the total number of values. It is sensitive
to outliers. Example: The mean number of library visits is 9.5.
4. Median: This is the middle value in a data set. It is not affected by outliers. Example: The
median number of library visits is 7.5.
5. Mode: This is the most frequently occurring value in a data set. Example: The mode
number of library visits is 3.
6. Variability: This refers to how spread out the data is. It can be measured using the
range, variance, or standard deviation. Example: The standard deviation of library visits
is 9.18.
7. Range: This is the difference between the highest and lowest values in a data set.
Example: The range of library visits is 24.
8. Variance: This is the average of the squared deviations from the mean. It reflects the
degree of spread in the data set. Example: The variance of library visits is 84.3.
9. Standard deviation: This is the square root of the variance. It is a measure of the
average distance of each value from the mean. Example: The standard deviation of
library visits is 9.18.
10. Univariate descriptive statistics: This involves studying one variable at a time. Example:
The mean number of library visits is 9.5, the standard deviation is 9.18, and the range is
24.
11. Bivariate descriptive statistics: This involves studying the relationship between two
variables. Example: In a survey of 200 people, we found that women visited the library
more often than men.
12. Frequency distribution: This is a table or graph that shows the frequency of each value
in a data set. Example: A table showing the number of people who visited the library 0-4
times, 5-8 times, and so on.
Inferential statistics is a branch of statistics that involves making inferences or predictions
about a population based on a sample of data. It allows researchers to draw conclusions about
a larger group based on the characteristics of a smaller group.
3. Standard error: The standard error is a measure of the variability of a sample statistic,
such as the mean. It is used to calculate confidence intervals for population parameters.
Example: A researcher wants to estimate the mean score on a test for all students in a
school. They take a random sample of 50 students and calculate the mean score. The
standard error is used to calculate a confidence interval for the population mean score.
7. Type I and Type II errors: Inferential statistics involves the risk of making type I and type
II errors. Example: A researcher tests whether a new drug is effective in treating a
disease. A type I error would occur if they conclude that the drug is effective when it is
not. A type II error would occur if they conclude that the drug is not effective when it is.
8. Power: The power of a statistical test is the probability of correctly rejecting the null
hypothesis when it is false. Example: A researcher tests whether a new drug is effective
in treating a disease. The power of the test is the probability of correctly concluding that
the drug is effective when it is.
9. Parametric and nonparametric tests: Parametric tests assume a specific distribution for
the population, such as a normal distribution. Nonparametric tests do not assume a
specific distribution. Example: A researcher wants to compare the median income of
two groups. They use a nonparametric test because they do not want to assume a
specific distribution for the population.
10. Correlation and regression: Inferential statistics can be used to study the relationship
between two variables. Example: A researcher wants to study the relationship between
hours of study and test scores. They use correlation and regression analysis to model
the relationship.
11. Analysis of variance (ANOVA): ANOVA is a statistical technique used to compare the
means of more than two groups. Example: A researcher wants to compare the mean
income of workers in three different departments. They use ANOVA to test whether the
differences in means are statistically significant.
(b) Illustrating your answers with relevant examples, identify and explain four [4]
research designs usually used by positivists researchers.
The researcher controls all other variables to ensure that any changes in the
dependent variable are due to the manipulation of the independent variable.
This design is often used in scientific research, such as testing the effectiveness
of a new drug.
This design involves examining the relationship between two or more variables
without manipulating them.
The researcher measures the variables and looks for patterns of association.
This design is often used to explore potential causal relationships, but it cannot
establish causality on its own.
This design is often used in social science research, such as studying the
relationship between income and education level.
This design involves collecting data from a large sample of people using a
standardized questionnaire.
This design is often used to gather information about people's attitudes, beliefs,
and behaviors.
This design is often used to explore complex phenomena that cannot be easily
studied using other research designs.
(c) By the aid of local examples, explain four [4] probability sampling techniques used in
quantitative studies.
2. Stratified Sampling
3. Systematic Sampling
This method is often used when the population is large and it is not feasible to
list all members.
Example: A researcher wants to study the quality of water in a local river. They
take a sample every 100 meters along the length of the river.
4. Cluster Sampling
This method is often used when the population is geographically dispersed and it
is not feasible to list all members.
Each of these sampling techniques has its own advantages and disadvantages, and the choice of
technique depends on the research question, the nature of the population, and the resources
available to the researcher.
BIBLIOGRAPHY
1. King, G., Keohane, R. O., & Verba, S. (1994). Designing Social Inquiry: Scientific Inference
in Qualitative Research. Princeton University Press.
2. Lazarsfeld, P. F. (1955). Mathematical thinking in the social sciences. The Free Press.
3. Lieberson, S. (1992). Making it count: The improvement of social research and theory.
University of California Press.
5. Abbott, A. (2001). Time matters: On theory and method. University of Chicago Press.
6. Raftery, A. E., & Hout, M. (1993). Social dynamics of educational attainment. Sociology
of Education, 66(3), 187-203.
7. DiPrete, T. A., & Grusky, D. B. (2013). The great divide: Global income inequality and its
alternatives. Journal of Economic Literature, 51(1), 3-30.
8. Western, B., & Pettit, B. (2010). Incarceration and social inequality. Daedalus, 139(3), 8-
19.
9. McCall, L., & Percheski, C. (2010). Gender and educational attainment. Annual Review of
Sociology, 36, 375-395.
10. DiPrete, T. A., & Eirich, G. M. (2006). Cumulative advantage as a mechanism for
inequality. American Sociological Review, 71(5), 709-733.
11. Western, B., & Wildeman, C. (2009). The black family and mass incarceration. The
Annals of the American Academy of Political and Social Science, 621(1), 221-242.
12. DiPrete, T. A., & Buchmann, C. (2013). The rise of women: The growing gender gap in
education and what it means for American schools.