Descriptive Statistics
Descriptive Statistics
Objectives
Discussion
Descriptive statistics refers to the branch of statistics that deals with the summarization
patterns, trends, and relationships. Unlike inferential statistics, which seeks to draw
Comparing datasets.
Key Characteristics
2. Variables
Variables are characteristics that can vary among individuals or objects. They are
Types of Variables
1. Independent Variable:
2. Dependent Variable:
variable.
(independent).
3. Data Types
1. Qualitative Data:
o Examples: Eye color (blue, green, brown), marital status (single, married).
2. Quantitative Data:
o Discrete Data:
o Continuous Data:
Levels of Measurement
3. Interval Scale: Numerical data without a true zero (e.g., temperature in Celsius).
4. Ratio Scale: Numerical data with a true zero (e.g., height, weight).
Central tendency provides a single value that represents the center of a dataset.
Mean:
The arithmetic average, calculated by summing all data points and dividing by
Median:
The middle value in a sorted dataset. If the dataset has an even number of
Mode:
The most frequently occurring value in a dataset. Some datasets may have more
Range:
Example: For data points 10, 15, 20, the range is 20−10=1020 - 10 = 10.
Standard Deviation:
Measures the average distance of each data point from the mean. A higher
Variance:
The square of the standard deviation, showing the degree of spread in the dataset.
The range of the middle 50% of data, calculated as the difference between the third
Formula: IQR=Q3−Q1\text{IQR} = Q3 - Q1
6. Data Visualization
plane.
Business:
Healthcare:
Education:
Research:
Before applying descriptive statistical methods, data must be cleaned to remove errors,
1. Identifying missing values and deciding whether to fill, exclude, or analyze separately.
EDA is a complementary process that uses descriptive statistics and data visualization
between variables.
1. Microsoft Excel: Widely used for basic calculations, graph creation, and
summary statistics.
visualization.
computations.
4. SPSS (Statistical Package for the Social Sciences): Commonly used in social
science research.
References
1. Gravetter, F. J., & Wallnau, L. B. (2016). Statistics for the Behavioral Sciences.
Cengage Learning.
2. Moore, D. S., Notz, W. I., & Fligner, M. A. (2018). The Basic Practice of
1. What does descriptive statistics primarily focus on? a. Drawing conclusions about
color d. Income
4. The difference between the highest and lowest values is called: a. Mean b.
5. Which measure describes the spread of data around the mean? a. Mode b.
6. In a normal distribution, the mean, median, and mode are: a. Different b. Equal c.
Unrelated d. Undefined
7. Continuous data can: a. Only take specific values b. Take any value within a
10. Pie charts are best suited for: a. Displaying frequencies b. Showing proportions
Enumeration
1. b
2. c
3. c
4. b
5. c
6. b
7. b
8. b
9. b
10. b
Enumeration Answers:
2. Discrete, Continuous
4. Business, Healthcare
5. Range, Standard Deviation, Variance