Unit II Descriptive-Statistics-And-Correlation
Unit II Descriptive-Statistics-And-Correlation
DATA
RELATIONSHIPS
: A DEEP D I V E
INTO
DESCRIPTIVE
STATISTICS A N D
CORRELATION
Unit 2: Statistics:
• Descriptive Statistics
• Correlation
• distributions and probability
• Statistical Inference: Populations and samples
• Statistical modelling
• probability distributions
• fitting a model
• Hypothesis Testing
INTRODUCTION TO DATA
RELATIONSHIPS
Definition: The range is the difference between the largest and smallest values in
a dataset.
Formula: Range = Maximum Value − Minimum Value
Example: If a data set contains values 2, 5, 8, 10, and 12,
the range is: 12−2=10
Explanation: The range gives a quick sense of the spread of the data, but it is
affected by extreme values (outliers).
2. Variance
3. Standard Deviation
Example-
Exam Scores Suppose you have the following scores of 20 students on an exam:
85, 90, 75, 92, 88, 79, 83, 95, 87, 91, 78, 86, 89, 94, 82, 80, 84, 93, 88, 81
To calculate descriptive statistics:
• Mean: Add up all the scores and divide by the number of scores. Mean = (85 + 90 + 75 + 92 + 88 + 79 + 83
+ 95 + 87 + 91 + 78 + 86 + 89 + 94 + 82 + 80 + 84 + 93 + 88 + 81) / 20 = 1770 / 20 = 88.5
• Median: Arrange the scores in ascending order and find the middle value. Median = 86 (middle value)
• Mode: Identify the score(s) that appear(s) most frequently. Mode = 88
• Range: Calculate the difference between the highest and lowest scores. Range = 95 - 75 = 20
• Variance: Calculate the average of the squared differences from the mean. Variance = [(85-88.5)^2 + (90-
88.5)^2 + ... + (81-88.5)^2] / 20 = 33.25
• Standard Deviation: Take the square root of the variance. Standard Deviation = √33.25 = 5.77
VISUALIZING DESCRIPTIVE
STATISTICS