Da Stats Topic02 Introduction To Descriptive and Predictive Analysis
What Happened?
• Discrete Variables
Can take on only a finite number of values
No meaningful values exist between any two adjacent values
Summary statistics of discrete data (such as the mean) may still
take “in-between” values
• Continuous Variables
Theoretically have an infinite number of points between any two
numbers
Variables do not have gaps between adjacent numbers
\[ f = \frac{1}{T} \]
\[ \mu = \frac{\sum X}{N} \]
𝜇 = the symbol for the mean of a population
𝑋 = a score in the distribution
𝑁 = population size
Σ = sum up a set of scores, Σ𝑋 = 𝑋1 + 𝑋2 + 𝑋3 + ⋯ + 𝑋𝑁
5, 8, 10, 11, 12
\[ \mu = \frac{5 + 8 + 10 + 11 + 12}{5} = \frac{46}{5} = 9.20 \]
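The worked example above can be reproduced in a few lines of Python. This is a minimal sketch; the function name `population_mean` is chosen here for illustration and does not come from the slides.

```python
def population_mean(scores):
    """Return the population mean: the sum of the scores divided by N."""
    if not scores:
        raise ValueError("scores must not be empty")
    return sum(scores) / len(scores)


scores = [5, 8, 10, 11, 12]  # the scores from the example above
mu = population_mean(scores)
print(mu)  # 9.2
```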
\[ \mu = \frac{\sum X f}{\sum f} \]
𝑓 = frequency with which a score appears
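As a sketch of how the frequency form of the mean might be computed, the snippet below stores each score with its frequency in a dictionary. The data and the name `mean_from_frequencies` are hypothetical and are not taken from the slides.

```python
def mean_from_frequencies(freq_table):
    """Mean of a frequency table: sum(X * f) divided by sum(f)."""
    total = sum(score * freq for score, freq in freq_table.items())
    count = sum(freq_table.values())
    if count == 0:
        raise ValueError("frequency table must contain at least one score")
    return total / count


# Hypothetical data: the score 2 appears once, 3 appears twice, 4 appears once.
freq_table = {2: 1, 3: 2, 4: 1}
print(mean_from_frequencies(freq_table))  # 3.0
```

The result matches the ordinary mean of the expanded list 2, 3, 3, 4.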
• Weighted Mean
\[ M = \frac{n_1 V_1 + n_2 V_2 + n_3 V_3 + \cdots + n_k V_k}{n_1 + n_2 + n_3 + \cdots + n_k} \]
Mean: 𝑀1 = 50 𝑀2 = 70 𝑀3 = 65
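The weighted mean can be sketched as follows. The three group means (50, 70, 65) are the ones listed above, but the slides do not state the group sizes here, so the sizes 10, 20 and 10 are assumptions made purely for illustration, as is the name `weighted_mean`.

```python
def weighted_mean(sizes, values):
    """Weighted mean: sum(n_i * V_i) divided by sum(n_i)."""
    if len(sizes) != len(values):
        raise ValueError("sizes and values must have the same length")
    total_n = sum(sizes)
    if total_n == 0:
        raise ValueError("total group size must be positive")
    return sum(n * v for n, v in zip(sizes, values)) / total_n


group_means = [50, 70, 65]   # from the slide
group_sizes = [10, 20, 10]   # hypothetical sizes, not from the slide
print(weighted_mean(group_sizes, group_means))  # 63.75
```

With equal group sizes the weighted mean reduces to the simple average of the group means.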
[Figure: the median with 4 scores on each side]
[Figure: 2 scores on each side of the two middle scores]
\[ \text{Median} = \frac{15 + 16}{2} = 15.5 \]
105
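A minimal sketch of the median rule: sort the scores, take the middle score when the count is odd, and average the two middle scores when it is even. The data sets below are hypothetical, chosen so that the even case has 15 and 16 as its middle scores.

```python
def median(scores):
    """Middle score of the sorted data, or the average of the two middle scores."""
    if not scores:
        raise ValueError("scores must not be empty")
    ordered = sorted(scores)
    mid = len(ordered) // 2
    if len(ordered) % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


even_scores = [12, 14, 15, 16, 18, 20]  # hypothetical, 2 scores on each side
odd_scores = [12, 14, 15, 16, 18]       # hypothetical
print(median(even_scores))  # 15.5
print(median(odd_scores))   # 15
```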
• Range
𝑅𝑎𝑛𝑔𝑒 = 𝑋𝐻 − 𝑋𝐿
Range = 50 – 17 = 33
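In code, the range is the highest score minus the lowest. The data below are hypothetical; only the highest (50) and lowest (17) scores come from the example above.

```python
def score_range(scores):
    """Range: highest score minus lowest score."""
    if not scores:
        raise ValueError("scores must not be empty")
    return max(scores) - min(scores)


scores = [17, 23, 31, 42, 50]  # hypothetical scores with X_H = 50 and X_L = 17
print(score_range(scores))  # 33
```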
\[ MD = \frac{\sum |X - \mu|}{N} \]
𝑋 = a raw score
𝜇 = the population mean
𝑁 = the number of scores in the population
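The mean deviation can be sketched as the average absolute distance of each score from the population mean. The absolute value matters: without it, the deviations from the mean always sum to zero. The function name is illustrative, and the scores are reused from the earlier mean example.

```python
def mean_deviation(scores):
    """Mean deviation: average absolute deviation from the population mean."""
    if not scores:
        raise ValueError("scores must not be empty")
    mu = sum(scores) / len(scores)
    return sum(abs(x - mu) for x in scores) / len(scores)


scores = [5, 8, 10, 11, 12]  # mean is 9.2
print(round(mean_deviation(scores), 2))  # 2.16
```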
\[ \sigma = \sqrt{\frac{\sum (X - \mu)^2}{N}} \]
𝜎 = the symbol for the standard deviation
𝑋 = a raw score
𝜇 = the population mean
𝑁 = the number of scores in the population
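A short sketch of the population standard deviation, written out from the definitions above and cross-checked against `statistics.pstdev` from the Python standard library. The scores are reused from the earlier mean example.

```python
import math
import statistics


def population_std(scores):
    """Population standard deviation: square root of the mean squared deviation."""
    if not scores:
        raise ValueError("scores must not be empty")
    mu = sum(scores) / len(scores)
    variance = sum((x - mu) ** 2 for x in scores) / len(scores)
    return math.sqrt(variance)


scores = [5, 8, 10, 11, 12]
sigma = population_std(scores)
print(round(sigma, 4))                      # 2.4819
print(round(statistics.pstdev(scores), 4))  # 2.4819, same value from the standard library
```

Note that this divides by N (population form); a sample standard deviation would divide by N - 1 instead.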
\[ g_1 = \frac{\sum_{i=1}^{N} (X_i - \mu)^3 / N}{s^3} \]
\[ g_2 = \frac{\sum_{i=1}^{N} (X_i - \mu)^4 / N}{s^4} \]
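The two moment ratios above (third power for skewness, fourth power for kurtosis) can be sketched as below. The slides do not say how s is computed, so this sketch assumes s is the population standard deviation (dividing by N). The data and function names are hypothetical.

```python
def central_moment(scores, power):
    """Average of (X_i - mean) raised to the given power."""
    mu = sum(scores) / len(scores)
    return sum((x - mu) ** power for x in scores) / len(scores)


def skewness(scores):
    """Third central moment divided by s**3 (s assumed to be the population SD)."""
    variance = central_moment(scores, 2)
    return central_moment(scores, 3) / variance ** 1.5


def kurtosis(scores):
    """Fourth central moment divided by s**4 (not the 'excess' form, no minus 3)."""
    variance = central_moment(scores, 2)
    return central_moment(scores, 4) / variance ** 2


scores = [1, 2, 3, 4, 5]  # hypothetical, symmetric about the mean
print(skewness(scores))            # 0.0 for symmetric data
print(round(kurtosis(scores), 4))  # 1.7
```

Some libraries report excess kurtosis (this value minus 3), so results may differ by that constant.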
𝜌 = correlation coefficient
𝑥𝑖 = values of the 𝑥 variable in a sample
𝑦𝑖 = values of the 𝑦 variable in a sample
𝜇𝑥 = mean of 𝑥 values
𝜇𝑦 = mean of 𝑦 values
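The symbols defined above match the usual Pearson product-moment correlation, in which the summed products of deviations are divided by the square root of the product of the summed squared deviations. Assuming that is the coefficient intended here, a minimal sketch with hypothetical data looks like this:

```python
import math


def pearson_correlation(x, y):
    """Pearson correlation from deviations about the means of x and y."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mu_x = sum(x) / len(x)
    mu_y = sum(y) / len(y)
    cross = sum((xi - mu_x) * (yi - mu_y) for xi, yi in zip(x, y))
    ss_x = sum((xi - mu_x) ** 2 for xi in x)
    ss_y = sum((yi - mu_y) ** 2 for yi in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return cross / math.sqrt(ss_x * ss_y)


x = [1, 2, 3, 4, 5]  # hypothetical data
y = [2, 4, 5, 4, 5]  # hypothetical data
print(round(pearson_correlation(x, y), 4))  # 0.7746
```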
\[ \tau = \frac{n_c - n_d}{n(n - 1)/2} \]
𝜏 = Kendall rank 𝜏
𝑛𝑐 = number of concordant pairs
𝑛𝑑 = number of discordant pairs
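A sketch of Kendall's τ by direct pair counting: a pair of observations is concordant when both variables move in the same direction and discordant when they move in opposite directions. This simple version assumes there are no tied values; the data are hypothetical.

```python
from itertools import combinations


def kendall_tau(x, y):
    """Kendall's tau as (n_c - n_d) / (n(n-1)/2), assuming no ties."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for i, j in combinations(range(len(x)), 2):
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    n = len(x)
    return (concordant - discordant) / (n * (n - 1) / 2)


x = [1, 2, 3, 4]  # hypothetical ranks
y = [1, 3, 2, 4]  # hypothetical ranks; only the pair (2nd, 3rd) is discordant
print(round(kendall_tau(x, y), 4))  # 0.6667
```

Here 5 of the 6 pairs are concordant and 1 is discordant, so τ = (5 - 1) / 6.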
https://www.statisticshowto.com/kendalls-tau/
DA-STATS - Topic 01: Introduction to Descriptive and Predictive Analysis
Spearman Rank Correlation
• Spearman rank correlation is a non-parametric measure of the
relationship between two ranked variables.
• It is suited to correlation analysis of variables on an ordinal
scale.
\[ \rho = 1 - \frac{6 \sum d_i^2}{n(n^2 - 1)} \]
𝜌 = Spearman Rank Correlation
𝑑𝑖 = difference between the ranks of corresponding pairs
𝑛 = number of observations
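The Spearman formula can be sketched by ranking each variable and then applying the rank-difference expression above. This shortcut form is exact only when there are no tied values, which the sketch assumes; the data and function names are hypothetical.

```python
def ranks(values):
    """Rank values from 1 (smallest) to n (largest), assuming no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman_rho(x, y):
    """Spearman rho as 1 - 6 * sum(d_i^2) / (n (n^2 - 1)), assuming no ties."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    d_squared = sum((rx - ry) ** 2 for rx, ry in zip(ranks(x), ranks(y)))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))


x = [10, 20, 30, 40, 50]  # hypothetical observations
y = [1, 3, 2, 5, 4]       # hypothetical observations
print(spearman_rho(x, y))  # 0.8
```

The rank differences are 0, -1, 1, -1, 1, so the sum of squared differences is 4 and ρ = 1 - 24/120.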
https://www.youtube.com/watch?v=DE58QuNKA-c
Spearman Rank Correlation
• Assumptions
data must be at least ordinal