Q3 Lectures STATS
Q3 Lectures STATS
Statistics- It is the science of conducting studies to (1) Collect, (2) Organizes, (3) Present, (4) Analyze, and (5)
Interpret data. [COPAI]
b. Continuous random variable - A variable which takes an infinite number of possible values on a
continuous scale. It represents measured data.
Examples:
- Length of time it takes to go from Pampanga to Manila via bus.
- Weight of an infant
- Height of a grade 11 pupil
-
X + 2 = 6 (to find the value of x solve it by using the subtraction property of equality.)
Solution:
X+2=6
X+2–2=6–2
x=4
Random Variable
A random variable has a whole set of values, and it could take on any of those values, randomly.
Example 1: throw a die once
(random variable) X = "the score shown on the top face"
X = 1, 2, 3, 4, 5 or 6
Sample space = { 1, 2, 3, 4, 5, 6 }
We can show the probability of any one value using this style:
P(X = value) = probability of that value
(random variable) X = { 1, 2, 3, 4, 5, 6 }
in this case they are all equally likely, so the probability of any one is 1/6
P(X = 1) = 1/6
P(X = 2) = 1/6
P(X = 3) = 1/6
P(X = 4) = 1/6
P(X = 5) = 1/6
P(X = 6) = 1/6
P(X) = 6/6
Note that the sum of the probabilities = 1, as it should be.
Example 2: Drawing balls from a box
Two balls are drawn in succession without replacement from box containing 5 red balls and 6 blue balls. Let A
be the random variable representing the number of blue balls. Find the values of the random variable a
representing the number of blue balls.
First determine the probabilities of events.
Red-red = (5/11)(4 /10) Blue-red = (6 /11)(5/10)
=20/110 = 30/110
= 2/11 = 3/11
Red-blue = (5/11)(6 /10) Blue-blue = (6 /11)(5/10)
= 30/110 =30/110
= 3/11 = 3/11
(random variable) A = “the color of the picked ball”
(random variable) A = 0, 1, 2
Sample space = { 0, 1, 2 }
P(A = 0) = 2/11
P(A = 1) = 6/11
P(A= 2) = 3/11
P(A) = 11/11
X P(Z)
0 2/11
1 6/11
2 3/11
Standard Deviation (𝜎) of a Probability Distribution - is the square root of variance and is used to calculate the
amount of dispersion of the given data set values.
The symbol 𝜎 is read as “sigma”.
Formula to find the standard deviation probability distribution:
σ=√ Σ [ X 2 · P(X )]−μ 2
where,
X – value of the random variable
P( X)– probability of X
Σ - summation symbol
The figure above illustrates a standard normal distribution. The values of the horizontal axis are the values of the random
variable Z, the transformed values of the random variable X.
The values of Z are computed using the formula:
Four-Step Process in Finding the Areas Under the Normal Curve Given a z-Value
1. Express the given z-value in a four-digit form.
2. Using the z-Table, find the first two digits on the left column.
3. Match the third digit with the appropriate column on the right.
4. Read the area (or probability) at the intersection of the row and the column. This is the required area.
Z-score or Standard Score tells how many standard deviations a value is,
away from the mean. A negative z-score tells that the value is below the
mean, while a positive z-score tells that the value is above the mean. A z-
score is unitless, thus, even values of different units could
be compared relative to their groups.
2. We can now construct the frequency distribution of the sample means and its probability. This is the sampling distribution of
the sample means.
3. The histogram of the sampling distribution of the sample means is constructed by making a bar graph where the sample
means are plotted on the horizontal axis and the corresponding probabilities are shown in the vertical axis.
Lesson 4: Finding the Mean, Variance and Standard Deviation of the Sampling Distribution of the Sample Means
Steps in finding the mean, variance and standard deviation of the sampling distribution of the sample means:
Steps Formula
1. Compute the mean of the population μ.
Lesson 5: The Variance and the Standard Deviation of the Sampling of the Sample Means for Finite and Infinite Population
1) If x is normal, x̄ is normal. If x is not normally distributed, x̄ is approximately normal for sufficiently large sample size.
2) The population mean is equal to the mean of the sampling distribution of the sample means (μ = μ x)
2. Confidence Level (Degree of Confidence) - The confidence level of an interval estimate is the probability that the interval
estimate contains the unknown population mean. In determining an interval estimate, a degree of confidence (expressed as
percentage) must be set.
3. Margin of Error (E) Defined as the maximum likely difference between the observed sample mean (x̅) and the true value of
the population mean (μ). The length of the confidence interval is equal to twice the length of margin of error.
Lesson 3: T-Distribution
The t-distribution (aka, Student’s t-distribution) is a probability distribution that is used to estimate parameters when the sample
size is small and/or when the population variance/ standard deviation is unknown.
Properties of t-distribution
• Bell-shaped and symmetric about the mean which is 0 (same as standard normal curve)
• Standard deviation is a bit larger than 1 (slightly thicker tails than standard normal distribution)
• Gets narrower and more closely resembles standard normal distribution as df increases (nearly identical when df > 30)
• Precise shape depends on degrees of freedom
The degrees of freedom, denoted by df, refers to the number of scores in a
distribution that are free to vary without changing the mean of the distribution.
Formula: df = n – 1 (where n is the sample size)